Enterprise Data Workflows with Cascading

Streamlined Enterprise Data Management and Analysis



Bookstore > Books > Enterprise Data Workflows with Cascading

Price$27.49 - $42.36
Rating
AuthorPaco Nathan
PublisherO'Reilly Media
Published2013
Pages170
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101449358721
ISBN-139781449358723
EBook Hardcover Paperback

There is an easier way to build Hadoop applications. With this hands-on book, you'll learn how to use Cascading, the open source abstraction framework for Hadoop that lets you easily create and manage powerful enterprise-grade data processing applications - without having to learn the intricacies of MapReduce.

Working with sample apps based on Java and other JVM languages, you'll quickly learn Cascading's streamlined approach to data processing, data filtering, and workflow optimization. This book demonstrates how this framework can help your business extract meaningful information from large amounts of distributed data.





4 5 13

Similar Books


Practical Enterprise Data Lake Insights

Practical Enterprise Data Lake Insights

by Saurabh Gupta, Venkata Giri

Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.When designing an enterprise data lake you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data...

Price:  $24.14  |  Publisher:  Apress  |  Release:  2018

Data-oriented Development with AngularJS

Data-oriented Development with AngularJS

by Manoj Waikar

AngularJS is one of the most popular JavaScript frameworks used to write single page applications and is suitable for developing large-scale enterprise applications. With Firebase, you can easily store and sync data in real time. It has libraries for all the major web and mobile platforms (including AngularJS) and bindings for the most po...

Price:  $19.99  |  Publisher:  Packt Publishing  |  Release:  2015

EJB 3.0 Database Persistence with Oracle Fusion Middleware 11g

EJB 3.0 Database Persistence with Oracle Fusion Middleware 11g

by Deepak Vohra

EJB (Enterprise JavaBeans) 3.0 is a commonly used database persistence technology in Java EE applications. EJB 3.0 has simplified the development of EJBs with an annotations-based API that eliminates the use of remote/local interfaces, home/local home interfaces, and deployment descriptors. A number of other books are available on EJB 3.0...

Price:  $5.50  |  Publisher:  Packt Publishing  |  Release:  2010

Modern Data Access with Entity Framework Core

Modern Data Access with Entity Framework Core

by Holger Schwichtenberg

C# developers, here's your opportunity to learn the ins-and-outs of Entity Framework Core, Microsoft's recently redesigned object-relational mapper. Benefit from hands-on learning that will teach you how to tackle frustrating database challenges, such as workarounds to missing features in Entity Framework Core, and learn how to optimize t...

Price:  $34.19  |  Publisher:  Apress  |  Release:  2018

Data Analysis with R

Data Analysis with R

by Tony Fischetti

Frequently the tool of choice for academics, R has spread deep into the private sector and can be found in the production pipelines at some of the most advanced and successful enterprises. The power and domain-specificity of R allows the user to express complex analytics easily, quickly, and succinctly. With over 7,000 user contributed pa...

Price:  $43.99  |  Publisher:  Packt Publishing  |  Release:  2015

Data Science with SQL Server Quick Start Guide

Data Science with SQL Server Quick Start Guide

by Dejan Sarka

SQL Server only started to fully support data science with its two most recent editions. If you are a professional from both worlds, SQL Server and data science, and interested in using SQL Server and Machine Learning (ML) Services for your projects, then this is the ideal book for you.This book is the ideal introduction to data science w...

Price:  $34.99  |  Publisher:  Packt Publishing  |  Release:  2018

Big Data Processing with Apache Spark

Big Data Processing with Apache Spark

by Manuel Ignacio Franco Galeano

Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Big Data Analytics with Spark

Big Data Analytics with Spark

by Mohammed Guller

This book is a step-by-step guide for learning how to use Spark for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. It covers Spark core and its add-on libraries, including Spark SQL, Spark Streaming, GraphX, MLlib, and Spark ML.Big Data Analytics w...

Price:  $25.00  |  Publisher:  Apress  |  Release:  2016