Enterprise Data Workflows with Cascading

Streamlined Enterprise Data Management and Analysis



Bookstore > Books > Enterprise Data Workflows with Cascading

Price$27.49 - $42.36
Rating
AuthorPaco Nathan
PublisherO'Reilly Media
Published2013
Pages170
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101449358721
ISBN-139781449358723
EBook Hardcover Paperback

There is an easier way to build Hadoop applications. With this hands-on book, you'll learn how to use Cascading, the open source abstraction framework for Hadoop that lets you easily create and manage powerful enterprise-grade data processing applications - without having to learn the intricacies of MapReduce.

Working with sample apps based on Java and other JVM languages, you'll quickly learn Cascading's streamlined approach to data processing, data filtering, and workflow optimization. This book demonstrates how this framework can help your business extract meaningful information from large amounts of distributed data.




5 5 13

Similar Books


Practical Enterprise Data Lake Insights

Practical Enterprise Data Lake Insights

by Saurabh Gupta, Venkata Giri

Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.When designing an enterprise data lake you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data...

Price:  $24.14  |  Publisher:  Apress  |  Release:  2018

Practical Data Science with R, 2nd Edition

Practical Data Science with R, 2nd Edition

by Nina Zumel, John Mount

Practical Data Science with R, Second Edition takes a practice-oriented approach to explaining basic principles in the ever expanding field of data science. You'll jump right to real-world use cases as you apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2019

Data Engineering with Alteryx

Data Engineering with Alteryx

by Paul Houghton

Alteryx is a GUI-based development platform for data analytic applications.Data Engineering with Alteryx will help you leverage Alteryx's code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have.This book will teach you the principles of DataOps and how they can be...

Price:  $44.99  |  Publisher:  Packt Publishing  |  Release:  2022

Productive and Efficient Data Science with Python

Productive and Efficient Data Science with Python

by Tirthajyoti Sarkar

This book focuses on the Python-based tools and techniques to help you become highly productive at all aspects of typical data science stacks such as statistical analysis, visualization, model selection, and feature engineering.You'll review the inefficiencies and bottlenecks lurking in the daily business process and solve them with ...

Price:  $49.99  |  Publisher:  Apress  |  Release:  2022

Data-oriented Development with AngularJS

Data-oriented Development with AngularJS

by Manoj Waikar

AngularJS is one of the most popular JavaScript frameworks used to write single page applications and is suitable for developing large-scale enterprise applications. With Firebase, you can easily store and sync data in real time. It has libraries for all the major web and mobile platforms (including AngularJS) and bindings for the most po...

Price:  $24.99  |  Publisher:  Packt Publishing  |  Release:  2015

EJB 3.0 Database Persistence with Oracle Fusion Middleware 11g

EJB 3.0 Database Persistence with Oracle Fusion Middleware 11g

by Deepak Vohra

EJB (Enterprise JavaBeans) 3.0 is a commonly used database persistence technology in Java EE applications. EJB 3.0 has simplified the development of EJBs with an annotations-based API that eliminates the use of remote/local interfaces, home/local home interfaces, and deployment descriptors. A number of other books are available on EJB 3.0...

Price:  $5.79  |  Publisher:  Packt Publishing  |  Release:  2010

Modern Data Access with Entity Framework Core

Modern Data Access with Entity Framework Core

by Holger Schwichtenberg

C# developers, here's your opportunity to learn the ins-and-outs of Entity Framework Core, Microsoft's recently redesigned object-relational mapper. Benefit from hands-on learning that will teach you how to tackle frustrating database challenges, such as workarounds to missing features in Entity Framework Core, and learn how to ...

Price:  $34.19  |  Publisher:  Apress  |  Release:  2018

Data Analysis with R

Data Analysis with R

by Tony Fischetti

Frequently the tool of choice for academics, R has spread deep into the private sector and can be found in the production pipelines at some of the most advanced and successful enterprises. The power and domain-specificity of R allows the user to express complex analytics easily, quickly, and succinctly. With over 7,000 user contributed pa...

Price:  $54.99  |  Publisher:  Packt Publishing  |  Release:  2015