Apache Oozie

The Workflow Scheduler for Hadoop



Bookstore > Books > Apache Oozie

Price$24.68 - $40.69
Rating
AuthorsMohammad Kamrul Islam, Aravind Srinivasan
PublisherO'Reilly Media
Published2015
Pages272
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101449369928
ISBN-139781449369927
EBook Hardcover Paperback

Get a solid grounding in Apache Oozie, the workflow scheduler system for managing Hadoop jobs. With this hands-on guide, two experienced Hadoop practitioners walk you through the intricacies of this powerful and flexible platform, with numerous examples and real-world use cases.

Once you set up your Oozie server, you'll dive into techniques for writing and coordinating workflows, and learn how to write complex data pipelines. Advanced topics show you how to handle shared libraries in Oozie, as well as how to implement and manage Oozie's security capabilities.




3 5 16

Similar Books


Apache HTTP Server Cookbook

Apache HTTP Server Cookbook

by JCGs

The Apache HTTP Server, colloquially called Apache, is the world's most used web server software. Originally based on the NCSA HTTPd server, development of Apache began in early 1995 after work on the NCSA code stalled. Apache played a key role in the initial growth of the World Wide Web, quickly overtaking NCSA HTTPd as the dominant...

Free ebook  |  Publisher:  Self-publishing  |  Release:  2016

Apache Hadoop YARN

Apache Hadoop YARN

by Arun C. Murthy, Vinod Kumar Vavilapalli, Doug Eadline, Joseph Niemiec, Jeff Markham

Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to...

Price:  $4.49  |  Publisher:  Addison-Wesley  |  Release:  2014

Apache Cookbook, 2nd Edition

Apache Cookbook, 2nd Edition

by Rich Bowen, Ken Coar

There's plenty of documentation on installing and configuring the Apache web server, but where do you find help for the day-to-day stuff, like adding common modules or fine-tuning your activity logging? That's easy. The new edition of the Apache Cookbook offers you updated solutions to the problems you're likely to encounte...

Price:  $4.79  |  Publisher:  O'Reilly Media  |  Release:  2007

Pro Apache Hadoop, 2nd Edition

Pro Apache Hadoop, 2nd Edition

by Sameer Wadkar, Madhu Siddalingaiah, Jason Venner

Pro Apache Hadoop, Second Edition brings you up to speed on Hadoop - the framework of big data. Revised to cover Hadoop 2.0, the book covers the very latest developments such as YARN (aka MapReduce 2.0), new HDFS high-availability features, and increased scalability in the form of HDFS Federations. All the old content has been revised too...

Price:  $22.99  |  Publisher:  Apress  |  Release:  2014

Expert Apache Cassandra Administration

Expert Apache Cassandra Administration

by Sam R. Alapati

Follow this handbook to build, configure, tune, and secure Apache Cassandra databases. Start with the installation of Cassandra and move on to the creation of a single instance, and then a cluster of Cassandra databases.Cassandra is increasingly a key player in many big data environments, and this book shows you how to use Cassandra with ...

Price:  $35.60  |  Publisher:  Apress  |  Release:  2017

Apache Cordova in Action

Apache Cordova in Action

by Raymond K. Camden

Apache Cordova in Action teaches you how to design, create, and launch hybrid mobile apps people will want to use. With the help of straightforward, real-world examples, you'll learn to build apps from the Cordova CLI and to make use of native device features like the camera and accelerometer. You'll learn testing techniques and...

Price:  $22.09  |  Publisher:  Manning  |  Release:  2015

Apache Solr 4 Cookbook

Apache Solr 4 Cookbook

by Rafal Kuc

Learn how to make Apache Solr search faster, more complete, and comprehensively scalable. Solve performance, setup, configuration, analysis, and query problems in no time. Get to grips with, and master, the new exciting features of Apache Solr 4....

Price:  $26.99  |  Publisher:  Packt Publishing  |  Release:  2013

Apache Kafka

Apache Kafka

by Nishant Garg

Message publishing is a mechanism of connecting heterogeneous applications together with messages that are routed between them, for example by using a message broker like Apache Kafka. Such solutions deal with real-time volumes of information and route it to multiple consumers without letting information producers know who the final consu...

Price:  $20.99  |  Publisher:  Packt Publishing  |  Release:  2013