Apache Oozie
The Workflow Scheduler for Hadoop
Price | $24.68 - $40.69
|
Rating | |
Authors | Mohammad Kamrul Islam, Aravind Srinivasan |
Publisher | O'Reilly Media |
Published | 2015 |
Pages | 272 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 1449369928 |
ISBN-13 | 9781449369927 |
Get a solid grounding in Apache Oozie, the workflow scheduler system for managing Hadoop jobs. With this hands-on guide, two experienced Hadoop practitioners walk you through the intricacies of this powerful and flexible platform, with numerous examples and real-world use cases.
Once you set up your Oozie server, you'll dive into techniques for writing and coordinating workflows, and learn how to write complex data pipelines. Advanced topics show you how to handle shared libraries in Oozie, as well as how to implement and manage Oozie's security capabilities.
- Mohammad Kamrul Islam
- Aravind Srinivasan
3 5 16
Similar Books
by JCGs
The Apache HTTP Server, colloquially called Apache, is the world's most used web server software. Originally based on the NCSA HTTPd server, development of Apache began in early 1995 after work on the NCSA code stalled. Apache played a key role in the initial growth of the World Wide Web, quickly overtaking NCSA HTTPd as the dominant...
Free ebook | Publisher: Self-publishing | Release: 2016
by Arun C. Murthy, Vinod Kumar Vavilapalli, Doug Eadline, Joseph Niemiec, Jeff Markham
Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to...
Price: $4.49 | Publisher: Addison-Wesley | Release: 2014
by Rich Bowen, Ken Coar
There's plenty of documentation on installing and configuring the Apache web server, but where do you find help for the day-to-day stuff, like adding common modules or fine-tuning your activity logging? That's easy. The new edition of the Apache Cookbook offers you updated solutions to the problems you're likely to encounte...
Price: $4.79 | Publisher: O'Reilly Media | Release: 2007
Pro Apache Hadoop, 2nd Edition
by Sameer Wadkar, Madhu Siddalingaiah, Jason Venner
Pro Apache Hadoop, Second Edition brings you up to speed on Hadoop - the framework of big data. Revised to cover Hadoop 2.0, the book covers the very latest developments such as YARN (aka MapReduce 2.0), new HDFS high-availability features, and increased scalability in the form of HDFS Federations. All the old content has been revised too...
Price: $22.99 | Publisher: Apress | Release: 2014
Expert Apache Cassandra Administration
by Sam R. Alapati
Follow this handbook to build, configure, tune, and secure Apache Cassandra databases. Start with the installation of Cassandra and move on to the creation of a single instance, and then a cluster of Cassandra databases.Cassandra is increasingly a key player in many big data environments, and this book shows you how to use Cassandra with ...
Price: $35.60 | Publisher: Apress | Release: 2017
by Raymond K. Camden
Apache Cordova in Action teaches you how to design, create, and launch hybrid mobile apps people will want to use. With the help of straightforward, real-world examples, you'll learn to build apps from the Cordova CLI and to make use of native device features like the camera and accelerometer. You'll learn testing techniques and...
Price: $22.09 | Publisher: Manning | Release: 2015
by Rafal Kuc
Learn how to make Apache Solr search faster, more complete, and comprehensively scalable. Solve performance, setup, configuration, analysis, and query problems in no time. Get to grips with, and master, the new exciting features of Apache Solr 4....
Price: $26.99 | Publisher: Packt Publishing | Release: 2013
by Nishant Garg
Message publishing is a mechanism of connecting heterogeneous applications together with messages that are routed between them, for example by using a message broker like Apache Kafka. Such solutions deal with real-time volumes of information and route it to multiple consumers without letting information producers know who the final consu...
Price: $20.99 | Publisher: Packt Publishing | Release: 2013