FREE EBOOK - Mastering AWS Development
by Uchit Vyas
This book is a practical guide to developing, administering, and managing applications and infrastructures with AWS. With this, you'll be able to create, design, and manage an entire application life cycle on AWS by using the AWS SDKs, APIs, and the AWS Management Console.You'll start with the basics of the AWS development platform and look into creating stable and scalable infrastructures using E...
Price: $49.99 | Publisher: Packt Publishing | Release: 2015
by Dharmesh Kakadia
Apache Mesos is a cluster manager that provides efficient resource isolation and sharing across distributed applications, or frameworks. It allows developers to concurrently run the likes of Hadoop, Spark, Storm, and other applications on a dynamically shared pool of nodes. With Mesos, you have the power to manage a wide range of resources in a multi-tenant environment.Starting with the basics, this book wi...
Price: $39.99 | Publisher: Packt Publishing | Release: 2015
by Chandramani Tiwary
In the past few years the generation of data and our capability to store and process it has grown exponentially. There is a need for scalable analytics frameworks and people with the right skills to get the information needed from this Big Data. Apache Mahout is one of the first and most prominent Big Data machine learning platforms. It implements machine learning algorithms on top of distributed processing...
Price: $44.99 | Publisher: Packt Publishing | Release: 2015
Fast Data Processing with Spark, 2nd Edition
by Krishna Sankar, Holden Karau
Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be interactiv...
Price: $29.99 | Publisher: Packt Publishing | Release: 2015
Learning Apache Kafka, 2nd Edition
by Nishant Garg
Kafka is one of those systems that is very simple to describe at a high level but has an incredible depth of technical detail when you dig deeper.Learning Apache Kafka Second Edition provides you with step-by-step, practical examples that help you take advantage of the real power of Kafka and handle hundreds of megabytes of messages per second from multiple clients. This book teaches you everything you need...
Price: $13.07 | Publisher: Packt Publishing | Release: 2015
by Amit Nandi
Looking for a cluster computing system that provides high-level APIs? Apache Spark is your answer - an open source, fast, and general purpose cluster computing system. Spark's multi-stage memory primitives provide performance up to 100 times faster than Hadoop, and it is also well-suited for machine learning algorithms.Are you a Python developer inclined to work with Spark engine? If so, this book will...
Price: $39.99 | Publisher: Packt Publishing | Release: 2015
by Naoya Hashimoto
Amazon S3 is one of the most famous and trailblazing cloud object storage services, which is highly scalable, low-latency, and economical. Users only pay for what they use and can store and retrieve any amount of data at any time over the Internet, which attracts Hadoop users who run clusters on EC2.The book starts by showing you how to install several AWS SDKs such as iOS, Java, Node.js, PHP, Python, and R...
Price: $49.99 | Publisher: Packt Publishing | Release: 2015
by Arun C. Murthy, Vinod Kumar Vavilapalli, Doug Eadline, Joseph Niemiec, Jeff Markham
Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage th...
Price: $4.49 | Publisher: Addison-Wesley | Release: 2014
Pro Apache Hadoop, 2nd Edition
by Sameer Wadkar, Madhu Siddalingaiah, Jason Venner
Pro Apache Hadoop, Second Edition brings you up to speed on Hadoop - the framework of big data. Revised to cover Hadoop 2.0, the book covers the very latest developments such as YARN (aka MapReduce 2.0), new HDFS high-availability features, and increased scalability in the form of HDFS Federations. All the old content has been revised too, giving the latest on the ins and outs of MapReduce, cluster design, ...
Price: $22.99 | Publisher: Apress | Release: 2014
by Bhushan Lakhe
clusters. A detailed guide to the security options and configuration within Hadoop itself, author Bhushan Lakhe takes you through a comprehensive study of how to implement defined security within a Hadoop cluster in a hands-on way.You will start with a detailed overview of all the security options available for Hadoop, including popular extensions like Kerberos and OpenSSH, and then delve into a hands-on im...
Price: $46.98 | Publisher: Apress | Release: 2014