by Michael Brzustowicz
Data Science is booming thanks to R and Python, but Java brings the robustness, convenience, and ability to scale critical to today's data science applications. With this practical book, Java software engineers looking to add data science skills will take a logical journey through the data science pipeline. Author Michael Brzustowicz explains the basic math theory behind each step of the data science p...
Price: $27.33 | Publisher: O'Reilly Media | Release: 2017
by Bhushan Lakhe
Re-architect relational applications to NoSQL, integrate relational database management systems with the Hadoop ecosystem, and transform and migrate relational data to and from Hadoop components. This book covers the best-practice design approaches to re-architecting your relational applications and transforming your relational data for usage with the Hadoop ecosystem while considering concurrency, security...
Price: $44.99 | Publisher: Apress | Release: 2016
by Benoy Antony, Konstantin Boudnik, Cheryl Adams, Branky Shao, Cazen Lee, Kai Sasaki
Professional Hadoop is the complete reference and resource for experienced developers looking to employ Apache Hadoop in real-world settings. Written by an expert team of certified Hadoop developers, committers, and Summit speakers, this book details every key aspect of Hadoop technology to enable optimal processing of large data sets. Designed expressly for the professional developer, this book skips over ...
Price: $34.32 | Publisher: Wrox | Release: 2016
by Scott Shaw, Andreas Francois Vermeulen, Ankur Gupta, David Kjerrumgaard
Dive into the world of SQL on Hadoop and get the most out of your Hive data warehouses. This book is your go-to resource for using Hive: authors Scott Shaw, Ankur Gupta, David Kjerrumgaard, and Andreas Francois Vermeulen take you through learning HiveQL, the SQL-like language specific to Hive, to analyze, export, and massage the data stored across your Hadoop environment. From deploying Hive on your hardwar...
Price: $35.66 | Publisher: Apress | Release: 2016
Scalable Big Data Architecture
by Bahaaldine Azarmi
This book highlights the different types of data architecture and illustrates the many possibilities hidden behind the term "Big Data", from the usage of No-SQL databases to the deployment of stream analytics architecture, machine learning, and governance. Scalable Big Data Architecture covers real-world, concrete industry use cases that leverage complex distributed applications, which inv...
Price: $23.50 | Publisher: Apress | Release: 2016
Kubernetes Microservices with Docker
by Deepak Vohra
This book on Kubernetes, a container cluster manager, discusses all aspects of using Kubernetes in today's complex big data and enterprise applications, with Docker containers. Starting with installing Kubernetes on a single node, Kubernetes Microservices with Docker introduces Kubernetes with a simple Hello example and discusses using environment variables in Kubernetes. Next, the book discusses using K...
Price: $49.99 | Publisher: Apress | Release: 2016
Cassandra: The Definitive Guide, 2nd Edition
by Eben Hewitt, Jeff Carpenter
Imagine what you could do if scalability wasn't a problem. With this hands-on guide, you'll learn how the Cassandra database management system handles hundreds of terabytes of data while remaining highly available across multiple data centers. This expanded second edition - updated for Cassandra 3.0 - provides the technical details and practical examples you need to put this database to work in a ...
Price: $28.24 | Publisher: O'Reilly Media | Release: 2016
by Cyrus Dasadia, Amol Nayak
MongoDB is a high-performance and feature-rich NoSQL database that forms the backbone of the systems that power many different organizations - it's easy to see why it's the most popular NoSQL database on the market. Packed with many features that have become essential for many different types of software professionals and incredibly easy to use, this cookbook contains many solutions to the everyda...
Price: $44.99 | Publisher: Packt Publishing | Release: 2016
by Kevin Sitto, Marshall Presser
If your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. You'll quickly understand how Hadoop's projects, subprojects, and relate...
Price: $26.96 | Publisher: O'Reilly Media | Release: 2015
Hadoop MapReduce v2 Cookbook, 2nd Edition
by Thilina Gunarathne
Starting with installing Hadoop YARN, MapReduce, HDFS, and other Hadoop ecosystem components, with this book, you will soon learn about many exciting topics such as MapReduce patterns, using Hadoop to solve analytics, classifications, online marketing, recommendations, and data indexing and searching. You will learn how to take advantage of Hadoop ecosystem projects including Hive, HBase, Pig, Mahout, Nutch...
Price: $39.08 | Publisher: Packt Publishing | Release: 2015