by Scott Shaw, Andreas Francois Vermeulen, Ankur Gupta, David Kjerrumgaard
Dive into the world of SQL on Hadoop and get the most out of your Hive data warehouses. This book is your go-to resource for using Hive: authors Scott Shaw, Ankur Gupta, David Kjerrumgaard, and Andreas Francois Vermeulen take you through learning HiveQL, the SQL-like language specific to Hive, to analyze, export, and massage the data stored across your Hadoop environment. From deploying Hive on your hardwar...
Price: $35.66 | Publisher: Apress | Release: 2016
by Dayong Du
In this book, we prepare you for your journey into big data by firstly introducing you to backgrounds in the big data domain along with the process of setting up and getting familiar with your Hive working environment. Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skill in using the Hive language in an efficient manner. To...
Price: $39.99 | Publisher: Packt Publishing | Release: 2015
by Matei Zaharia, Holden Karau, Andy Konwinski, Patrick Wendell
Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java, and Scala. This edition includes new information on Spark SQL, Spark Streaming, set...
Price: $32.23 | Publisher: O'Reilly Media | Release: 2015
by Deepak Vohra
In this fast-paced book on the Docker open standards platform for developing, packaging and running portable distributed applications, author Deepak Vohra discusses how to build, ship and run applications on any platform such as a PC, the cloud, data center or a virtual machine. He describes how to install Docker images and create Docker containers, and the advantages of Docker containers.The remainder of t...
Price: $44.99 | Publisher: Apress | Release: 2015
Hadoop MapReduce v2 Cookbook, 2nd Edition
by Thilina Gunarathne
Starting with installing Hadoop YARN, MapReduce, HDFS, and other Hadoop ecosystem components, with this book, you will soon learn about many exciting topics such as MapReduce patterns, using Hadoop to solve analytics, classifications, online marketing, recommendations, and data indexing and searching. You will learn how to take advantage of Hadoop ecosystem projects including Hive, HBase, Pig, Mahout, Nutch...
Price: $39.08 | Publisher: Packt Publishing | Release: 2015
by Bahaaldine Azarmi
Talend, a successful Open Source Data Integration Solution, accelerates the adoption of new big data technologies and efficiently integrates them into your existing IT infrastructure. It is able to do this because of its intuitive graphical language, its multiple connectors to the Hadoop ecosystem, and its array of tools for data integration, quality, management, and governance.This is a concise, pragmatic ...
Price: $20.99 | Publisher: Packt Publishing | Release: 2014
by Sandeep Karanth
Hadoop is synonymous with Big Data processing. Its simple programming model, "code once and deploy at any scale" paradigm, and an ever-growing ecosystem makes Hadoop an all-encompassing platform for programmers with different levels of expertise.This book explores the industry guidelines to optimize MapReduce jobs and higher-level abstractions such as Pig and Hive in Hadoop 2.0. Then, it d...
Price: $49.99 | Publisher: Packt Publishing | Release: 2014
Hadoop Real-World Solutions Cookbook
by Jonathan R. Owens, Brian Femiano, Jon Lentz
Helping developers become more comfortable and proficient with solving problems in the Hadoop space. People will become more familiar with a wide variety of Hadoop related tools and best practices for implementation.Hadoop Real-World Solutions Cookbook will teach readers how to build solutions using tools such as Apache Hive, Pig, MapReduce, Mahout, Giraph, HDFS, Accumulo, Redis, and Ganglia.Hadoop Real-Wor...
Price: $29.99 | Publisher: Packt Publishing | Release: 2013
by Edward Capriolo, Dean Wampler, Jason Rutherglen
Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop's data warehouse infrastructure. You'll quickly learn how to use Hive's SQL dialect - HiveQL - to summarize, query, and analyze large datasets stored in Hadoop's distributed filesystem.This example-driven guide shows you how to set up and configure Hive in your environ...
Price: $24.98 | Publisher: O'Reilly Media | Release: 2012
Hadoop: The Definitive Guide, 2nd Edition
by Tom White
Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework - an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoo...
Price: $4.22 | Publisher: O'Reilly Media | Release: 2010