Hadoop Books



Bookstore > Books > Hadoop

Modern Big Data Processing with Hadoop

Modern Big Data Processing with Hadoop

by Naresh Kumar, Prashant Shindgikar

The complex structure of data these days requires sophisticated solutions for data transformation, to make the information more accessible to the users.This book empowers you to build such solutions with relative ease with the help of Apache Hadoop, along with a host of other Big Data tools.This book will give you a complete understanding of the data lifecycle management with Hadoop, followed by modeling of...

Price:  $31.99  |  Publisher:  Packt Publishing  |  Release:  2018

Apache Hadoop 3 Quick Start Guide

Apache Hadoop 3 Quick Start Guide

by Hrishikesh Karambelkar

Apache Hadoop is a widely used distributed data platform. It enables large datasets to be efficiently processed instead of using one large computer to store and process the data. This book will get you started with the Hadoop ecosystem, and introduce you to the main technical topics, including MapReduce, YARN, and HDFS.The book begins with an overview of big data and Apache Hadoop. Then, you will set up a p...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

PySpark Recipes

PySpark Recipes

by Raju Kumar Mishra

Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved!PySpark Recipes covers Hadoop and its shortcomings. The architecture of Spark, PySpark, and RDD are presented. You will ...

Price:  $33.31  |  Publisher:  Apress  |  Release:  2018

Advanced Data Analytics Using Python

Advanced Data Analytics Using Python

by Sayan Mukhopadhyay

Gain a broad foundation of advanced data analytics concepts and discover the recent revolution in databases such as Neo4j, Elasticsearch, and MongoDB. This book discusses how to implement ETL techniques including topical crawling, which is applied in domains such as high-frequency algorithmic trading and goal-oriented dialog systems. You'll also see examples of machine learning concepts such as semi-supervi...

Price:  $34.82  |  Publisher:  Apress  |  Release:  2018

Practical Enterprise Data Lake Insights

Practical Enterprise Data Lake Insights

by Saurabh Gupta, Venkata Giri

Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.When designing an enterprise data lake you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data. Starting from sourcing data into the Hadoop ecosystem, you will go t...

Price:  $24.14  |  Publisher:  Apress  |  Release:  2018

Beginning Apache Spark 2

Beginning Apache Spark 2

by Hien Luu

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it.Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data;...

Price:  $25.33  |  Publisher:  Apress  |  Release:  2018

Block Trace Analysis and Storage System Optimization

Block Trace Analysis and Storage System Optimization

by Jun Xu

Understand the fundamental factors of data storage system performance and master an essential analytical skill using block trace via applications such as MATLAB and Python tools. You will increase your productivity and learn the best techniques for doing specific tasks (such as analyzing the IO pattern in a quantitative way, identifying the storage system bottleneck, and designing the cache policy).In the n...

Price:  $30.00  |  Publisher:  Apress  |  Release:  2018

Getting Started with Kudu

Getting Started with Kudu

by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart

Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator - either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data....

Price:  $40.44  |  Publisher:  O'Reilly Media  |  Release:  2018

Learning Apache Drill

Learning Apache Drill

by Paul Rogers, Charles Givre

...

Price:  $53.50  |  Publisher:  O'Reilly Media  |  Release:  2018

Big Data Architect's Handbook

Big Data Architect's Handbook

by Syed Muhammad Fahad Akhtar

The big data architects are the “masters” of data, and hold high value in today's market. Handling big data, be it of good or bad quality, is not an easy task. The prime job for any big data architect is to build an end-to-end big data solution that integrates data from different sources and analyzes it to find useful, hidden insights.Big Data Architect's Handbook takes you through developing a complete...

Price:  $54.99  |  Publisher:  Packt Publishing  |  Release:  2018

Pages: 1, 2, 3 ... 9 | Next→

Subscribe to Newsletter

Be the first to know about new IT books, upcoming releases, exclusive offers and more.