Big Data Books



Bookstore > Books > Big Data

Big Data Processing with Apache Spark

Big Data Processing with Apache Spark

by Manuel Ignacio Franco Galeano

Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API, machine learning extension, and structured streaming.You'll begin by...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Complete Guide to Open Source Big Data Stack

Complete Guide to Open Source Big Data Stack

by Mike Frampton

See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together.In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he uses each chapter to introduce one ...

Price:  $37.75  |  Publisher:  Apress  |  Release:  2018

Next-Generation Big Data

Next-Generation Big Data

by Butch Quinto

Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies.Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used fo...

Price:  $33.51  |  Publisher:  Apress  |  Release:  2018

Veracity of Big Data

Veracity of Big Data

by Vishnu Pendyala

Examine the problem of maintaining the quality of big data and discover novel solutions. You will learn the four V's of big data, including veracity, and study the problem from various angles. The solutions discussed are drawn from diverse areas of engineering and math, including machine learning, statistics, formal methods, and the Blockchain technology. Veracity of Big Data serves as an introduction to ma...

Price:  $29.99  |  Publisher:  Apress  |  Release:  2018

Spark: The Definitive Guide

Spark: The Definitive Guide

by Matei Zaharia, Bill Chambers

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals.You'll explore the basic operations and common functions of Spark's structured A...

Price:  $27.99  |  Publisher:  O'Reilly Media  |  Release:  2018

Hands-On Big Data Modeling

Hands-On Big Data Modeling

by James Lee, Tao Wei, Suresh Kumar Mukhiya

Modeling and managing data is a central focus of all big data projects. In fact, a database is considered to be effective only if you have a logical and sophisticated data model. This book will help you develop practical skills in modeling your own big data projects and improve the performance of analytical queries for your specific business requirements.To start with, you'll get a quick introduction to big...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2018

Apache Hadoop 3 Quick Start Guide

Apache Hadoop 3 Quick Start Guide

by Hrishikesh Karambelkar

Apache Hadoop is a widely used distributed data platform. It enables large datasets to be efficiently processed instead of using one large computer to store and process the data. This book will get you started with the Hadoop ecosystem, and introduce you to the main technical topics, including MapReduce, YARN, and HDFS.The book begins with an overview of big data and Apache Hadoop. Then, you will set up a p...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Machine Learning with Apache Spark Quick Start Guide

Machine Learning with Apache Spark Quick Start Guide

by Jillur Quddus

Every person and every organization in the world manages data, whether they realize it or not. Data is used to describe the world around us and can be used for almost any purpose, from analyzing consumer habits to fighting disease and serious organized crime. Ultimately, we manage data in order to derive value from it, and many organizations around the world have traditionally invested in technology to help...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Apache Spark 2: Data Processing and Real-Time Analytics

Apache Spark 2: Data Processing and Real-Time Analytics

by Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei

Apache Spark is an in-memory, cluster-based data processing system that provides a wide range of functionalities such as big data processing, analytics, machine learning, and more. With this Learning Path, you can take your knowledge of Apache Spark to the next level by learning how to expand Spark's functionality and building your own data flow and machine learning programs on this platform.You will work w...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

PySpark Recipes

PySpark Recipes

by Raju Kumar Mishra

Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved!PySpark Recipes covers Hadoop and its shortcomings. The architecture of Spark, PySpark, and RDD are presented. You will ...

Price:  $35.10  |  Publisher:  Apress  |  Release:  2018

Pages: ←Previous | 1, 2, 3, 4 ... 16 | Next→

Subscribe to Newsletter

Be the first to know about new IT books, upcoming releases, exclusive offers and more.