Big Data Analytics with Spark

A Practitioner's Guide to Using Spark for Large Scale Data Analysis



Bookstore > Books > Big Data Analytics with Spark

Price$29.99 - $39.99
Rating
AuthorMohammed Guller
PublisherApress
Published2016
Pages504
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101484209656
ISBN-139781484209653
EBook Hardcover Paperback

This book is a step-by-step guide for learning how to use Spark for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. It covers Spark core and its add-on libraries, including Spark SQL, Spark Streaming, GraphX, MLlib, and Spark ML.

Big Data Analytics with Spark shows you how to use Spark and leverage its easy-to-use features to increase your productivity. You learn to perform fast data analysis using its in-memory caching and advanced execution engine, employ in-memory computing capabilities for building high-performance machine learning and low-latency interactive analytics applications, and much more. Moreover, the book shows you how to use Spark as a single integrated platform for a variety of data processing tasks, including ETL pipelines, BI, live data stream processing, graph analytics, and machine learning.

The book also includes a chapter on Scala, the hottest functional programming language, and the language that underlies Spark. You'll learn the basics of functional programming in Scala, so that you can write Spark applications in it.




5 5 9

Similar Books


Big Data Analytics with R and Hadoop

Big Data Analytics with R and Hadoop

by Vignesh Prajapati

Big data analytics is the process of examining large amounts of data of a variety of types to uncover hidden patterns, unknown correlations, and other useful information. Such information can provide competitive advantages over rival organizations and result in business benefits, such as more effective marketing and increased revenue. New...

Price:  $5.77  |  Publisher:  Packt Publishing  |  Release:  2013

Big Data Processing with Apache Spark

Big Data Processing with Apache Spark

by Manuel Ignacio Franco Galeano

Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streamin...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Fast Data Processing with Spark, 2nd Edition

Fast Data Processing with Spark, 2nd Edition

by Krishna Sankar, Holden Karau

Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (G...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2015

Fast Data Processing with Spark

Fast Data Processing with Spark

by Holden Karau

Spark is a framework for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and inbuilt tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and ...

Price:  $22.99  |  Publisher:  Packt Publishing  |  Release:  2013

Scaling Big Data with Hadoop and Solr

Scaling Big Data with Hadoop and Solr

by Hrishikesh Vijay Karambelkar

As data grows exponentially day-by-day, extracting information becomes a tedious activity in itself. Technologies like Hadoop are trying to address some of the concerns, while Solr provides high-speed faceted search. Bringing these two technologies together is helping organizations resolve the problem of information extraction from Big Da...

Price:  $26.99  |  Publisher:  Packt Publishing  |  Release:  2013

Modern Big Data Processing with Hadoop

Modern Big Data Processing with Hadoop

by Naresh Kumar, Prashant Shindgikar

The complex structure of data these days requires sophisticated solutions for data transformation, to make the information more accessible to the users.This book empowers you to build such solutions with relative ease with the help of Apache Hadoop, along with a host of other Big Data tools.This book will give you a complete understanding...

Price:  $50.55  |  Publisher:  Packt Publishing  |  Release:  2018

Getting Started with Greenplum for Big Data Analytics

Getting Started with Greenplum for Big Data Analytics

by Sunila Gollapudi

Organizations are leveraging the use of data and analytics to gain a competitive advantage over their opposition. Therefore, organizations are quickly becoming more and more data driven. With the advent of Big Data, existing Data Warehousing and Business Intelligence solutions are becoming obsolete, and a requisite for new agile platforms...

Price:  $23.99  |  Publisher:  Packt Publishing  |  Release:  2013

Advanced Analytics with Spark

Advanced Analytics with Spark

by Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills

In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example.You'll start with an introduction to Spark and i...

Price:  $20.00  |  Publisher:  O'Reilly Media  |  Release:  2015