Fast Data Processing with Spark, 2nd Edition

Perform real-time analytics using Spark in a fast, distributed, and scalable way



Bookstore > Books > Fast Data Processing with Spark, 2nd Edition

Price$29.99 - $38.33
Rating
AuthorsKrishna Sankar, Holden Karau
PublisherPackt Publishing
Published2015
Pages184
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-10178439257X
ISBN-139781784392574
EBook Hardcover Paperback

Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be interactively used to quickly process and query big datasets.

Fast Data Processing with Spark - Second Edition covers how to write distributed programs with Spark. The book will guide you through every step required to write effective distributed programs from setting up your cluster and interactively exploring the API to developing analytics applications and tuning them for your purposes.


  1. (5 books)


3 5 29

Similar Books


Practical Data Science with R, 2nd Edition

Practical Data Science with R, 2nd Edition

by Nina Zumel, John Mount

Practical Data Science with R, Second Edition takes a practice-oriented approach to explaining basic principles in the ever expanding field of data science. You'll jump right to real-world use cases as you apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2019

Fast Data Processing with Spark

Fast Data Processing with Spark

by Holden Karau

Spark is a framework for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and inbuilt tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and ...

Price:  $22.99  |  Publisher:  Packt Publishing  |  Release:  2013

Angular Development with Typescript, 2nd Edition

Angular Development with Typescript, 2nd Edition

by Yakov Fain, Anton Moiseev

Angular Development with TypeScript, 2nd Edition is an intermediate-level tutorial that introduces Angular and TypeScript to developers comfortable with building web applications using other frameworks and tools.Whether you're building lightweight web clients or full-featured SPAs, Angular is a clear choice. The Angular framework is ...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2018

Java Persistence with Hibernate, 2nd Edition

Java Persistence with Hibernate, 2nd Edition

by Christian Bauer, Gavin King, Gary Gregory

Java Persistence with Hibernate, 2nd Edition explores Hibernate by developing an application that ties together hundreds of individual examples. You'll immediately dig into the rich programming model of Hibernate, working through mappings, queries, fetching strategies, transactions, conversations, caching, and more. Along the way you...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2015

Big Data Processing with Apache Spark

Big Data Processing with Apache Spark

by Manuel Ignacio Franco Galeano

Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streamin...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Spring Persistence with Hibernate, 2nd Edition

Spring Persistence with Hibernate, 2nd Edition

by Brian D. Murphy, Paul Fisher

Learn how to use the core Hibernate APIs and tools as part of the Spring Framework. This book illustrates how these two frameworks can be best utilized. Other persistence solutions available in Spring are also shown including the Java Persistence API (JPA).Spring Persistence with Hibernate, Second Edition has been updated to cover Spring ...

Price:  $33.19  |  Publisher:  Apress  |  Release:  2016

Advanced Analytics with Spark, 2nd Edition

Advanced Analytics with Spark, 2nd Edition

by Sandy Ryza, Uri Laserson, Josh Wills, Sean Owen

In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this ed...

Price:  $29.85  |  Publisher:  O'Reilly Media  |  Release:  2017

Natural Language Processing with TensorFlow, 2nd Edition

Natural Language Processing with TensorFlow, 2nd Edition

by Thushan Ganegedara

Learning how to solve natural language processing (NLP) problems is an important skill to master due to the explosive growth of data combined with the demand for machine learning solutions in production. Natural Language Processing with TensorFlow, Second Edition, will teach you how to solve common real-world NLP problems with a variety o...

Price:  $37.99  |  Publisher:  Packt Publishing  |  Release:  2022