Fast Data Processing with Spark, 2nd Edition
Perform real-time analytics using Spark in a fast, distributed, and scalable way
Price | $29.99 - $38.33
|
Rating | |
Authors | Krishna Sankar, Holden Karau |
Publisher | Packt Publishing |
Published | 2015 |
Pages | 184 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 178439257X |
ISBN-13 | 9781784392574 |
Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be interactively used to quickly process and query big datasets.
Fast Data Processing with Spark - Second Edition covers how to write distributed programs with Spark. The book will guide you through every step required to write effective distributed programs from setting up your cluster and interactively exploring the API to developing analytics applications and tuning them for your purposes.
- Krishna Sankar
- Holden Karau (5 books)
3 5 29
Similar Books
Practical Data Science with R, 2nd Edition
by Nina Zumel, John Mount
Practical Data Science with R, Second Edition takes a practice-oriented approach to explaining basic principles in the ever expanding field of data science. You'll jump right to real-world use cases as you apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business...
Price: $39.99 | Publisher: Manning | Release: 2019
Fast Data Processing with Spark
by Holden Karau
Spark is a framework for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and inbuilt tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and ...
Price: $22.99 | Publisher: Packt Publishing | Release: 2013
Angular Development with Typescript, 2nd Edition
by Yakov Fain, Anton Moiseev
Angular Development with TypeScript, 2nd Edition is an intermediate-level tutorial that introduces Angular and TypeScript to developers comfortable with building web applications using other frameworks and tools.Whether you're building lightweight web clients or full-featured SPAs, Angular is a clear choice. The Angular framework is ...
Price: $39.99 | Publisher: Manning | Release: 2018
Java Persistence with Hibernate, 2nd Edition
by Christian Bauer, Gavin King, Gary Gregory
Java Persistence with Hibernate, 2nd Edition explores Hibernate by developing an application that ties together hundreds of individual examples. You'll immediately dig into the rich programming model of Hibernate, working through mappings, queries, fetching strategies, transactions, conversations, caching, and more. Along the way you...
Price: $39.99 | Publisher: Manning | Release: 2015
Big Data Processing with Apache Spark
by Manuel Ignacio Franco Galeano
Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streamin...
Price: $29.99 | Publisher: Packt Publishing | Release: 2018
Spring Persistence with Hibernate, 2nd Edition
by Brian D. Murphy, Paul Fisher
Learn how to use the core Hibernate APIs and tools as part of the Spring Framework. This book illustrates how these two frameworks can be best utilized. Other persistence solutions available in Spring are also shown including the Java Persistence API (JPA).Spring Persistence with Hibernate, Second Edition has been updated to cover Spring ...
Price: $33.19 | Publisher: Apress | Release: 2016
Advanced Analytics with Spark, 2nd Edition
by Sandy Ryza, Uri Laserson, Josh Wills, Sean Owen
In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this ed...
Price: $29.85 | Publisher: O'Reilly Media | Release: 2017
Natural Language Processing with TensorFlow, 2nd Edition
by Thushan Ganegedara
Learning how to solve natural language processing (NLP) problems is an important skill to master due to the explosive growth of data combined with the demand for machine learning solutions in production. Natural Language Processing with TensorFlow, Second Edition, will teach you how to solve common real-world NLP problems with a variety o...
Price: $37.99 | Publisher: Packt Publishing | Release: 2022