Fast Data Processing with Spark, 2nd Edition
Perform real-time analytics using Spark in a fast, distributed, and scalable way
Price | $29.99 - $38.33
|
Rating | ![]() ![]() ![]() ![]() ![]() |
Authors | Krishna Sankar, Holden Karau |
Publisher | Packt Publishing |
Published | 2015 |
Pages | 184 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 178439257X |
ISBN-13 | 9781784392574 |
Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be interactively used to quickly process and query big datasets.
Fast Data Processing with Spark - Second Edition covers how to write distributed programs with Spark. The book will guide you through every step required to write effective distributed programs from setting up your cluster and interactively exploring the API to developing analytics applications and tuning them for your purposes.
- Krishna Sankar
- Holden Karau (5 books)
3 5 23
Similar Books
Practical Data Science with R, 2nd Edition
by Nina Zumel, John Mount
Practical Data Science with R, Second Edition takes a practice-oriented approach to explaining basic principles in the ever expanding field of data science. You'll jump right to real-world use cases as you apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business...
Price: $39.99 | Publisher: Manning | Release: 2019
Fast Data Processing with Spark
by Holden Karau
Spark is a framework for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and inbuilt tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and ...
Price: $22.99 | Publisher: Packt Publishing | Release: 2013
Angular Development with Typescript, 2nd Edition
by Yakov Fain, Anton Moiseev
Angular Development with TypeScript, 2nd Edition is an intermediate-level tutorial that introduces Angular and TypeScript to developers comfortable with building web applications using other frameworks and tools.Whether you're building lightweight web clients or full-featured SPAs, Angular is a clear choice. The Angular framework is ...
Price: $39.99 | Publisher: Manning | Release: 2018
Big Data Processing with Apache Spark
by Manuel Ignacio Franco Galeano
Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streamin...
Price: $29.99 | Publisher: Packt Publishing | Release: 2018
Advanced Analytics with Spark, 2nd Edition
by Sandy Ryza, Uri Laserson, Josh Wills, Sean Owen
In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this ed...
Price: $29.85 | Publisher: O'Reilly Media | Release: 2017
Natural Language Processing with TensorFlow, 2nd Edition
by Thushan Ganegedara
Learning how to solve natural language processing (NLP) problems is an important skill to master due to the explosive growth of data combined with the demand for machine learning solutions in production. Natural Language Processing with TensorFlow, Second Edition, will teach you how to solve common real-world NLP problems with a variety o...
Price: $37.99 | Publisher: Packt Publishing | Release: 2022
by Mohammed Guller
This book is a step-by-step guide for learning how to use Spark for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. It covers Spark core and its add-on libraries, including Spark SQL, Spark Streaming, GraphX, MLlib, and Spark ML.Big Data Analytics w...
Price: $29.99 | Publisher: Apress | Release: 2016
Advanced Data Analytics Using Python, 2nd Edition
by Sayan Mukhopadhyay, Pratip Samanta
Understand advanced data analytics concepts such as time series and principal component analysis with ETL, supervised learning, and PySpark using Python. This book covers architectural patterns in data analytics, text and image classification, optimization techniques, natural language processing, and computer vision in the cloud environme...
Price: $32.09 | Publisher: Apress | Release: 2023