Spark in Action, 2nd Edition

Covers Apache Spark 3 with Examples in Java, Python, and Scala



Price: $59.99
Author: Jean-Georges Perrin
Publisher: Manning
Published: 2020
Pages: 576
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 1617295523
ISBN-13: 9781617295522

The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, 2nd Edition, you'll learn to take advantage of Spark's core features and incredible processing speed, with applications including real-time computation, lazy evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark's powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop.

Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds up to 100 times faster than Hadoop MapReduce on some workloads. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem.

Spark in Action, 2nd Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you'll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you'll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms.





Similar Books


PostGIS in Action, 2nd Edition

by Regina O. Obe, Leo S. Hsu

Processing data tied to location and topology requires specialized know-how. PostGIS is a free spatial database extender for PostgreSQL, every bit as good as proprietary software. With it, you can easily create location-aware queries in just a few lines of SQL code and build the back end for a mapping, raster analysis, or routing applicat...

Price:  $35.84  |  Publisher:  Manning  |  Release:  2015

Clojure in Action, 2nd Edition

by Amit Rathore

Clojure in Action, 2nd Edition is an expanded and improved version that's been updated to cover the new features of Clojure 1.6. The book gives you a rapid introduction to the Clojure language, moving from abstract theory to practical examples. You'll start by learning how to use Clojure as a general-purpose language. Next, you'll explore...

Price:  $38.08  |  Publisher:  Manning  |  Release:  2015

MongoDB in Action, 2nd Edition

by Kyle Banker, Peter Bakkum, Shaun Verch, Douglas Garrett, Tim Hawkins

MongoDB in Action, 2nd Edition is a completely revised and updated version. It introduces MongoDB 3.0 and the document-oriented database model. This perfectly paced book gives you both the big picture you'll need as a developer and enough low-level detail to satisfy system engineers.

Price:  $19.99  |  Publisher:  Manning  |  Release:  2016

Node.js in Action, 2nd Edition

by Alex Young, Bradley Meck, Mike Cantelon, Tim Oxley, Marc Harter, T.J. Holowaychuk, Nathan Rajlich

You already know JavaScript. The trick to mastering Node.js is learning how to build applications that fully exploit its powerful asynchronous event handling and non-blocking I/O features. The Node server radically simplifies event-driven real-time apps like chat, games, and live data analytics, and with its incredibly rich ecosystem of m...

Price:  $16.72  |  Publisher:  Manning  |  Release:  2017

Camel in Action, 2nd Edition

by Claus Ibsen, Jonathan Anstey

Apache Camel is a Java framework that implements enterprise integration patterns (EIPs) and comes with over 200 adapters to third-party systems. A concise DSL lets you build integration logic into your app with just a few lines of Java or XML. By using Camel, you benefit from the testing and experience of a large and vibrant open source c...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2018

D3.js in Action, 2nd Edition

by Elijah Meeks

Visualizing complex data is hard. Visualizing complex data on the web is darn near impossible without D3.js. D3 is a JavaScript library that provides a simple but powerful data visualization API over HTML, CSS, and SVG. Start with a structure, dataset, or algorithm; mix in D3; and you can programmatically generate static, animated, or int...

Price:  $15.63  |  Publisher:  Manning  |  Release:  2017

C++ Concurrency in Action, 2nd Edition

by Anthony Williams

This bestseller has been updated and revised to cover all the latest changes to C++14 and C++17! C++ Concurrency in Action, 2nd Edition teaches you everything you need to write robust and elegant multithreaded applications in C++17. You choose C++ when your applications need to run fast. Well-designed concurrency makes them go even faster. C...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2019

Docker in Action, 2nd Edition

by Jeff Nickoloff, Stephen Kuenzli

Docker in Action, 2nd Edition teaches you the skills and knowledge you need to create, deploy, and manage applications hosted in Docker containers. This bestseller has been fully updated with new examples, best practices, and a number of entirely new chapters. The idea behind Docker is simple: package just your application and its depende...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2019