Spark in Action, 2nd Edition

Covers Apache Spark 3 with Examples in Java, Python, and Scala




Price: $35.89 - $59.99
Author: Jean-Georges Perrin
Publisher: Manning
Published: 2020
Pages: 576
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 1617295523
ISBN-13: 9781617295522

The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, 2nd Edition, you'll learn to take advantage of Spark's core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark's powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop.

Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds up to 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem.

Spark in Action, 2nd Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you'll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you'll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms.





Similar Books


PostGIS in Action, 2nd Edition

by Regina O. Obe, Leo S. Hsu

Processing data tied to location and topology requires specialized know-how. PostGIS is a free spatial database extender for PostgreSQL, every bit as good as proprietary software. With it, you can easily create location-aware queries in just a few lines of SQL code and build the back end for a mapping, raster analysis, or routing applicat...

Price:  $4.40  |  Publisher:  Manning  |  Release:  2015

MongoDB in Action, 2nd Edition

by Kyle Banker, Peter Bakkum, Shaun Verch, Douglas Garrett, Tim Hawkins

MongoDB in Action, 2nd Edition is a completely revised and updated version. It introduces MongoDB 3.0 and the document-oriented database model. This perfectly paced book gives you both the big picture you'll need as a developer and enough low-level detail to satisfy system engineers.

Price:  $19.99  |  Publisher:  Manning  |  Release:  2016

Node.js in Action, 2nd Edition

by Alex Young, Bradley Meck, Mike Cantelon, Tim Oxley, Marc Harter, T.J. Holowaychuk, Nathan Rajlich

You already know JavaScript. The trick to mastering Node.js is learning how to build applications that fully exploit its powerful asynchronous event handling and non-blocking I/O features. The Node server radically simplifies event-driven real-time apps like chat, games, and live data analytics, and with its incredibly rich ecosystem of m...

Price:  $25.18  |  Publisher:  Manning  |  Release:  2017

Camel in Action, 2nd Edition

by Claus Ibsen, Jonathan Anstey

Apache Camel is a Java framework that implements enterprise integration patterns (EIPs) and comes with over 200 adapters to third-party systems. A concise DSL lets you build integration logic into your app with just a few lines of Java or XML. By using Camel, you benefit from the testing and experience of a large and vibrant open source c...

Price:  $50.26  |  Publisher:  Manning  |  Release:  2018

Docker in Practice, 2nd Edition

by Ian Miell, Aidan Hobson Sayers

Docker in Practice, 2nd Edition presents over 100 practical techniques, hand-picked to help you get the most out of Docker. Following a Problem/Solution/Discussion format, you'll walk through specific examples that you can use immediately, and you'll get expert guidance on techniques that you can apply to a whole range of scena...

Price:  $46.45  |  Publisher:  Manning  |  Release:  2019

Windows PowerShell in Action, 3rd Edition

by Bruce Payette, Richard Siddaway

In 2006, Windows PowerShell reinvented the way administrators and developers interact with Windows. Today, PowerShell is required knowledge for Windows admins and devs. This powerful, dynamic language provides command-line control of the Windows OS and most Windows servers, such as Exchange and SCCM. And because it's a first-class .N...

Price:  $31.03  |  Publisher:  Manning  |  Release:  2017

Clojure in Action, 2nd Edition

by Amit Rathore

Clojure in Action, 2nd Edition is an expanded and improved version that's been updated to cover the new features of Clojure 1.6. The book gives you a rapid introduction to the Clojure language, moving from abstract theory to practical examples. You'll start by learning how to use Clojure as a general-purpose language. Next, you...

Price:  $38.08  |  Publisher:  Manning  |  Release:  2015

D3.js in Action, 2nd Edition

by Elijah Meeks

Visualizing complex data is hard. Visualizing complex data on the web is darn near impossible without D3.js. D3 is a JavaScript library that provides a simple but powerful data visualization API over HTML, CSS, and SVG. Start with a structure, dataset, or algorithm; mix in D3; and you can programmatically generate static, animated, or int...

Price:  $15.63  |  Publisher:  Manning  |  Release:  2017