Beginning Apache Spark 2

With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library



Bookstore > Books > Beginning Apache Spark 2

Price$25.33 - $53.04
Rating
AuthorHien Luu
PublisherApress
Published2018
Pages393
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101484235789
ISBN-139781484235782
EBook Hardcover Paperback

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it.

Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data; and learn stream processing and build real-time applications with Spark Structured Streaming. Furthermore, you'll learn the fundamentals of Spark ML for machine learning and much more.

After you read this book, you will have the fundamentals to become proficient in using Apache Spark and know when and how to apply it to your big data applications.

Understand Spark unified data processing platform; How to run Spark in Spark Shell or Databricks; Use and manipulate RDDs; Deal with structured data using Spark SQL through its operations and advanced functions; Build real-time applications using Spark Structured Streaming; Develop intelligent applications with the Spark Machine Learning library.





Similar Books


Apache Spark 2: Data Processing and Real-Time Analytics

Apache Spark 2: Data Processing and Real-Time Analytics

by Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei

Apache Spark is an in-memory, cluster-based data processing system that provides a wide range of functionalities such as big data processing, analytics, machine learning, and more. With this Learning Path, you can take your knowledge of Apache Spark to the next level by learning how to expand Spark's functionality and building your own da...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

Mastering Apache Spark

Mastering Apache Spark

by Mike Frampton

Apache Spark is an in-memory cluster based parallel processing system that provides a wide range of functionality like graph processing, machine learning, stream processing and SQL. It operates at unprecedented speeds, is easy to use and offers a rich set of data transformations.This book aims to take your limited knowledge of Spark to th...

Price:  $35.25  |  Publisher:  Packt Publishing  |  Release:  2015

Sams Teach Yourself Apache Spark in 24 Hours

Sams Teach Yourself Apache Spark in 24 Hours

by Jeffrey Aven

Apache Spark is a fast, scalable, and flexible open source distributed processing engine for big data systems and is one of the most active open source big data projects to date. In just 24 lessons of one hour or less, Sams Teach Yourself Apache Spark in 24 Hours helps you build practical Big Data solutions that leverage Spark's amazing s...

Price:  $31.49  |  Publisher:  SAMS Publishing  |  Release:  2016

Apache Cookbook, 2nd Edition

Apache Cookbook, 2nd Edition

by Rich Bowen, Ken Coar

There's plenty of documentation on installing and configuring the Apache web server, but where do you find help for the day-to-day stuff, like adding common modules or fine-tuning your activity logging? That's easy. The new edition of the Apache Cookbook offers you updated solutions to the problems you're likely to encounter with the new ...

Price:  $4.79  |  Publisher:  O'Reilly Media  |  Release:  2007

Beginning Apache Cassandra Development

Beginning Apache Cassandra Development

by Vivek Mishra

Beginning Apache Cassandra Development introduces you to one of the most robust and best-performing NoSQL database platforms on the planet. Apache Cassandra is a document database following the JSON document model. It is specifically designed to manage large amounts of data across many commodity servers without there being any single poin...

Price:  $46.83  |  Publisher:  Apress  |  Release:  2014

Beginning HTML5 Media, 2nd Edition

Beginning HTML5 Media, 2nd Edition

by Tom Green, Silvia Pfeiffer

Beginning HTML5 Media, 2nd Edition is a comprehensive introduction to HTML5 video and audio. The HTML5 video standard enables browsers to support audio and video elements natively. This makes it very easy for web developers to publish audio and video, integrating both within the general presentation of web pages. For example, media elemen...

Price:  $30.26  |  Publisher:  Apress  |  Release:  2015

Beginning SQL Queries, 2nd Edition

Beginning SQL Queries, 2nd Edition

by Clare Churcher

Get started on mastering the one language binding the entire database industry. That language is SQL, and how it works is must-have knowledge for anyone involved with relational databases, and surprisingly also for anyone involved with NoSQL databases. SQL is universally used in querying and reporting on large data sets in order to genera...

Price:  $22.59  |  Publisher:  Apress  |  Release:  2016

Beginning Visual Basic 2015

Beginning Visual Basic 2015

by Bryan Newsome

Beginning Visual Basic 2015 is the ideal guide for new programmers, especially those learning their first language. This new edition has been updated to align with Visual Studio 2015, and also refocused to concentrate on key beginner topics. Precise, step-by-step instructions walk you through important tasks, and clear explanations target...

Price:  $18.09  |  Publisher:  Wrox  |  Release:  2015