Beginning Apache Spark 2

With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library



Price: $25.33 - $53.04
Author: Hien Luu
Publisher: Apress
Published: 2018
Pages: 393
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 1484235789
ISBN-13: 9781484235782

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with cloud technologies. Beginning Apache Spark 2 introduces Apache Spark and shows you how to work with it.

Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data; and learn stream processing and build real-time applications with Spark Structured Streaming. Furthermore, you'll learn the fundamentals of Spark ML for machine learning and much more.

After you read this book, you will have the fundamentals to become proficient in using Apache Spark and know when and how to apply it to your big data applications.

What you will learn: understand the Spark unified data processing platform; run Spark in the Spark shell or Databricks; use and manipulate RDDs; work with structured data using Spark SQL through its operations and advanced functions; build real-time applications using Spark Structured Streaming; and develop intelligent applications with the Spark Machine Learning library.





Similar Books


Mastering Apache Spark


by Mike Frampton

Apache Spark is an in-memory cluster based parallel processing system that provides a wide range of functionality like graph processing, machine learning, stream processing and SQL. It operates at unprecedented speeds, is easy to use and offers a rich set of data transformations. This book aims to take your limited knowledge of Spark to th...

Price:  $34.63  |  Publisher:  Packt Publishing  |  Release:  2015

Sams Teach Yourself Apache Spark in 24 Hours


by Jeffrey Aven

Apache Spark is a fast, scalable, and flexible open source distributed processing engine for big data systems and is one of the most active open source big data projects to date. In just 24 lessons of one hour or less, Sams Teach Yourself Apache Spark in 24 Hours helps you build practical Big Data solutions that leverage Spark's amazing s...

Price:  $32.51  |  Publisher:  SAMS Publishing  |  Release:  2016

Apache Cookbook, 2nd Edition


by Rich Bowen, Ken Coar

There's plenty of documentation on installing and configuring the Apache web server, but where do you find help for the day-to-day stuff, like adding common modules or fine-tuning your activity logging? That's easy. The new edition of the Apache Cookbook offers you updated solutions to the problems you're likely to encounter with the new ...

Price:  $4.79  |  Publisher:  O'Reilly Media  |  Release:  2007

Beginning SQL Queries, 2nd Edition


by Clare Churcher

Get started on mastering the one language binding the entire database industry. That language is SQL, and how it works is must-have knowledge for anyone involved with relational databases, and surprisingly also for anyone involved with NoSQL databases. SQL is universally used in querying and reporting on large data sets in order to genera...

Price:  $28.08  |  Publisher:  Apress  |  Release:  2016

Beginning Visual Basic 2015


by Bryan Newsome

Beginning Visual Basic 2015 is the ideal guide for new programmers, especially those learning their first language. This new edition has been updated to align with Visual Studio 2015, and also refocused to concentrate on key beginner topics. Precise, step-by-step instructions walk you through important tasks, and clear explanations target...

Price:  $18.09  |  Publisher:  Wrox  |  Release:  2015

Beginning T-SQL 2012, 2nd Edition


by Scott Shaw, Kathi Kellenberger

Beginning T-SQL 2012 is the first step toward learning the T-SQL language that underlies Microsoft's SQL Server database engine. T-SQL is essential in writing SQL statements to get data into and out of a database. T-SQL is the foundation for business logic embedded in the database in the form of stored procedures and functions. Beginning ...

Price:  $35.10  |  Publisher:  Apress  |  Release:  2012

Beginning Database Design, 2nd Edition


by Clare Churcher

Database design is not an exact science. Many are surprised to find that problems with their databases are caused by poor design rather than by difficulties in using the database management software. Beginning Database Design, 2nd Edition helps you ask and answer important questions about your data so you can understand the problem you ar...

Price:  $16.00  |  Publisher:  Apress  |  Release:  2012

Beginning Apache Cassandra Development


by Vivek Mishra

Beginning Apache Cassandra Development introduces you to one of the most robust and best-performing NoSQL database platforms on the planet. Apache Cassandra is a document database following the JSON document model. It is specifically designed to manage large amounts of data across many commodity servers without there being any single poin...

Price:  $49.04  |  Publisher:  Apress  |  Release:  2014