Beginning Apache Spark 2
With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library
|Price||$25.33 - $53.04
|Format||Paper book / ebook (PDF)|
Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it.
Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data; and learn stream processing and build real-time applications with Spark Structured Streaming. Furthermore, you'll learn the fundamentals of Spark ML for machine learning and much more.
After you read this book, you will have the fundamentals to become proficient in using Apache Spark and know when and how to apply it to your big data applications.
Understand Spark unified data processing platform; How to run Spark in Spark Shell or Databricks; Use and manipulate RDDs; Deal with structured data using Spark SQL through its operations and advanced functions; Build real-time applications using Spark Structured Streaming; Develop intelligent applications with the Spark Machine Learning library.
by Mike Frampton
Apache Spark is an in-memory cluster based parallel processing system that provides a wide range of functionality like graph processing, machine learning, stream processing and SQL. It operates at unprecedented speeds, is easy to use and offers a rich set of data transformations.This book aims to take your limited knowledge of Spark to th...
Price: $34.63 | Publisher: Packt Publishing | Release: 2015
by Jeffrey Aven
Apache Spark is a fast, scalable, and flexible open source distributed processing engine for big data systems and is one of the most active open source big data projects to date. In just 24 lessons of one hour or less, Sams Teach Yourself Apache Spark in 24 Hours helps you build practical Big Data solutions that leverage Spark's amazing s...
Price: $32.51 | Publisher: SAMS Publishing | Release: 2016
by Rich Bowen, Ken Coar
There's plenty of documentation on installing and configuring the Apache web server, but where do you find help for the day-to-day stuff, like adding common modules or fine-tuning your activity logging? That's easy. The new edition of the Apache Cookbook offers you updated solutions to the problems you're likely to encounter with the new ...
Price: $4.79 | Publisher: O'Reilly Media | Release: 2007
by Clare Churcher
Get started on mastering the one language binding the entire database industry. That language is SQL, and how it works is must-have knowledge for anyone involved with relational databases, and surprisingly also for anyone involved with NoSQL databases. SQL is universally used in querying and reporting on large data sets in order to genera...
Price: $28.08 | Publisher: Apress | Release: 2016
by Bryan Newsome
Beginning Visual Basic 2015 is the ideal guide for new programmers, especially those learning their first language. This new edition has been updated to align with Visual Studio 2015, and also refocused to concentrate on key beginner topics. Precise, step-by-step instructions walk you through important tasks, and clear explanations target...
Price: $18.09 | Publisher: Wrox | Release: 2015
by Scott Shaw, Kathi Kellenberger
Beginning T-SQL 2012 is the first step toward learning the T-SQL language that underlies Microsoft's SQL Server database engine. T-SQL is essential in writing SQL statements to get data into and out of a database. T-SQL is the foundation for business logic embedded in the database in the form of stored procedures and functions. Beginning ...
Price: $35.10 | Publisher: Apress | Release: 2012
by Clare Churcher
Database design is not an exact science. Many are surprised to find that problems with their databases are caused by poor design rather than by difficulties in using the database management software. Beginning Database Design, 2nd Edition helps you ask and answer important questions about your data so you can understand the problem you ar...
Price: $16.00 | Publisher: Apress | Release: 2012
by Vivek Mishra
Beginning Apache Cassandra Development introduces you to one of the most robust and best-performing NoSQL database platforms on the planet. Apache Cassandra is a document database following the JSON document model. It is specifically designed to manage large amounts of data across many commodity servers without there being any single poin...
Price: $49.04 | Publisher: Apress | Release: 2014