Apache Spark Books



Machine Learning with PySpark

by Pramod Singh

Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and recommender systems using PySpark. Machine Learning with PySpark shows you ...

Price:  $20.41  |  Publisher:  Apress  |  Release:  2019

Beginning Apache Spark 2

by Hien Luu

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it. Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data;...

Price:  $25.33  |  Publisher:  Apress  |  Release:  2018

Big Data Processing with Apache Spark

by Manuel Ignacio Franco Galeano

Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API, machine learning extension, and structured streaming. You'll begin by...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Next-Generation Big Data

by Butch Quinto

Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used fo...

Price:  $33.51  |  Publisher:  Apress  |  Release:  2018

Practical Apache Spark

by Subhashini Chellappan, Dharanitharan Ganesan

Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLlib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You'l...

Price:  $31.66  |  Publisher:  Apress  |  Release:  2018

Spark: The Definitive Guide

by Matei Zaharia, Bill Chambers

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals. You'll explore the basic operations and common functions of Spark's structured A...

Price:  $45.69  |  Publisher:  O'Reilly Media  |  Release:  2018

Scala Programming Projects

by Mikaël Valot, Nicolas Jorand

Scala is a type-safe JVM language that incorporates object-oriented and functional programming (OOP and FP) aspects. This book gets you started with essentials of software development by guiding you through various aspects of Scala programming, helping you bridge the gap between learning and implementing. You will learn about the unique features of Scala through diverse applications and experience simple ye...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

Jupyter Cookbook

by Dan Toomey

Jupyter has garnered a strong interest in the data science community of late, as it makes common data processing and analysis tasks much simpler. This book is for data science professionals who want to master various tasks related to Jupyter to create efficient, easy-to-share, scientific applications. The book starts with recipes on installing and running the Jupyter Notebook system on various platforms and ...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2018

Complete Guide to Open Source Big Data Stack

by Mike Frampton

See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together. In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he uses each chapter to introduce one ...

Price:  $37.75  |  Publisher:  Apress  |  Release:  2018

PySpark Recipes

by Raju Kumar Mishra

Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved! PySpark Recipes covers Hadoop and its shortcomings. The architecture of Spark, PySpark, and RDD are presented. You will ...

Price:  $33.53  |  Publisher:  Apress  |  Release:  2018
