Apache Spark Books



Beginning Apache Spark 2

by Hien Luu

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it. Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data;...

Price:  $25.33  |  Publisher:  Apress  |  Release:  2018

Big Data Processing with Apache Spark

by Manuel Ignacio Franco Galeano

Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API, machine learning extension, and structured streaming. You'll begin by...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Next-Generation Big Data

by Butch Quinto

Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used fo...

Price:  $33.51  |  Publisher:  Apress  |  Release:  2018

Spark: The Definitive Guide

by Matei Zaharia, Bill Chambers

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals. You'll explore the basic operations and common functions of Spark's structured A...

Price:  $45.31  |  Publisher:  O'Reilly Media  |  Release:  2018

Scala Programming Projects

by Mikaël Valot, Nicolas Jorand

Scala is a type-safe JVM language that incorporates object-oriented and functional programming (OOP and FP) aspects. This book gets you started with essentials of software development by guiding you through various aspects of Scala programming, helping you bridge the gap between learning and implementing. You will learn about the unique features of Scala through diverse applications and experience simple ye...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

Jupyter Cookbook

by Dan Toomey

Jupyter has garnered a strong interest in the data science community of late, as it makes common data processing and analysis tasks much simpler. This book is for data science professionals who want to master various tasks related to Jupyter to create efficient, easy-to-share, scientific applications. The book starts with recipes on installing and running the Jupyter Notebook system on various platforms and ...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2018

Complete Guide to Open Source Big Data Stack

by Mike Frampton

See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together. In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he uses each chapter to introduce one ...

Price:  $43.33  |  Publisher:  Apress  |  Release:  2018

PySpark Recipes

by Raju Kumar Mishra

Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved! PySpark Recipes covers Hadoop and its shortcomings. The architecture of Spark, PySpark, and RDD are presented. You will ...

Price:  $33.31  |  Publisher:  Apress  |  Release:  2018

Advanced Data Analytics Using Python

by Sayan Mukhopadhyay

Gain a broad foundation of advanced data analytics concepts and discover the recent revolution in databases such as Neo4j, Elasticsearch, and MongoDB. This book discusses how to implement ETL techniques including topical crawling, which is applied in domains such as high-frequency algorithmic trading and goal-oriented dialog systems. You'll also see examples of machine learning concepts such as semi-supervi...

Price:  $34.82  |  Publisher:  Apress  |  Release:  2018

The Business Value of Developer Relations

by Mary Thengvall

Discover the true value of Developer Relations as you learn to build and maintain positive relationships with your developer community. Use the principles laid out in this book to walk through your company goals and discover how you can formulate a plan tailored to your specific needs. First you will understand the value of a technical community: why you need to foster a community and how to do it. Then you ...

Price:  $19.21  |  Publisher:  Apress  |  Release:  2018
