Apache Spark Books



Microsoft Excel VBA and Macros

by Bill Jelen, Tracy Syrstad

Use this guide to automate virtually any routine Excel task: save yourself hours, days, maybe even weeks. Make Excel do things you thought were impossible, discover macro techniques you won't find anywhere else, and create automated reports that are amazingly powerful. Bill Jelen and Tracy Syrstad help you instantly visualize information to make it actionable; capture data from anywhere, and use it anywhere;...

Price:  $43.95  |  Publisher:  Microsoft Press  |  Release:  2022

Data Science on the Google Cloud Platform, 2nd Edition

by Valliappa Lakshmanan

Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build using Google Cloud Platform (GCP). This hands-on guide shows data engineers and data scientists how to implement an end-to-end data pipeline with cloud native tools on GCP. Throughout this updated second edition, you'll work through a sample business decision by employing a variety ...

Price:  $67.34  |  Publisher:  O'Reilly Media  |  Release:  2022

Machine Learning with PySpark, 2nd Edition

by Pramod Singh

Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers topics ranging from building scalable machine learning models, to natural language processing, to recommender systems. Machine Learning with PySpark, Second Edition begins with the fundamentals of Apache Spark, including the latest updates to the framework. Next, you will learn the full spectr...

Price:  $49.05  |  Publisher:  Apress  |  Release:  2022

Data Analysis with Python and PySpark

by Jonathan Rioux

Data Analysis with Python and PySpark is your guide to delivering successful Python-driven data projects. Packed with relevant examples and essential techniques, this practical book teaches you to build pipelines for reporting, machine learning, and other data-centric tasks. Quick exercises in every chapter help you practice what you've learned, and rapidly start implementing PySpark into your data sys...

Price:  $57.69  |  Publisher:  Manning  |  Release:  2022

Grokking Streaming Systems

by Josh Fischer, Ning Wang

Grokking Streaming Systems is a simple guide to the complex concepts behind streaming systems. This friendly and framework-agnostic tutorial teaches you how to handle real-time events, and even design and build your own streaming job that's a perfect fit for your needs. Each new idea is carefully explained with diagrams, clear examples, and fun dialogue between perplexed personalities! Streaming systems...

Price:  $59.99  |  Publisher:  Manning  |  Release:  2022

Introducing .NET for Apache Spark

by Ed Elliott

Get started using Apache Spark via C# or F# and the .NET for Apache Spark bindings. This book is an introduction to both Apache Spark and the .NET bindings. Readers new to Apache Spark will get up to speed quickly using Spark for data processing tasks performed against large and very large datasets. You will learn how to combine your knowledge of .NET with Apache Spark to bring massive computing power to be...

Price:  $40.99  |  Publisher:  Apress  |  Release:  2021

Practical Weak Supervision

by Wee Hyong Tok, Amit Bahree, Senja Filipi

Most data scientists and engineers today rely on quality labeled data to train machine learning models. But building a training set manually is time-consuming and expensive, leaving many companies with unfinished ML projects. There's a more practical approach. In this book, Wee Hyong Tok, Amit Bahree, and Senja Filipi show you how to create products using weakly supervised learning models. You'll l...

Price:  $61.93  |  Publisher:  O'Reilly Media  |  Release:  2021

Data Science at the Command Line, 2nd Edition

by Jeroen Janssens

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work wit...

Price:  $48.49  |  Publisher:  O'Reilly Media  |  Release:  2021

Designing Cloud Data Platforms

by Danil Zburivsky, Lynda Partner

Centralized data warehouses, the long-time de facto standard for housing data for analytics, are rapidly giving way to multi-faceted cloud data platforms. Companies that embrace modern cloud data platforms benefit from an integrated view of their business using all of their data and can take advantage of advanced analytic practices to drive predictions and as yet unimagined data services. Designing Cloud Dat...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2021

Next-Generation Machine Learning with Spark

by Butch Quinto

Access real-world documentation and examples for the Spark platform for building large-scale, enterprise-grade machine learning applications. The past decade has seen an astonishing series of advances in machine learning. These breakthroughs are disrupting our everyday life and making an impact across every industry. Next-Generation Machine Learning with Spark provides a gentle introduction to Spark and Spark...

Price:  $26.41  |  Publisher:  Apress  |  Release:  2020
