Machine Learning with PySpark

With Natural Language Processing and Recommender Systems




Price: $20.41 - $28.15
Author: Pramod Singh
Publisher: Apress
Published: 2019
Pages: 223
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 1484241304
ISBN-13: 9781484241301

Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and recommender systems using PySpark.

Machine Learning with PySpark shows you how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forest. You'll also see unsupervised machine learning models such as K-means and hierarchical clustering. A major portion of the book focuses on feature engineering to create useful features with PySpark to train the machine learning models. The natural language processing section covers text processing, text mining, and embedding for classification.

After reading this book, you will understand how to use PySpark's machine learning library to build and train various machine learning models. Additionally, you'll become comfortable with related PySpark components, such as data ingestion, data processing, and data analysis, that you can use to develop data-driven intelligent applications.

Build a spectrum of supervised and unsupervised machine learning algorithms. Implement machine learning algorithms with Spark MLlib libraries. Develop a recommender system with Spark MLlib libraries. Handle issues related to feature engineering, class balance, bias and variance, and cross-validation for building an optimal fit model.



Similar Books


Machine Learning with Spark

by Nick Pentreath

Apache Spark is a framework for distributed computing that is designed from the ground up to be optimized for low latency tasks and in-memory data storage. It is one of the few frameworks for parallel computing that combines speed, scalability, in-memory processing, and fault tolerance with ease of programming and a flexible, expressive, ...

Price:  $34.99  |  Publisher:  Packt Publishing  |  Release:  2015

Machine Learning with PySpark, 2nd Edition

by Pramod Singh

Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers topics ranging from building scalable machine learning models, to natural language processing, to recommender systems. Machine Learning with PySpark, Second Edition begins with the fundamentals of Apache Spark, including the...

Price:  $49.05  |  Publisher:  Apress  |  Release:  2022

Machine Learning with TensorFlow

by Nishant Shukla, Kenneth Fricklas

TensorFlow, Google's library for large-scale machine learning, simplifies often-complex computations by representing them as graphs and efficiently mapping parts of the graphs to machines in a cluster or to the processors of a single machine. Machine Learning with TensorFlow gives readers a solid foundation in machine-learning concept...

Price:  $5.60  |  Publisher:  Manning  |  Release:  2018

Machine Learning with R, 2nd Edition

by Brett Lantz

Updated and upgraded to the latest libraries and most modern thinking, Machine Learning with R, Second Edition provides you with a rigorous introduction to this essential skill of professional data science. Without shying away from technical theory, it is written to provide focused and practical knowledge to get you building algorithms an...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2015

Hands-On Machine Learning with Azure

by Thomas K Abraham, Parashar Shah, Jen Stirrup, Lauri Lehman, Anindita Basak

Implementing Machine learning (ML) and Artificial Intelligence (AI) in the cloud had not been possible earlier due to the lack of processing power and storage. However, Azure has created ML and AI services that are easy to implement in the cloud. Hands-On Machine Learning with Azure teaches you how to perform advanced ML projects in the c...

Price:  $36.50  |  Publisher:  Packt Publishing  |  Release:  2018

Machine Learning with R, 4th Edition

by Brett Lantz

Machine learning, at its core, is concerned with transforming data into actionable knowledge. R offers a powerful set of machine learning methods to quickly and easily gain insight from your data. Machine Learning with R, Fourth Edition, provides a hands-on, accessible, and readable guide to applying machine learning to real-world problem...

Price:  $35.99  |  Publisher:  Packt Publishing  |  Release:  2023

Machine Learning with PyTorch and Scikit-Learn

by Sebastian Raschka, Yuxi (Hayden) Liu, Vahid Mirjalili

Machine Learning with PyTorch and Scikit-Learn is a comprehensive guide to machine learning and deep learning with PyTorch. It acts as both a step-by-step tutorial and a reference you'll keep coming back to as you build your machine learning systems. Packed with clear explanations, visualizations, and examples, the book covers all the...

Price:  $40.00  |  Publisher:  Packt Publishing  |  Release:  2022

Mastering Azure Machine Learning, 2nd Edition

by Christoph Korner, Marcel Alsdorf

Azure Machine Learning is a cloud service for accelerating and managing the machine learning (ML) project life cycle that ML professionals, data scientists, and engineers can use in their day-to-day workflows. This book covers the end-to-end ML process using Microsoft Azure Machine Learning, including data preparation, performing and logg...

Price:  $41.99  |  Publisher:  Packt Publishing  |  Release:  2022