Machine Learning with PySpark, 2nd Edition

With Natural Language Processing and Recommender Systems



Bookstore > Books > Machine Learning with PySpark, 2nd Edition

Price$49.05 - $57.72
Rating
AuthorPramod Singh
PublisherApress
Published2022
Pages220
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101484277767
ISBN-139781484277768
EBook Hardcover Paperback

Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers topics ranging from building scalable machine learning models, to natural language processing, to recommender systems.

Machine Learning with PySpark, Second Edition begins with the fundamentals of Apache Spark, including the latest updates to the framework. Next, you will learn the full spectrum of traditional machine learning algorithm implementations, along with natural language processing and recommender systems. You'll gain familiarity with the critical process of selecting machine learning algorithms, data ingestion, and data processing to solve business problems. You'll see a demonstration of how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forests. You'll also learn how to automate the steps using Spark pipelines, followed by unsupervised models such as K-means and hierarchical clustering. A section on Natural Language Processing (NLP) covers text processing, text mining, and embeddings for classification. This new edition also introduces Koalas in Spark and how to automate data workflow using Airflow and PySpark's latest ML library.

After completing this book, you will understand how to use PySpark's machine learning library to build and train various machine learning models, along with related components such as data ingestion, processing and visualization to develop data-driven intelligent applications.


  1. (3 books)


Similar Books


Machine Learning with R, 2nd Edition

Machine Learning with R, 2nd Edition

by Brett Lantz

Updated and upgraded to the latest libraries and most modern thinking, Machine Learning with R, Second Edition provides you with a rigorous introduction to this essential skill of professional data science. Without shying away from technical theory, it is written to provide focused and practical knowledge to get you building algorithms an...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2015

Machine Learning with R, 4th Edition

Machine Learning with R, 4th Edition

by Brett Lantz

Machine learning, at its core, is concerned with transforming data into actionable knowledge. R offers a powerful set of machine learning methods to quickly and easily gain insight from your data. Machine Learning with R, Fourth Edition, provides a hands-on, accessible, and readable guide to applying machine learning to real-world problem...

Price:  $35.99  |  Publisher:  Packt Publishing  |  Release:  2023

Practical Data Science with R, 2nd Edition

Practical Data Science with R, 2nd Edition

by Nina Zumel, John Mount

Practical Data Science with R, Second Edition takes a practice-oriented approach to explaining basic principles in the ever expanding field of data science. You'll jump right to real-world use cases as you apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2019

Machine Learning with Spark

Machine Learning with Spark

by Nick Pentreath

Apache Spark is a framework for distributed computing that is designed from the ground up to be optimized for low latency tasks and in-memory data storage. It is one of the few frameworks for parallel computing that combines speed, scalability, in-memory processing, and fault tolerance with ease of programming and a flexible, expressive, ...

Price:  $34.99  |  Publisher:  Packt Publishing  |  Release:  2015

Machine Learning with PySpark

Machine Learning with PySpark

by Pramod Singh

Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and reco...

Price:  $20.41  |  Publisher:  Apress  |  Release:  2019

Java Persistence with Hibernate, 2nd Edition

Java Persistence with Hibernate, 2nd Edition

by Christian Bauer, Gavin King, Gary Gregory

Java Persistence with Hibernate, 2nd Edition explores Hibernate by developing an application that ties together hundreds of individual examples. You'll immediately dig into the rich programming model of Hibernate, working through mappings, queries, fetching strategies, transactions, conversations, caching, and more. Along the way you...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2015

Machine Learning with TensorFlow

Machine Learning with TensorFlow

by Nishant Shukla, Kenneth Fricklas

TensorFlow, Google's library for large-scale machine learning, simplifies often-complex computations by representing them as graphs and efficiently mapping parts of the graphs to machines in a cluster or to the processors of a single machine.Machine Learning with TensorFlow gives readers a solid foundation in machine-learning concept...

Price:  $5.60  |  Publisher:  Manning  |  Release:  2018

Angular Development with Typescript, 2nd Edition

Angular Development with Typescript, 2nd Edition

by Yakov Fain, Anton Moiseev

Angular Development with TypeScript, 2nd Edition is an intermediate-level tutorial that introduces Angular and TypeScript to developers comfortable with building web applications using other frameworks and tools.Whether you're building lightweight web clients or full-featured SPAs, Angular is a clear choice. The Angular framework is ...

Price:  $39.99  |  Publisher:  Manning  |  Release:  2018