Machine Learning with PySpark, 2nd Edition
With Natural Language Processing and Recommender Systems
Price | $49.05 - $57.72 |
Author | Pramod Singh |
Publisher | Apress |
Published | 2022 |
Pages | 220 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 1484277767 |
ISBN-13 | 9781484277768 |
Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers topics ranging from building scalable machine learning models, to natural language processing, to recommender systems.
Machine Learning with PySpark, Second Edition begins with the fundamentals of Apache Spark, including the latest updates to the framework. Next, you will learn the full spectrum of traditional machine learning algorithm implementations, along with natural language processing and recommender systems. You'll gain familiarity with the critical steps of selecting machine learning algorithms, ingesting data, and processing data to solve business problems. You'll see a demonstration of how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forests. You'll also learn how to automate these steps using Spark pipelines, followed by unsupervised models such as K-means and hierarchical clustering. A section on Natural Language Processing (NLP) covers text processing, text mining, and embeddings for classification. This new edition also introduces Koalas in Spark and shows how to automate data workflows using Airflow and PySpark's latest ML library.
After completing this book, you will understand how to use PySpark's machine learning library to build and train various machine learning models, along with related components such as data ingestion, processing, and visualization, to develop data-driven, intelligent applications.
Similar Books
Machine Learning with R, 2nd Edition
by Brett Lantz
Updated and upgraded to the latest libraries and most modern thinking, Machine Learning with R, Second Edition provides you with a rigorous introduction to this essential skill of professional data science. Without shying away from technical theory, it is written to provide focused and practical knowledge to get you building algorithms an...
Price: $39.99 | Publisher: Packt Publishing | Release: 2015
Machine Learning with R, 4th Edition
by Brett Lantz
Machine learning, at its core, is concerned with transforming data into actionable knowledge. R offers a powerful set of machine learning methods to quickly and easily gain insight from your data. Machine Learning with R, Fourth Edition, provides a hands-on, accessible, and readable guide to applying machine learning to real-world problem...
Price: $35.99 | Publisher: Packt Publishing | Release: 2023
Practical Data Science with R, 2nd Edition
by Nina Zumel, John Mount
Practical Data Science with R, Second Edition takes a practice-oriented approach to explaining basic principles in the ever expanding field of data science. You'll jump right to real-world use cases as you apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business...
Price: $39.99 | Publisher: Manning | Release: 2019
Machine Learning with Spark
by Nick Pentreath
Apache Spark is a framework for distributed computing that is designed from the ground up to be optimized for low latency tasks and in-memory data storage. It is one of the few frameworks for parallel computing that combines speed, scalability, in-memory processing, and fault tolerance with ease of programming and a flexible, expressive, ...
Price: $34.99 | Publisher: Packt Publishing | Release: 2015
Machine Learning with PySpark
by Pramod Singh
Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and reco...
Price: $20.41 | Publisher: Apress | Release: 2019
Java Persistence with Hibernate, 2nd Edition
by Christian Bauer, Gavin King, Gary Gregory
Java Persistence with Hibernate, 2nd Edition explores Hibernate by developing an application that ties together hundreds of individual examples. You'll immediately dig into the rich programming model of Hibernate, working through mappings, queries, fetching strategies, transactions, conversations, caching, and more. Along the way you...
Price: $39.99 | Publisher: Manning | Release: 2015
Machine Learning with TensorFlow
by Nishant Shukla, Kenneth Fricklas
TensorFlow, Google's library for large-scale machine learning, simplifies often-complex computations by representing them as graphs and efficiently mapping parts of the graphs to machines in a cluster or to the processors of a single machine. Machine Learning with TensorFlow gives readers a solid foundation in machine-learning concept...
Price: $5.60 | Publisher: Manning | Release: 2018
Angular Development with Typescript, 2nd Edition
by Yakov Fain, Anton Moiseev
Angular Development with TypeScript, 2nd Edition is an intermediate-level tutorial that introduces Angular and TypeScript to developers comfortable with building web applications using other frameworks and tools. Whether you're building lightweight web clients or full-featured SPAs, Angular is a clear choice. The Angular framework is ...
Price: $39.99 | Publisher: Manning | Release: 2018