Feature Engineering for Machine Learning

Principles and Techniques for Data Scientists



Bookstore > Books > Feature Engineering for Machine Learning

Price$29.93 - $43.49
Rating
AuthorsAlice Zheng, Amanda Casari
PublisherO'Reilly Media
Published2018
Pages218
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101491953241
ISBN-139781491953242
EBook Hardcover Paperback

Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, you'll learn techniques for extracting and transforming features - the numeric representations of raw data - into formats for machine-learning models. Each chapter guides you through a single data problem, such as how to represent text or image data. Together, these examples illustrate the main principles of feature engineering.

Rather than simply teach these principles, authors Alice Zheng and Amanda Casari focus on practical application with exercises throughout the book. The closing chapter brings everything together by tackling a real-world, structured dataset with several feature-engineering techniques. Python packages including numpy, Pandas, Scikit-learn, and Matplotlib are used in code examples.

Feature engineering for numeric data: filtering, binning, scaling, log transforms, and power transforms; Natural text techniques: bag-of-words, n-grams, and phrase detection; Frequency-based filtering and feature scaling for eliminating uninformative features; Encoding techniques of categorical variables, including feature hashing and bin-counting; Model-based feature engineering with principal component analysis; The concept of model stacking, using k-means as a featurization technique; Image feature extraction with manual and deep-learning techniques.




4 5 251

Similar Books


Clojure for Machine Learning

Clojure for Machine Learning

by Akhil Wali

Clojure for Machine Learning is an introduction to machine learning techniques and algorithms. This book demonstrates how you can apply these techniques to real-world problems using the Clojure programming language.It explores many machine learning techniques and also describes how to use Clojure to build machine learning systems. This bo...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2014

Python Feature Engineering Cookbook

Python Feature Engineering Cookbook

by Soledad Galli

Feature engineering is invaluable for developing and enriching your machine learning models. In this cookbook, you will work with the best tools to streamline your feature engineering pipelines and techniques and simplify and improve the quality of your code.Using Python libraries such as pandas, scikit-learn, Featuretools, and Feature-en...

Price:  $31.83  |  Publisher:  Packt Publishing  |  Release:  2020

Python Feature Engineering Cookbook, 2nd Edition

Python Feature Engineering Cookbook, 2nd Edition

by Soledad Galli

Feature engineering, the process of transforming variables and creating features, albeit time-consuming, ensures that your machine learning models perform seamlessly. This second edition of Python Feature Engineering Cookbook will take the struggle out of feature engineering by showing you how to use open source Python libraries to accele...

Price:  $32.99  |  Publisher:  Packt Publishing  |  Release:  2022

Machine Learning with Spark

Machine Learning with Spark

by Nick Pentreath

Apache Spark is a framework for distributed computing that is designed from the ground up to be optimized for low latency tasks and in-memory data storage. It is one of the few frameworks for parallel computing that combines speed, scalability, in-memory processing, and fault tolerance with ease of programming and a flexible, expressive, ...

Price:  $34.99  |  Publisher:  Packt Publishing  |  Release:  2015

Scala for Machine Learning

Scala for Machine Learning

by Patrick R. Nicolas

The discovery of information through data clustering and classification is becoming a key differentiator for competitive organizations. Machine learning applications are everywhere, from self-driving cars, engineering designs, biometrics, and trading strategies, to detection of genetic anomalies.The book begins with an introduction to the...

Price:  $59.99  |  Publisher:  Packt Publishing  |  Release:  2014

Machine Learning with R, 4th Edition

Machine Learning with R, 4th Edition

by Brett Lantz

Machine learning, at its core, is concerned with transforming data into actionable knowledge. R offers a powerful set of machine learning methods to quickly and easily gain insight from your data. Machine Learning with R, Fourth Edition, provides a hands-on, accessible, and readable guide to applying machine learning to real-world problem...

Price:  $35.99  |  Publisher:  Packt Publishing  |  Release:  2023

Mastering Azure Machine Learning, 2nd Edition

Mastering Azure Machine Learning, 2nd Edition

by Christoph Korner, Marcel Alsdorf

Azure Machine Learning is a cloud service for accelerating and managing the machine learning (ML) project life cycle that ML professionals, data scientists, and engineers can use in their day-to-day workflows. This book covers the end-to-end ML process using Microsoft Azure Machine Learning, including data preparation, performing and logg...

Price:  $41.99  |  Publisher:  Packt Publishing  |  Release:  2022

F# for Machine Learning Essentials

F# for Machine Learning Essentials

by Sudipta Mukherjee

The F# functional programming language enables developers to write simple code to solve complex problems. With F#, developers create consistent and predictable programs that are easier to test and reuse, simpler to parallelize, and are less prone to bugs.If you want to learn how to use F# to build machine learning systems, then this is th...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2016