Practical Statistics for Data Scientists

50 Essential Concepts




Price: $13.46 - $42.99
Authors: Peter Bruce, Andrew Bruce
Publisher: O'Reilly Media
Published: 2017
Pages: 318
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 1491952962
ISBN-13: 9781491952962

Statistical methods are a key part of data science, yet very few data scientists have any formal statistics training. Courses and books on basic statistics rarely cover the topic from a data science perspective. This practical guide explains how to apply various statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you're familiar with the R programming language and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

With this book, you'll learn: why exploratory data analysis is a key preliminary step in data science; how random sampling can reduce bias and yield a higher quality dataset, even with big data; how the principles of experimental design yield definitive answers to questions; how to use regression to estimate outcomes and detect anomalies; key classification techniques for predicting which categories a record belongs to; statistical machine learning methods that "learn" from data; and unsupervised learning methods for extracting meaning from unlabeled data.






Similar Books


Clojure for Data Science

by Henry Garner

The term "data science" has been widely used to define this new profession that is expected to interpret vast datasets and translate them to improved decision-making and performance. Clojure is a powerful language that combines the interactivity of a scripting language with the speed of a compiled language. Together with its ric...

Price:  $35.99  |  Publisher:  Packt Publishing  |  Release:  2015

Data Science with Java

by Michael Brzustowicz

Data Science is booming thanks to R and Python, but Java brings the robustness, convenience, and ability to scale critical to today's data science applications. With this practical book, Java software engineers looking to add data science skills will take a logical journey through the data science pipeline. Author Michael Brzustowicz expl...

Price:  $27.33  |  Publisher:  O'Reilly Media  |  Release:  2017

Python for Data Analysis

by Wes McKinney

Python for Data Analysis is concerned with the nuts and bolts of manipulating, processing, cleaning, and crunching data in Python. It is also a practical, modern introduction to scientific computing in Python, tailored for data-intensive applications. This is a book about the parts of the Python language and libraries you'll need to effec...

Price:  $33.58  |  Publisher:  O'Reilly Media  |  Release:  2012

Learn R for Applied Statistics

by Goh Hui

Gain the R programming language fundamentals for doing the applied statistics useful for data exploration and analysis in data science and data mining. This book covers topics ranging from R syntax basics, descriptive statistics, and data visualizations to inferential statistics and regressions. After learning R's syntax, you will work th...

Price:  $29.23  |  Publisher:  Apress  |  Release:  2019

Feature Engineering for Machine Learning

by Alice Zheng, Amanda Casari

Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, you'll learn techniques for extracting and transforming features - the numeric representations of raw data - into formats for machine-learning models. Each chapter guides you through a single data...

Price:  $43.95  |  Publisher:  O'Reilly Media  |  Release:  2018

Thoughtful Data Science

by David Taieb

Thoughtful Data Science brings new strategies and a carefully crafted programmer's toolset to work with modern, cutting-edge data analysis. This new approach is designed specifically to give developers more efficiency and power to create cutting-edge data analysis and artificial intelligence insights. Industry expert David Taieb bridges th...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2018

Programming Skills for Data Science

by Michael Freeman, Joel Ross

Using data science techniques, you can transform raw data into actionable insights for domains ranging from urban planning to precision medicine. Programming Skills for Data Science brings together all the foundational skills you need to get started, even if you have no programming or data science experience. Leading instructors Michael Fr...

Price:  $40.99  |  Publisher:  Addison-Wesley  |  Release:  2018

Data Scientists at Work

by Sebastian Gutierrez

Data Scientists at Work is a collection of interviews with sixteen of the world's most influential and innovative data scientists from across the spectrum of this hot new profession. "Data scientist is the sexiest job in the 21st century," according to the Harvard Business Review. By 2018, the United States will experien...

Price:  $34.99  |  Publisher:  Apress  |  Release:  2014