Data Science at the Command Line, 2nd Edition

Obtain, Scrub, Explore, and Model Data with Unix Power Tools



Bookstore > Books > Data Science at the Command Line, 2nd Edition

Price$48.49 - $59.99
Rating
AuthorJeroen Janssens
PublisherO'Reilly Media
Published2021
Pages282
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101492087912
ISBN-139781492087915
EBook Hardcover Paperback

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools-useful whether you work with Windows, macOS, or Linux.

You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, engineers, system administrators, and researchers.

Obtain data from websites, APIs, databases, and spreadsheets; Perform scrub operations on text, CSV, HTM, XML, and JSON files; Explore data, compute descriptive statistics, and create visualizations; Manage your data science workflow; Create reusable command-line tools from one-liners and existing Python or R code; Parallelize and distribute data-intensive pipelines; Model data with dimensionality reduction, clustering, regression, and classification algorithms; Leverage the command line from Python. Jupyter. R. RStudio. and Apache Spark.


  1. (2 books)



Similar Books


Data Science at the Command Line

Data Science at the Command Line

by Jeroen Janssens

This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data.To get you started - whether you're on Windows, OS X, or Linux - author Jero...

Price:  $7.67  |  Publisher:  O'Reilly Media  |  Release:  2014

Beginning the Linux Command Line, 2nd edition

Beginning the Linux Command Line, 2nd edition

by Sander van Vugt

This is Linux for those of us who don't mind typing. All Linux users and administrators tend to like the flexibility and speed of Linux administration from the command line in byte-sized chunks, instead of fairly standard graphical user interfaces. Beginning the Linux Command Line is verified against all of the most important Linux distri...

Price:  $33.04  |  Publisher:  Apress  |  Release:  2015

The Linux Command Line, 2nd Edition

The Linux Command Line, 2nd Edition

by William Shotts

The Linux Command Line takes you from your very first terminal keystrokes to writing full programs in Bash, the most popular Linux shell (or command line). Along the way you'll learn the timeless skills handed down by generations of experienced, mouse-shunning gurus: file navigation, environment configuration, command chaining, pattern ma...

Price:  $25.31  |  Publisher:  No Starch Press  |  Release:  2019

Practical DataOps

Practical DataOps

by Harvinder Atwal

Gain a practical introduction to DataOps, a new discipline for delivering data science at scale inspired by practices at companies such as Facebook, Uber, LinkedIn, Twitter, and eBay. Organizations need more than the latest AI algorithms, hottest tools, and best people to turn data into insight-driven action and useful analytical data pro...

Price:  $32.99  |  Publisher:  Apress  |  Release:  2020

Command Line Fundamentals

Command Line Fundamentals

by Vivek N

The most basic interface to a computer - the command line - remains the most flexible and powerful way of processing data and performing and automating various day-to-day tasks.Command Line Fundamentals begins by exploring the basics, and then focuses on the most common tool, the Bash shell (which is standard on all Linux and iOS systems)...

Price:  $34.99  |  Publisher:  Packt Publishing  |  Release:  2018

Information Security The Complete Reference, 2nd Edition

Information Security The Complete Reference, 2nd Edition

by Mark Rhodes-Ousley

Today's complex world of mobile platforms, cloud computing, and ubiquitous data access puts new security demands on every IT professional. Information Security: The Complete Reference, 2nd Edition is the only comprehensive book that offers vendor-neutral details on all aspects of information protection, with an eye toward the evolving thr...

Price:  $41.62  |  Publisher:  McGraw-Hill  |  Release:  2013

Data Science with SQL Server Quick Start Guide

Data Science with SQL Server Quick Start Guide

by Dejan Sarka

SQL Server only started to fully support data science with its two most recent editions. If you are a professional from both worlds, SQL Server and data science, and interested in using SQL Server and Machine Learning (ML) Services for your projects, then this is the ideal book for you.This book is the ideal introduction to data science w...

Price:  $34.99  |  Publisher:  Packt Publishing  |  Release:  2018

Data Science Algorithms in a Week, 2nd Edition

Data Science Algorithms in a Week, 2nd Edition

by David Natingga

Machine learning applications are highly automated and self-modifying, and continue to improve over time with minimal human intervention, as they learn from the trained data. To address the complex nature of various real-world data problems, specialized machine learning algorithms have been developed. Through algorithmic and statistical a...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2018