Data Science at the Command Line, 2nd Edition

Obtain, Scrub, Explore, and Model Data with Unix Power Tools



Bookstore > Books > Data Science at the Command Line, 2nd Edition

Price$54.85 - $59.99
Rating
AuthorJeroen Janssens
PublisherO'Reilly Media
Published2021
Pages282
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101492087912
ISBN-139781492087915
EBook Hardcover Paperback

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools-useful whether you work with Windows, macOS, or Linux.

You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, engineers, system administrators, and researchers.

Obtain data from websites, APIs, databases, and spreadsheets; Perform scrub operations on text, CSV, HTM, XML, and JSON files; Explore data, compute descriptive statistics, and create visualizations; Manage your data science workflow; Create reusable command-line tools from one-liners and existing Python or R code; Parallelize and distribute data-intensive pipelines; Model data with dimensionality reduction, clustering, regression, and classification algorithms; Leverage the command line from Python. Jupyter. R. RStudio. and Apache Spark.


  1. (2 books)


4 5 28

Similar Books


Efficient Linux at the Command Line

Efficient Linux at the Command Line

by Daniel J. Barrett

Take your Linux skills to the next level! Whether you're a system administrator, software developer, site reliability engineer, or enthusiastic hobbyist, this practical, hands-on book will help you work faster, smarter, and more efficiently. You'll learn how to create and run complex commands that solve real business problems, p...

Price:  $37.10  |  Publisher:  O'Reilly Media  |  Release:  2022

Data Science at the Command Line

Data Science at the Command Line

by Jeroen Janssens

This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data.To get you started - whether you're on Windows, OS X, or Linux - a...

Price:  $7.67  |  Publisher:  O'Reilly Media  |  Release:  2014

Beginning the Linux Command Line, 2nd edition

Beginning the Linux Command Line, 2nd edition

by Sander van Vugt

This is Linux for those of us who don't mind typing. All Linux users and administrators tend to like the flexibility and speed of Linux administration from the command line in byte-sized chunks, instead of fairly standard graphical user interfaces. Beginning the Linux Command Line is verified against all of the most important Linux d...

Price:  $33.04  |  Publisher:  Apress  |  Release:  2015

The Linux Command Line, 2nd Edition

The Linux Command Line, 2nd Edition

by William Shotts

The Linux Command Line takes you from your very first terminal keystrokes to writing full programs in Bash, the most popular Linux shell (or command line). Along the way you'll learn the timeless skills handed down by generations of experienced, mouse-shunning gurus: file navigation, environment configuration, command chaining, patte...

Price:  $25.31  |  Publisher:  No Starch Press  |  Release:  2019

Practical DataOps

Practical DataOps

by Harvinder Atwal

Gain a practical introduction to DataOps, a new discipline for delivering data science at scale inspired by practices at companies such as Facebook, Uber, LinkedIn, Twitter, and eBay. Organizations need more than the latest AI algorithms, hottest tools, and best people to turn data into insight-driven action and useful analytical data pro...

Price:  $32.99  |  Publisher:  Apress  |  Release:  2020

Command Line Fundamentals

Command Line Fundamentals

by Vivek N

The most basic interface to a computer - the command line - remains the most flexible and powerful way of processing data and performing and automating various day-to-day tasks.Command Line Fundamentals begins by exploring the basics, and then focuses on the most common tool, the Bash shell (which is standard on all Linux and iOS systems)...

Price:  $34.99  |  Publisher:  Packt Publishing  |  Release:  2018

Information Security The Complete Reference, 2nd Edition

Information Security The Complete Reference, 2nd Edition

by Mark Rhodes-Ousley

Today's complex world of mobile platforms, cloud computing, and ubiquitous data access puts new security demands on every IT professional. Information Security: The Complete Reference, 2nd Edition is the only comprehensive book that offers vendor-neutral details on all aspects of information protection, with an eye toward the evolvin...

Price:  $41.62  |  Publisher:  McGraw-Hill  |  Release:  2013

Tomcat: The Definitive Guide, 2nd Edition

Tomcat: The Definitive Guide, 2nd Edition

by Jason Brittain, Ian F. Darwin

It takes a book as versatile as its subject to cover Apache Tomcat. This book is a valuable reference for administrators and webmasters, a useful guide for programmers who want to use Tomcat as their web application server during development or in production, and an excellent introduction for anyone interested in Tomcat. The new edition o...

Price:  $4.28  |  Publisher:  O'Reilly Media  |  Release:  2007