Data Science at the Command Line, 2nd Edition
Obtain, Scrub, Explore, and Model Data with Unix Power Tools

Price | $48.49 - $59.99 |
Author | Jeroen Janssens |
Publisher | O'Reilly Media |
Published | 2021 |
Pages | 282 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 1492087912 |
ISBN-13 | 9781492087915 |
This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux.
You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, engineers, system administrators, and researchers.
Obtain data from websites, APIs, databases, and spreadsheets; perform scrub operations on text, CSV, HTML, XML, and JSON files; explore data, compute descriptive statistics, and create visualizations; manage your data science workflow; create reusable command-line tools from one-liners and existing Python or R code; parallelize and distribute data-intensive pipelines; model data with dimensionality reduction, clustering, regression, and classification algorithms; leverage the command line from Python, Jupyter, R, RStudio, and Apache Spark.
Source Code:
→ https://github.com/jeroenjanssens/data-science-at-the-command-line/archive/refs/heads/master.zip
Similar Books
Data Science at the Command Line
by Jeroen Janssens
This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. To get you started - whether you're on Windows, OS X, or Linux - a...
Price: $7.67 | Publisher: O'Reilly Media | Release: 2014
Efficient Linux at the Command Line
by Daniel J. Barrett
Take your Linux skills to the next level! Whether you're a system administrator, software developer, site reliability engineer, or enthusiastic hobbyist, this practical, hands-on book will help you work faster, smarter, and more efficiently. You'll learn how to create and run complex commands that solve real business problems, p...
Price: $37.10 | Publisher: O'Reilly Media | Release: 2022
Beginning the Linux Command Line, 2nd edition
by Sander van Vugt
This is Linux for those of us who don't mind typing. All Linux users and administrators tend to like the flexibility and speed of Linux administration from the command line in byte-sized chunks, instead of fairly standard graphical user interfaces. Beginning the Linux Command Line is verified against all of the most important Linux d...
Price: $33.04 | Publisher: Apress | Release: 2015
The Linux Command Line, 2nd Edition
by William Shotts
The Linux Command Line takes you from your very first terminal keystrokes to writing full programs in Bash, the most popular Linux shell (or command line). Along the way you'll learn the timeless skills handed down by generations of experienced, mouse-shunning gurus: file navigation, environment configuration, command chaining, patte...
Price: $25.31 | Publisher: No Starch Press | Release: 2019
by Harvinder Atwal
Gain a practical introduction to DataOps, a new discipline for delivering data science at scale inspired by practices at companies such as Facebook, Uber, LinkedIn, Twitter, and eBay. Organizations need more than the latest AI algorithms, hottest tools, and best people to turn data into insight-driven action and useful analytical data pro...
Price: $32.99 | Publisher: Apress | Release: 2020
Command Line Fundamentals
by Vivek N
The most basic interface to a computer - the command line - remains the most flexible and powerful way of processing data and performing and automating various day-to-day tasks. Command Line Fundamentals begins by exploring the basics, and then focuses on the most common tool, the Bash shell (which is standard on all Linux and iOS systems)...
Price: $34.99 | Publisher: Packt Publishing | Release: 2018
Information Security: The Complete Reference, 2nd Edition
by Mark Rhodes-Ousley
Today's complex world of mobile platforms, cloud computing, and ubiquitous data access puts new security demands on every IT professional. Information Security: The Complete Reference, 2nd Edition is the only comprehensive book that offers vendor-neutral details on all aspects of information protection, with an eye toward the evolvin...
Price: $41.62 | Publisher: McGraw-Hill | Release: 2013
Data Science with SQL Server Quick Start Guide
by Dejan Sarka
SQL Server only started to fully support data science with its two most recent editions. If you are a professional from both worlds, SQL Server and data science, and interested in using SQL Server and Machine Learning (ML) Services for your projects, then this is the ideal book for you. This book is the ideal introduction to data science w...
Price: $34.99 | Publisher: Packt Publishing | Release: 2018