Data Science at the Command Line, 2nd Edition
Obtain, Scrub, Explore, and Model Data with Unix Power Tools
Price | $54.85 - $59.99
|
Rating | |
Author | Jeroen Janssens |
Publisher | O'Reilly Media |
Published | 2021 |
Pages | 282 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 1492087912 |
ISBN-13 | 9781492087915 |
This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools-useful whether you work with Windows, macOS, or Linux.
You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, engineers, system administrators, and researchers.
Obtain data from websites, APIs, databases, and spreadsheets; Perform scrub operations on text, CSV, HTM, XML, and JSON files; Explore data, compute descriptive statistics, and create visualizations; Manage your data science workflow; Create reusable command-line tools from one-liners and existing Python or R code; Parallelize and distribute data-intensive pipelines; Model data with dimensionality reduction, clustering, regression, and classification algorithms; Leverage the command line from Python. Jupyter. R. RStudio. and Apache Spark.
Source Code:
→ https://github.com/jeroenjanssens/data-science-at-the-command-line/archive/refs/heads/master.zip
- Jeroen Janssens (2 books)
4 5 28
Similar Books
Efficient Linux at the Command Line
by Daniel J. Barrett
Take your Linux skills to the next level! Whether you're a system administrator, software developer, site reliability engineer, or enthusiastic hobbyist, this practical, hands-on book will help you work faster, smarter, and more efficiently. You'll learn how to create and run complex commands that solve real business problems, p...
Price: $37.10 | Publisher: O'Reilly Media | Release: 2022
Data Science at the Command Line
by Jeroen Janssens
This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data.To get you started - whether you're on Windows, OS X, or Linux - a...
Price: $7.67 | Publisher: O'Reilly Media | Release: 2014
Beginning the Linux Command Line, 2nd edition
by Sander van Vugt
This is Linux for those of us who don't mind typing. All Linux users and administrators tend to like the flexibility and speed of Linux administration from the command line in byte-sized chunks, instead of fairly standard graphical user interfaces. Beginning the Linux Command Line is verified against all of the most important Linux d...
Price: $33.04 | Publisher: Apress | Release: 2015
The Linux Command Line, 2nd Edition
by William Shotts
The Linux Command Line takes you from your very first terminal keystrokes to writing full programs in Bash, the most popular Linux shell (or command line). Along the way you'll learn the timeless skills handed down by generations of experienced, mouse-shunning gurus: file navigation, environment configuration, command chaining, patte...
Price: $25.31 | Publisher: No Starch Press | Release: 2019
by Harvinder Atwal
Gain a practical introduction to DataOps, a new discipline for delivering data science at scale inspired by practices at companies such as Facebook, Uber, LinkedIn, Twitter, and eBay. Organizations need more than the latest AI algorithms, hottest tools, and best people to turn data into insight-driven action and useful analytical data pro...
Price: $32.99 | Publisher: Apress | Release: 2020
by Vivek N
The most basic interface to a computer - the command line - remains the most flexible and powerful way of processing data and performing and automating various day-to-day tasks.Command Line Fundamentals begins by exploring the basics, and then focuses on the most common tool, the Bash shell (which is standard on all Linux and iOS systems)...
Price: $34.99 | Publisher: Packt Publishing | Release: 2018
Information Security The Complete Reference, 2nd Edition
by Mark Rhodes-Ousley
Today's complex world of mobile platforms, cloud computing, and ubiquitous data access puts new security demands on every IT professional. Information Security: The Complete Reference, 2nd Edition is the only comprehensive book that offers vendor-neutral details on all aspects of information protection, with an eye toward the evolvin...
Price: $41.62 | Publisher: McGraw-Hill | Release: 2013
Tomcat: The Definitive Guide, 2nd Edition
by Jason Brittain, Ian F. Darwin
It takes a book as versatile as its subject to cover Apache Tomcat. This book is a valuable reference for administrators and webmasters, a useful guide for programmers who want to use Tomcat as their web application server during development or in production, and an excellent introduction for anyone interested in Tomcat. The new edition o...
Price: $4.28 | Publisher: O'Reilly Media | Release: 2007