Practical Web Scraping for Data Science

Best Practices and Examples with Python



Bookstore > Books > Practical Web Scraping for Data Science

Practical Web Scraping for Data Science
Price$36.37 - $39.99
Rating
AuthorsSeppe vanden Broucke, Bart Baesens
PublisherApress
Published2018
Pages306
LanguageEnglish
FormatPaper book / ebook
ISBN-101484235819
ISBN-139781484235812
EBook Hardcover Paperback

This book provides a complete and modern guide to web scraping, using Python as the programming language, without glossing over important details or best practices. Written with a data science audience in mind, the book explores both scraping and the larger context of web technologies in which it operates, to ensure full understanding. The authors recommend web scraping as a powerful tool for any data scientist's arsenal, as many data science projects start by obtaining an appropriate data set.

Starting with a brief overview on scraping and real-life use cases, the authors explore the core concepts of HTTP, HTML, and CSS to provide a solid foundation. Along with a quick Python primer, they cover Selenium for JavaScript-heavy sites, and web crawling in detail. The book finishes with a recap of best practices and a collection of examples that bring together everything you've learned and illustrate various data science use cases.

Leverage well-established best practices and commonly-used Python packages; Handle today's web, including JavaScript, cookies, and common web scraping mitigation techniques; Understand the managerial and legal concerns regarding web scraping.


  1. (2 books)
  2. (2 books)



Similar Books


R for Data Science

R for Data Science

R is a powerful, open source, functional programming language. It can be used for a wide range of programming tasks and is best suited to produce data and visual analytics through customizable scripts and commands.The purpose of the book is to explore the core topics that data scientists are interested in. This book draws from a wide vari...
R for Data Science

R for Data Science

Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you ...
PHP Web Scraping

PHP Web Scraping

With the proliferation of the web, there has never been a larger body of data freely available for common use. Harvesting and processing this data can be a time consuming task if done manually. However, web scraping can provide the tools and framework to accomplish this with the click of a button. It's no wonder, then, that web scraping i...
Clojure for Data Science

Clojure for Data Science

The term "data science" has been widely used to define this new profession that is expected to interpret vast datasets and translate them to improved decision-making and performance. Clojure is a powerful language that combines the interactivity of a scripting language with the speed of a compiled language. Together with its ric...
Data Science with Java

Data Science with Java

Data Science is booming thanks to R and Python, but Java brings the robustness, convenience, and ability to scale critical to today's data science applications. With this practical book, Java software engineers looking to add data science skills will take a logical journey through the data science pipeline. Author Michael Brzustowicz expl...
Practical Data Science Cookbook

Practical Data Science Cookbook

Starting with the basics, this book will cover how to set up your numerical programming environment, introduce you to the data science pipeline (an iterative process by which data science projects are completed), and guide you through several data projects in a step-by-step format. By sequentially working through the steps in each chapter...