Exploring the Data Jungle

Finding, Preparing, and Using Real-World Data



eBook: Free
Author: Brian Godsey
Publisher: Manning
Published: 2017
Pages: 101
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 161729506X
ISBN-13: 9781617295065

Some people like to believe that all data is ready to be used immediately. Not so! Data in the wild is hard to track and harder to understand, and the first job of data scientists is to identify and prepare data so it can be used. To find your way through the data jungle successfully, you need the right perspective and guidance. (There's no point hacking at overgrowth with a spoon, after all!) Identify and prepare your data well, and you'll be well set to create insight from chaos and discover important analytic patterns that set your business on the right track.

Exploring the Data Jungle: Finding, Preparing, and Using Real-World Data is a collection of three hand-picked chapters introducing you to the often-overlooked art of putting unfamiliar data to good use. Brian Godsey, author of Think Like a Data Scientist, has selected these chapters to help you navigate data in the wild and identify and prepare raw data for analysis, modeling, machine learning, or visualization. As you explore the data jungle, you'll discover real-world examples in Python, R, and other languages suitable for data science.



Similar Books


The Data Warehouse Toolkit, 3rd Edition

by Ralph Kimball, Margy Ross

The first edition of Ralph Kimball's The Data Warehouse Toolkit introduced the industry to dimensional modeling, and now his books are considered the most authoritative guides in this space. This new third edition is a complete library of updated dimensional modeling techniques, the most comprehensive collection ever. It covers new and en...

Price:  $34.85  |  Publisher:  Wiley  |  Release:  2013

The Data-Driven Project Manager

by Mario Vanhoucke

Discover solutions to common obstacles faced by project managers. Written as a business novel, the book is highly interactive, allowing readers to participate and consider options at each stage of a project. The book is based on years of experience, both through the author's research projects as well as his teaching lectures at business s...

Price:  $20.96  |  Publisher:  Apress  |  Release:  2018

Numerical Computing with Python

by Pratap Dangeti, Allen Yu, Claire Chung, Aldrin Yim, Theodore Petrou

Data mining, or parsing the data to extract useful insights, is a niche skill that can transform your career as a data scientist. Python is a flexible programming language that is equipped with a strong suite of libraries and toolkits, and gives you the perfect platform to sift through your data and mine the insights you seek. This Learnin...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

scikit-learn Cookbook

by Trent Hauck

Python is quickly becoming the go-to language for analysts and data scientists due to its simplicity and flexibility, and within the Python data space, scikit-learn is the unequivocal choice for machine learning. Its consistent API and plethora of features help solve any machine learning problem it comes across. The book starts by walking ...

Price:  $26.99  |  Publisher:  Packt Publishing  |  Release:  2014

Hands-On Cloud Solutions with Azure

by Greg Leonardo

Azure provides cloud-based solutions to support your business demands. Building and running solutions on Azure will help your business maximize the return on investment and minimize the total cost of ownership. Hands-On Cloud Solutions with Azure focuses on addressing the architectural decisions that usually arise when you design or migrat...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

PostgreSQL Server Programming

by Hannu Krosing, Kirk Roybal, Jim Mlodgenski

Learn how to work with PostgreSQL as if you spent the last decade working on it. PostgreSQL is capable of providing you with all of the options that you have in your favourite development language and then extending that right on to the database server. With this knowledge in hand, you will be able to respond to the current demand for adv...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2013

Hadoop: Beginner's Guide

by Garry Turkington

Data is arriving faster than you can process it, and the overall volumes keep growing at a rate that keeps you awake at night. Hadoop can help you tame the data beast. Effective use of Hadoop, however, requires a mixture of programming, design, and system administration skills. Hadoop Beginner's Guide removes the mystery from Hadoop, presen...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2013

Think Like a Data Scientist

by Brian Godsey

Data collected from customers, scientific measurements, IoT sensors, and so on is valuable only if you understand it. Data scientists revel in the interesting and rewarding challenge of observing, exploring, analyzing, and interpreting this data. Getting started with data science means more than mastering analytic tools and techniques, ho...

Price:  $30.96  |  Publisher:  Manning  |  Release:  2017