Exploring the Data Jungle

Finding, Preparing, and Using Real-World Data




eBook: Free
Author: Brian Godsey
Publisher: Manning
Published: 2017
Pages: 101
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 161729506X
ISBN-13: 9781617295065

Some people like to believe that all data is ready to be used immediately. Not so! Data in the wild is hard to track and harder to understand, and the first job of data scientists is to identify and prepare data so it can be used. To find your way through the data jungle successfully, you need the right perspective and guidance. (There's no point hacking at overgrowth with a spoon, after all!) Identify and prepare your data well, and you'll be well set to create insight from chaos and discover important analytic patterns that set your business on the right track.

Exploring the Data Jungle: Finding, Preparing, and Using Real-World Data is a collection of three hand-picked chapters introducing you to the often-overlooked art of putting unfamiliar data to good use. Brian Godsey, author of Think Like a Data Scientist, has selected these chapters to help you navigate data in the wild and identify and prepare raw data for analysis, modeling, machine learning, or visualization. As you explore the data jungle, you'll discover real-world examples in Python, R, and other languages suitable for data science.



Similar Books


Modernizing the Datacenter with Windows Server and Hybrid Cloud

by John McCabe, Ward Ralston

Transform your datacenter for breakthrough flexibility, agility, and scalability! Using public, private, and hybrid cloud services, you can transform your datacenter to serve fast-changing workloads, process and analyze enormous amounts of data, and achieve unprecedented flexibility and value. In this guide, two world-renowned experts in M...

Price:  $40.99  |  Publisher:  Microsoft Press  |  Release:  2019

The Data Warehouse Toolkit, 3rd Edition

by Ralph Kimball, Margy Ross

The first edition of Ralph Kimball's The Data Warehouse Toolkit introduced the industry to dimensional modeling, and now his books are considered the most authoritative guides in this space. This new third edition is a complete library of updated dimensional modeling techniques, the most comprehensive collection ever. It covers new a...

Price:  $48.99  |  Publisher:  Wiley  |  Release:  2013

The Data-Driven Project Manager

by Mario Vanhoucke

Discover solutions to common obstacles faced by project managers. Written as a business novel, the book is highly interactive, allowing readers to participate and consider options at each stage of a project. The book is based on years of experience, drawn from both the author's research projects and his teaching lectures at busin...

Price:  $20.96  |  Publisher:  Apress  |  Release:  2018

Numerical Computing with Python

by Pratap Dangeti, Allen Yu, Claire Chung, Aldrin Yim, Theodore Petrou

Data mining, or parsing the data to extract useful insights, is a niche skill that can transform your career as a data scientist. Python is a flexible programming language that is equipped with a strong suite of libraries and toolkits, and gives you the perfect platform to sift through your data and mine the insights you seek. This Learnin...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

Exploring the .NET Core 3.0 Runtime

by Roger Villela

Explore advanced .NET APIs and create a basic .NET Core library with dynamic code generation and metadata inspection to be used by other libraries or client applications. This book starts with the benefits of .NET, including its fundamental tasks and tools, where you will learn .NET SDK tools and the ILDasm tool. This is followed by a detai...

Price:  $26.28  |  Publisher:  Apress  |  Release:  2019

Learn Programming

by Antti Salonen

This book is aimed at readers who are interested in software development but have little to no prior experience. The book focuses on teaching the core principles of software development. It uses several technologies toward this goal (e.g. C, Python, JavaScript, HTML) but is not a book about the technologies themselves. The read...

Price:  $16.83  |  Free ebook  |  Publisher:  Self-publishing  |  Release:  2018

scikit-learn Cookbook

by Trent Hauck

Python is quickly becoming the go-to language for analysts and data scientists due to its simplicity and flexibility, and within the Python data space, scikit-learn is the unequivocal choice for machine learning. Its consistent API and plethora of features help solve any machine learning problem it comes across. The book starts by walking ...

Price:  $44.99  |  Publisher:  Packt Publishing  |  Release:  2014

Hands-On Cloud Solutions with Azure

by Greg Leonardo

Azure provides cloud-based solutions to support your business demands. Building and running solutions on Azure will help your business maximize the return on investment and minimize the total cost of ownership. Hands-On Cloud Solutions with Azure focuses on addressing the architectural decisions that usually arise when you design or migrat...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018