Bad Data Handbook

Mapping the World of Data Problems



Bookstore > Books > Bad Data Handbook

Price$25.65 - $50.53
Rating
AuthorQ. Ethan McCallum
PublisherO'Reilly Media
Published2012
Pages264
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101449321887
ISBN-139781449321888
EBook Hardcover Paperback

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they've recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.


  1. (2 books)


4 5 226

Similar Books


Data Quality Fundamentals

Data Quality Fundamentals

by Barr Moses, Lior Gavish, Molly Vorwerck

Do your product dashboards look funky? Are your quarterly reports stale? Is the data set you're using broken or just plain wrong? These problems affect almost every team, yet they're usually addressed on an ad hoc basis and in a reactive manner. If you answered yes to these questions, this book is for you.Many data engineering t...

Price:  $41.00  |  Publisher:  O'Reilly Media  |  Release:  2022

Prepare Your Data for Tableau

Prepare Your Data for Tableau

by Tim Costello, Lori Blackshear

Focus on the most important and most often overlooked factor in a successful Tableau project - data. Without a reliable data source, you will not achieve the results you hope for in Tableau. This book does more than teach the mechanics of data preparation. It teaches you: how to look at data in a new way, to recognize the most common issu...

Price:  $32.99  |  Publisher:  Apress  |  Release:  2020

Semantic Modeling for Data

Semantic Modeling for Data

by Panos Alexopoulos

What value does semantic data modeling offer? As an information architect or data science professional, let's say you have an abundance of the right data and the technology to extract business gold - but you still fail. The reason? Bad data semantics.In this practical and comprehensive field guide, author Panos Alexopoulos takes you ...

Price:  $56.99  |  Publisher:  O'Reilly Media  |  Release:  2020

Machine Learning for Kids

Machine Learning for Kids

by Dale Lane

Artificial intelligence (AI) is the ability of computers to simulate human thinking. Machine learning (ML) is one of the building blocks of AI. It's based on the idea that computers can be taught to do things on their own from the data and feedback you give them.Machine Learning for Kids consists of this book and a kid-friendly compa...

Price:  $13.37  |  Publisher:  No Starch Press  |  Release:  2021

Big Data Architect's Handbook

Big Data Architect's Handbook

by Syed Muhammad Fahad Akhtar

The big data architects are the “masters” of data, and hold high value in today's market. Handling big data, be it of good or bad quality, is not an easy task. The prime job for any big data architect is to build an end-to-end big data solution that integrates data from different sources and analyzes it to find useful, hidden ins...

Price:  $54.99  |  Publisher:  Packt Publishing  |  Release:  2018

Python Data Science Handbook, 2nd Edition

Python Data Science Handbook, 2nd Edition

by Jake VanderPlas

Python is a first-class tool for many researchers, primarily because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the new edition of Python Data Science Handbook do you get them all - IPython, NumPy, pandas, Matplotlib, sc...

Price:  $56.99  |  Publisher:  O'Reilly Media  |  Release:  2022

Python Data Science Handbook

Python Data Science Handbook

by Jake VanderPlas

For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all - IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and ot...

Price:  $54.31  |  Publisher:  O'Reilly Media  |  Release:  2016

Oracle Database 12c Release 2 Real Application Clusters Handbook

Oracle Database 12c Release 2 Real Application Clusters Handbook

by K. Gopalakrishnan, Sam R. Alapati

Through clear instruction and detailed examples, Oracle Database 12c Real Application Clusters Handbook: Concepts, Administration, Tuning & Troubleshooting teaches how to build, configure, and maintain a dynamic enterprise computing infrastructure. This thoroughly revised edition covers best uses for the latest tools and features - al...

Price:  $38.21  |  Publisher:  McGraw-Hill  |  Release:  2018