Data Quality Fundamentals
A Practitioner's Guide to Building Trustworthy Data Pipelines
Price | $41.00 - $49.49 |
Authors | Barr Moses, Lior Gavish, Molly Vorwerck |
Publisher | O'Reilly Media |
Published | 2022 |
Pages | 308 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 1098112040 |
ISBN-13 | 9781098112042 |
Do your product dashboards look funky? Are your quarterly reports stale? Is the data set you're using broken or just plain wrong? These problems affect almost every team, yet they're usually addressed on an ad hoc basis and in a reactive manner. If you answered yes to any of these questions, this book is for you.
Many data engineering teams today face the "good pipelines, bad data" problem. It doesn't matter how advanced your data infrastructure is if the data you're piping is bad. In this book, Barr Moses, Lior Gavish, and Molly Vorwerck, from the data observability company Monte Carlo, explain how to tackle data quality and trust at scale by leveraging best practices and technologies used by some of the world's most innovative companies.
Similar Books
Competing with High Quality Data
by Rajesh Jugulum
Data is rapidly becoming the powerhouse of industry, but low-quality data can actually put a company at a disadvantage. To be used effectively, data must accurately reflect the real-world scenario it represents, and it must be in a form that is usable and accessible. Quality data involves asking the right questions, targeting the correct ...
Price: $74.00 | Publisher: Wiley | Release: 2014
Data Quality Engineering in Financial Services
by Brian Buzzelli
Data quality will either make you or break you in the financial services industry. Missing prices, wrong market values, trading violations, client performance restatements, and incorrect regulatory filings can all lead to harsh penalties, lost clients, and financial disaster. This practical guide provides data analysts, data scientists, a...
Price: $38.49 | Publisher: O'Reilly Media | Release: 2022
Data Science Fundamentals for Python and MongoDB
by David Paper
Build the foundational data science skills necessary to work with and better understand complex data science algorithms. This example-driven book provides complete Python coding examples to complement and clarify data science concepts, and enrich the learning experience. Coding examples include visualizations whenever appropriate. The boo...
Price: $18.32 | Publisher: Apress | Release: 2018
JavaScript Data Structures and Algorithms
by Sammie Bae
Explore data structures and algorithm concepts and their relation to everyday JavaScript development. A basic understanding of these ideas is essential to any JavaScript developer wishing to analyze and build great software solutions. You'll discover how to implement data structures such as hash tables, linked lists, stacks, queues,...
Price: $26.76 | Publisher: Apress | Release: 2019
by Tim Costello, Lori Blackshear
Focus on the most important and most often overlooked factor in a successful Tableau project - data. Without a reliable data source, you will not achieve the results you hope for in Tableau. This book does more than teach the mechanics of data preparation. It teaches you: how to look at data in a new way, to recognize the most common issu...
Price: $32.99 | Publisher: Apress | Release: 2020
Practical Python Data Wrangling and Data Quality
by Susan E. McGregor
The world around us is full of data that holds unique insights and valuable stories, and this book will help you uncover them. Whether you already work with data or want to learn more about its possibilities, the examples and techniques in this practical book will help you more easily clean, evaluate, and analyze data so that you can gene...
Price: $49.58 | Publisher: O'Reilly Media | Release: 2021
Big Data Processing with Apache Spark
by Manuel Ignacio Franco Galeano
Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streamin...
Price: $29.99 | Publisher: Packt Publishing | Release: 2018
Exam 70-463: Implementing a Data Warehouse with Microsoft SQL Server 2012
by Grega Jerkic, Matija Lah, Dejan Sarka
Ace your preparation for Microsoft Certification Exam 70-463 with this 2-in-1 Training Kit from Microsoft Press. Work at your own pace through a series of lessons and practical exercises, and then assess your skills with online practice tests - featuring multiple, customizable testing options. Design and implement a data warehouse. Develop...
Price: $5.44 | Publisher: Microsoft Press | Release: 2012