Practical Enterprise Data Lake Insights
Handle Data-Driven Challenges in an Enterprise Big Data Lake
Price | $24.14 - $55.37
|
Rating | |
Authors | Saurabh Gupta, Venkata Giri |
Publisher | Apress |
Published | 2018 |
Pages | 327 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 1484235215 |
ISBN-13 | 9781484235218 |
Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.
When designing an enterprise data lake you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data. Starting from sourcing data into the Hadoop ecosystem, you will go through stages that can bring up tough questions such as data processing, data querying, and security. Concepts such as change data capture and data streaming are covered. The book takes an end-to-end solution approach in a data lake environment that includes data security, high availability, data processing, data streaming, and more.
Each chapter includes application of a concept, code snippets, and use case demonstrations to provide you with a practical approach. You will learn the concept, scope, application, and starting point.
Get to know data lake architecture and design principles; Implement data capture and streaming strategies; Implement data processing strategies in Hadoop; Understand the data lake security framework and availability model.
- Saurabh Gupta
- Venkata Giri
4 5 19
Similar Books
The Azure Data Lakehouse Toolkit
by Ron L'Esteve
Design and implement a modern data lakehouse on the Azure Data Platform using Delta Lake, Apache Spark, Azure Databricks, Azure Synapse Analytics, and Snowflake. This book teaches you the intricate details of the Data Lakehouse Paradigm and how to efficiently design a cloud-based data lakehouse using highly performant and cutting-edge Apa...
Price: $54.99 | Publisher: Apress | Release: 2022
by Andreas Francois Vermeulen
Learn how to build a data science technology stack and perform good data science with repeatable methods. You will learn how to turn data lakes into business assets.The data science technology stack demonstrated in Practical Data Science is built from components in general use in the industry. Data scientist Andreas Vermeulen demonstrates...
Price: $28.98 | Publisher: Apress | Release: 2018
Enterprise Data Workflows with Cascading
by Paco Nathan
There is an easier way to build Hadoop applications. With this hands-on book, you'll learn how to use Cascading, the open source abstraction framework for Hadoop that lets you easily create and manage powerful enterprise-grade data processing applications - without having to learn the intricacies of MapReduce.Working with sample apps...
Price: $27.49 | Publisher: O'Reilly Media | Release: 2013
by Benjamin Weissman, Enrico van de Laar
Use this guide to one of SQL Server 2019's most impactful features - Big Data Clusters. You will learn about data virtualization and data lakes for this complete artificial intelligence (AI) and machine learning (ML) platform within the SQL Server database engine. You will know how to use Big Data Clusters to combine large volumes of...
Price: $33.67 | Publisher: Apress | Release: 2020
by Zoiner Tejada
Microsoft Azure has over 20 platform-as-a-service (PaaS) offerings that can act in support of a big data analytics solution. So which one is right for your project? This practical book helps you understand the breadth of Azure services by organizing them into a reference framework you can use when crafting your own big data analytics solu...
Price: $22.99 | Publisher: O'Reilly Media | Release: 2017
Practical Oracle Database Appliance
by Bobby Curtis, Fuad Arshad, Erik Benner, Maris Elsins, Matt Gallagher, Pete Sharman, Yury Velikanov
Practical Oracle Database Appliance is a hands-on book taking you through the components and implementation of the Oracle Database Appliance. Learn about architecture, installation, configuration, and reconfiguration. Install and configure the Oracle Database Appliance with confidence. Make the right choices between the various configurat...
Price: $49.99 | Publisher: Apress | Release: 2014
Practical Enterprise Software Development Techniques
by Edward Crookshanks
This expanded and updated edition of "Practical Enterprise Software Development Techniques" includes a new chapter which explains what makes enterprise scale software development different from other development endeavors. Chapter 4 has been expanded with additional coverage of code review, bug tracker systems and agile ...
Price: $49.99 | Publisher: Apress | Release: 2015
Practical Python Data Wrangling and Data Quality
by Susan E. McGregor
The world around us is full of data that holds unique insights and valuable stories, and this book will help you uncover them. Whether you already work with data or want to learn more about its possibilities, the examples and techniques in this practical book will help you more easily clean, evaluate, and analyze data so that you can gene...
Price: $49.58 | Publisher: O'Reilly Media | Release: 2021