Streaming Data Mesh
by Hubert Dulay, Stephen Mooney
Data lakes and warehouses have become increasingly fragile, costly, and difficult to maintain as data gets bigger and moves faster. Data meshes can help your organization decentralize data, giving ownership back to the engineers who produced it. This book provides a concise yet comprehensive overview of data mesh patterns for streaming and real-time data services. Authors Hubert Dulay and Stephen Mooney exam...
Price: $41.99 | Publisher: O'Reilly Media | Release: 2023
Kafka in Action
by Dylan Scott, Viktor Gamov, Dave Klein
Kafka in Action is a fast-paced introduction to every aspect of working with Apache Kafka. Starting with an overview of Kafka's core concepts, you'll immediately learn how to set up and execute basic data movement tasks and how to produce and consume streams of events. Advancing quickly, you'll soon be ready to use Kafka in your day-to-day workflow, and start digging into even more advanced K...
Price: $44.99 | Publisher: Manning | Release: 2022
Trino: The Definitive Guide, 2nd Edition
by Matt Fuller, Manfred Moser, Martin Traverso
Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. In the second edition of this practical guide, you'll learn how to conduct analytics on data where it lives, whether it's a data lake using Hive, a modern lakehouse with Iceberg or Delta Lake, a different system like Cassandra, Kafka, or SingleStore, or a relational data...
Price: $52.99 | Publisher: O'Reilly Media | Release: 2022
Modern Data Engineering with Apache Spark
by Scott Haines
Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire loc...
Price: $46.38 | Publisher: Apress | Release: 2022
Grokking Streaming Systems
by Josh Fischer, Ning Wang
Grokking Streaming Systems is a simple guide to the complex concepts behind streaming systems. This friendly and framework-agnostic tutorial teaches you how to handle real-time events, and even design and build your own streaming job that's a perfect fit for your needs. Each new idea is carefully explained with diagrams, clear examples, and fun dialogue between perplexed personalities! Streaming systems...
Price: $59.99 | Publisher: Manning | Release: 2022
FREE EBOOK - Kafka: The Definitive Guide, 2nd Edition
by Gwen Shapira, Todd Palino, Rajini Sivaram, Krit Petty
Every enterprise application creates data, whether it consists of log messages, metrics, user activity, or outgoing messages. Moving all this data is just as important as the data itself. With this updated edition, application architects, developers, and production engineers new to the Kafka streaming platform will learn how to handle data in motion. Additional chapters cover Kafka's AdminClient API, t...
Price: $34.29 | Publisher: O'Reilly Media | Release: 2021
Mastering Kafka Streams and ksqlDB
by Mitch Seymour
Working with unbounded and fast-moving data streams has historically been difficult. But with Kafka Streams and ksqlDB, building stream processing applications is easy and fun. This practical guide shows data engineers how to use these tools to build highly scalable stream processing applications for moving, enriching, and transforming large amounts of data in real time. Mitch Seymour, data services engineer...
Price: $52.88 | Publisher: O'Reilly Media | Release: 2021
Reactive Systems in Java
by Clement Escoffier, Ken Finnigan
Reactive systems and event-driven architecture are becoming indispensable to application design, and companies are taking note. Reactive systems ensure that applications are responsive, resilient, and elastic no matter what failures or errors may be occurring, while event-driven architecture offers a flexible and composable option for distributed systems. This practical book helps Java developers bring thes...
Price: $54.20 | Publisher: O'Reilly Media | Release: 2021
Designing Cloud Data Platforms
by Danil Zburivsky, Lynda Partner
Centralized data warehouses, the long-time de facto standard for housing data for analytics, are rapidly giving way to multi-faceted cloud data platforms. Companies that embrace modern cloud data platforms benefit from an integrated view of their business using all of their data and can take advantage of advanced analytic practices to drive predictions and as yet unimagined data services. Designing Cloud Dat...
Price: $39.99 | Publisher: Manning | Release: 2021
Apache Pulsar in Action
by David Kjerrumgaard
Apache Pulsar in Action is a comprehensive and practical guide to building high-traffic applications with Pulsar. You'll learn to use this mature and battle-tested platform to deliver extreme levels of speed and durability to your messaging. Apache Pulsar committer David Kjerrumgaard teaches you to apply Pulsar's seamless scalability through hands-on case studies, including IoT analytics applicati...
Price: $46.99 | Publisher: Manning | Release: 2021