Streaming Systems

The What, Where, When, and How of Large-Scale Data Processing



Bookstore > Books > Streaming Systems

Price$52.52 - $68.22
Rating
AuthorsTyler Akidau, Slava Chernyak, Reuven Lax
PublisherO'Reilly Media
Published2018
Pages352
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101491983876
ISBN-139781491983874
EBook Hardcover Paperback

Streaming data is a big deal in big data these days. As more and more businesses seek to tame the massive unbounded data sets that pervade our world, streaming systems have finally reached a level of maturity sufficient for mainstream adoption. With this practical guide, data engineers, data scientists, and developers will learn how to work with streaming data in a conceptual and platform-agnostic way.

Expanded from Tyler Akidau's popular blog posts "Streaming 101" and "Streaming 102", this book takes you from an introductory level to a nuanced understanding of the what, where, when, and how of processing real-time data streams. You'll also dive deep into watermarks and exactly-once processing with co-authors Slava Chernyak and Reuven Lax.

How streaming and batch data processing patterns compare; The core principles and concepts behind robust out-of-order data processing; How watermarks track progress and completeness in infinite datasets; How exactly-once data processing techniques ensure correctness; How the concepts of streams and tables form the foundations of both batch and streaming data processing; The practical motivations behind a powerful persistent state mechanism, driven by a real-world example; How time-varying relations provide a link between stream processing and the world of SQL and relational algebra.




4 5 247

Similar Books


Grokking Streaming Systems

Grokking Streaming Systems

by Josh Fischer, Ning Wang

Grokking Streaming Systems is a simple guide to the complex concepts behind streaming systems. This friendly and framework-agnostic tutorial teaches you how to handle real-time events, and even design and build your own streaming job that's a perfect fit for your needs. Each new idea is carefully explained with diagrams, clear exampl...

Price:  $59.99  |  Publisher:  Manning  |  Release:  2022

Foundations of Scalable Systems

Foundations of Scalable Systems

by Ian Gorton

In many systems, scalability becomes the primary driver as the user base grows. Attractive features and high utility breed success, which brings more requests to handle and more data to manage. But organizations reach a tipping point when design decisions that made sense under light loads suddenly become technical debt. This practical boo...

Price:  $44.93  |  Publisher:  O'Reilly Media  |  Release:  2022

Designing Event-Driven Systems

Designing Event-Driven Systems

by Ben Stopford

Many forces affect software today: larger datasets, geographical disparities, complex company structures, and the growing need to be fast and nimble in the face of change. Proven approaches such as service-oriented and event-driven architectures are joined by newer techniques such as microservices, reactive architectures, DevOps, and stre...

Free ebook  |  Publisher:  O'Reilly Media  |  Release:  2018

Introduction to Embedded Systems, 2nd Edition

Introduction to Embedded Systems, 2nd Edition

by Edward Ashford Lee, Sanjit Arunkumar Seshia

An introduction to the engineering principles of embedded systems, with a focus on modeling, design, and analysis of cyber-physical systems.The most visible use of computers and software is processing information for human consumption. The vast majority of computers in use, however, are much less visible. They run the engine, brakes, seat...

Price:  $18.66  |  Free ebook  |  Publisher:  Self-publishing  |  Release:  2017

Real-Time Analytics

Real-Time Analytics

by Byron Ellis

Real-time analytics is the hottest topic in data analytics today. In Real-Time Analytics - expert Byron Ellis teaches data analysts technologies to build an effective real-time analytics platform. This platform can then be used to make sense of the constantly changing data that is beginning to outpace traditional batch-based analysis plat...

Price:  $33.90  |  Publisher:  Wiley  |  Release:  2014

Design Technologies for Green and Sustainable Computing Systems

Design Technologies for Green and Sustainable Computing Systems

by Partha Pratim Pande, Amlan Ganguly, Krishnendu Chakrabarty

This book provides a comprehensive guide to the design of sustainable and green computing systems (GSC). Coverage includes important breakthroughs in various aspects of GSC, including multi-core architectures, interconnection technology, data centers, high-performance computing (HPC), and sensor networks. The authors address the challenge...

Price:  $19.95  |  Publisher:  Springer  |  Release:  2013

Stream Processing with Apache Flink

Stream Processing with Apache Flink

by Fabian Hueske, Vasiliki Kalavri

Get started with Apache Flink, the open source framework that enables you to process streaming data - such as user interactions, sensor data, and machine logs - as it arrives. With this practical guide, you'll learn how to use Apache Flink's stream processing APIs to implement, continuously run, and maintain real-world applicati...

Price:  $32.99  |  Publisher:  O'Reilly Media  |  Release:  2018

Flow Architectures

Flow Architectures

by James Urquhart

Software development today is embracing events and streaming data, which optimizes not only how technology interacts but also how businesses integrate with one another to meet customer needs. This phenomenon, called flow, consists of patterns and standards that determine which activity and related data is communicated between parties over...

Price:  $38.92  |  Publisher:  O'Reilly Media  |  Release:  2021