Stream Processing with Apache Flink
Fundamentals, Implementation, and Operation of Streaming Applications
Price | $32.99 - $49.60
|
Rating | |
Authors | Fabian Hueske, Vasiliki Kalavri |
Publisher | O'Reilly Media |
Published | 2018 |
Pages | 200 |
Language | English |
Format | Paper book / ebook (PDF) |
ISBN-10 | 149197429X |
ISBN-13 | 9781491974292 |
Get started with Apache Flink, the open source framework that enables you to process streaming data - such as user interactions, sensor data, and machine logs - as it arrives. With this practical guide, you'll learn how to use Apache Flink's stream processing APIs to implement, continuously run, and maintain real-world applications.
Authors Fabian Hueske, one of Flink's creators, and Vasia Kalavri, a core contributor to Flink's graph processing API (Gelly), explains the fundamental concepts of parallel stream processing and shows you how streaming analytics differs from traditional batch data analysis. Software engineers, data engineers, and system administrators will learn the basics of Flink's DataStream API, including the structure and components of a common Flink streaming application.
- Solve real-world problems with Apache Flink's DataStream API;
- Set up an environment for developing stream processing applications for Flink;
- Design streaming applications and migrate periodic batch workloads to continuous streaming workloads;
- Learn about windowed operations that process groups of records;
- Ingest data streams into a DataStream application and emit a result stream into different storage systems;
- Implement stateful and custom operators common in stream processing applications;
- Operate, maintain, and update continuously running Flink streaming applications;
- Explore several deployment options, including the setup of highly available installations.
- Fabian Hueske
- Vasiliki Kalavri
5 5 140
Similar Books
Designing Event-Driven Systems
by Ben Stopford
Many forces affect software today: larger datasets, geographical disparities, complex company structures, and the growing need to be fast and nimble in the face of change. Proven approaches such as service-oriented and event-driven architectures are joined by newer techniques such as microservices, reactive architectures, DevOps, and stre...
Free ebook | Publisher: O'Reilly Media | Release: 2018
Apache Spark 2: Data Processing and Real-Time Analytics
by Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei
Apache Spark is an in-memory, cluster-based data processing system that provides a wide range of functionalities such as big data processing, analytics, machine learning, and more. With this Learning Path, you can take your knowledge of Apache Spark to the next level by learning how to expand Spark's functionality and building your o...
Price: $49.99 | Publisher: Packt Publishing | Release: 2018
by Mike Frampton
Apache Spark is an in-memory cluster based parallel processing system that provides a wide range of functionality like graph processing, machine learning, stream processing and SQL. It operates at unprecedented speeds, is easy to use and offers a rich set of data transformations.This book aims to take your limited knowledge of Spark to th...
Price: $43.99 | Publisher: Packt Publishing | Release: 2015
Modern Data Engineering with Apache Spark
by Scott Haines
Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data inges...
Price: $46.38 | Publisher: Apress | Release: 2022
Storm Real-time Processing Cookbook
by Quinton Anderson
Storm is a free and open source distributed real-time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun to use!Storm Real Time Processing Cookbook will ha...
Price: $29.99 | Publisher: Packt Publishing | Release: 2013
by Ankit Jain, Anand Nalya
Starting with the very basics of Storm, you will learn how to set up Storm on a single machine and move on to deploying Storm on your cluster. You will understand how Kafka can be integrated with Storm using the Kafka spout.You will then proceed to explore the Trident abstraction tool with Storm to perform stateful stream processing, guar...
Price: $8.99 | Publisher: Packt Publishing | Release: 2014
Big Data Processing with Apache Spark
by Manuel Ignacio Franco Galeano
Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streamin...
Price: $29.99 | Publisher: Packt Publishing | Release: 2018
by Arun C. Murthy, Vinod Kumar Vavilapalli, Doug Eadline, Joseph Niemiec, Jeff Markham
Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to...
Price: $4.49 | Publisher: Addison-Wesley | Release: 2014