Apache Books



Programmer's Guide to Apache Thrift

by Randy Abernethy

Programmer's Guide to Apache Thrift provides comprehensive coverage of the Apache Thrift framework along with a developer's-eye view of modern distributed application architecture. Thrift-based distributed software systems are built out of communicating components that use different languages, protocols, and message types. Sitting between them is Thrift, which handles data serialization, transport, and servi...

Price:  $53.61  |  Publisher:  Manning  |  Release:  2019

Apache Kafka Quick Start Guide

by Raul Estrada

Apache Kafka is a great open source platform for handling your real-time data pipeline to ensure high-speed filtering and pattern matching on the fly. In this book, you will learn how to use Apache Kafka for efficient processing of distributed applications and will get familiar with solving everyday problems in fast data and processing pipelines. This book focuses on programming rather than the configuratio...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Apache Spark 2: Data Processing and Real-Time Analytics

by Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei

Apache Spark is an in-memory, cluster-based data processing system that provides a wide range of functionalities such as big data processing, analytics, machine learning, and more. With this Learning Path, you can take your knowledge of Apache Spark to the next level by learning how to expand Spark's functionality and building your own data flow and machine learning programs on this platform. You will work w...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

Next-Generation Big Data

by Butch Quinto

Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used fo...

Price:  $33.51  |  Publisher:  Apress  |  Release:  2018

Beginning Apache Spark 2

by Hien Luu

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it. Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data;...

Price:  $25.33  |  Publisher:  Apress  |  Release:  2018

Practical Apache Spark

by Subhashini Chellappan, Dharanitharan Ganesan

Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLlib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You'l...

Price:  $31.66  |  Publisher:  Apress  |  Release:  2018

Stream Processing with Apache Flink

by Fabian Hueske, Vasiliki Kalavri

Get started with Apache Flink, the open source framework that enables you to process streaming data - such as user interactions, sensor data, and machine logs - as it arrives. With this practical guide, you'll learn how to use Apache Flink's stream processing APIs to implement, continuously run, and maintain real-world applications. Authors Fabian Hueske, one of Flink's creators, and Vasia Kalavri, a core co...

Price:  $47.52  |  Publisher:  O'Reilly Media  |  Release:  2018

Learning Apache Drill

by Paul Rogers, Charles Givre

...

Price:  $36.92  |  Publisher:  O'Reilly Media  |  Release:  2018

Apache Superset Quick Start Guide

by Shashank Shekhar

Apache Superset is a modern, open source, enterprise-ready business intelligence (BI) web application. With the help of this book, you will see how Superset integrates with popular databases like Postgres, Google BigQuery, Snowflake, and MySQL. You will learn to create real-time data visualizations and dashboards on modern web browsers for your organization using Superset. First, we look at the fundamentals ...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Apache Hadoop 3 Quick Start Guide

by Hrishikesh Karambelkar

Apache Hadoop is a widely used distributed data platform. It enables large datasets to be efficiently processed instead of using one large computer to store and process the data. This book will get you started with the Hadoop ecosystem, and introduce you to the main technical topics, including MapReduce, YARN, and HDFS. The book begins with an overview of big data and Apache Hadoop. Then, you will set up a p...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018
