In-Memory Analytics with Apache Arrow

Perform fast and efficient data analytics on both flat and hierarchical structured data




Price: $44.99 - $78.30
Author: Matthew Topol
Publisher: Packt Publishing
Published: 2022
Pages: 392
Language: English
Format: Paper book / ebook (PDF)
ISBN-10: 1801071039
ISBN-13: 9781801071031

Apache Arrow is designed to accelerate analytics and make it easy to exchange data across big data systems.

In-Memory Analytics with Apache Arrow begins with a quick overview of the Apache Arrow format, then shows Arrow's versatility and benefits through a variety of real-world use cases. You'll cover key tasks such as enhancing data science workflows with Arrow, using Arrow and Apache Parquet with Apache Spark and Jupyter for better performance and hassle-free data translation, and working with Perspective, an open source interactive graphical and tabular analysis tool for browsers. As you advance, you'll explore the different data interchange and storage formats and learn the relationships between Arrow, Parquet, Feather, Protobuf, FlatBuffers, JSON, and CSV. In addition to understanding the basic structure of the Arrow Flight and Flight SQL protocols, you'll learn how Dremio uses Apache Arrow to enhance SQL analytics and discover how Arrow can be used in browser-based web apps. Finally, you'll look at upcoming Arrow features to help you stay ahead of the curve.

By the end of this book, you will have all the building blocks to create useful, efficient, and powerful analytical services and utilities with Apache Arrow.





Similar Books


Practical Graph Analytics with Apache Giraph

by Claudio Martella, Dionysios Logothetis, Roman Shaposhnik

Practical Graph Analytics with Apache Giraph helps you build data mining and machine learning applications using the Apache Foundation's Giraph framework for graph processing. This is the same framework as used by Facebook, Google, and other social media analytics operations to derive business value from vast amounts of interconnecte...

Price:  $37.67  |  Publisher:  Apress  |  Release:  2015

First Semester in Numerical Analysis with Python

by Yaning Liu

The book is based on "First semester in Numerical Analysis with Julia". The contents of the original book are retained, while all the algorithms are implemented in Python (Version 3.8.0). Python is an open source (under OSI), interpreted, general-purpose programming language that has a large number of users around the world. Pyt...

Free ebook  |  Publisher:  Self-publishing  |  Release:  2020

Foundations for Analytics with Python

by Clinton Brownley

If you're like many of Excel's 750 million users, you want to do more with your data - like repeating similar analyses over hundreds of files, or combining data in many files for analysis at one time. This practical guide shows ambitious non-programmers how to automate and scale the processing and analysis of data in different f...

Price:  $35.99  |  Publisher:  O'Reilly Media  |  Release:  2016

Apache Spark 2: Data Processing and Real-Time Analytics

by Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei

Apache Spark is an in-memory, cluster-based data processing system that provides a wide range of functionalities such as big data processing, analytics, machine learning, and more. With this Learning Path, you can take your knowledge of Apache Spark to the next level by learning how to expand Spark's functionality and building your o...

Price:  $49.99  |  Publisher:  Packt Publishing  |  Release:  2018

Predictive Analytics with Microsoft Azure Machine Learning

by Roger Barga, Valentine Fontama, Wee Hyong Tok

Data Science and Machine Learning are in high demand, as customers are increasingly looking for ways to glean insights from all their data. More customers now realize that Business Intelligence is not enough as the volume, speed and complexity of data now defy traditional analytics tools. While Business Intelligence addresses descriptive ...

Price:  $24.59  |  Publisher:  Apress  |  Release:  2014

Mastering Apache Spark

by Mike Frampton

Apache Spark is an in-memory, cluster-based parallel processing system that provides a wide range of functionality like graph processing, machine learning, stream processing and SQL. It operates at unprecedented speeds, is easy to use and offers a rich set of data transformations. This book aims to take your limited knowledge of Spark to th...

Price:  $43.99  |  Publisher:  Packt Publishing  |  Release:  2015

Big Data Analytics with R and Hadoop

by Vignesh Prajapati

Big data analytics is the process of examining large amounts of data of a variety of types to uncover hidden patterns, unknown correlations, and other useful information. Such information can provide competitive advantages over rival organizations and result in business benefits, such as more effective marketing and increased revenue. New...

Price:  $5.77  |  Publisher:  Packt Publishing  |  Release:  2013

Introduction to Search with Sphinx

by Andrew Aksyonoff

This concise introduction to Sphinx shows you how to use this free software to index an enormous number of documents and provide fast results to both simple and complex searches. Written by the creator of Sphinx, this authoritative book is short and to the point....

Price:  $16.70  |  Publisher:  O'Reilly Media  |  Release:  2011