Ruby Data Processing

Using Map, Reduce, and Select



Bookstore > Books > Ruby Data Processing

Ruby Data Processing
Buy
Preview
Price$19.99 - $28.84
Rating
AuthorJay Godse
PublisherApress
Published2018
Pages98
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101484234731
ISBN-139781484234730
EBook Hardcover Paperback

Gain the basics of Ruby's map, reduce, and select functions and discover how to use them to solve data-processing problems. This compact hands-on book explains how you can encode certain complex programs in 10 lines of Ruby code, an astonishingly small number. You will walk through problems and solutions which are effective because they use map, reduce, and select. As you read Ruby Data Processing, type in the code, run the code, and ponder the results. Tweak the code to test the code and see how the results change.

After reading this book, you will have a deeper understanding of how to break data-processing problems into processing stages, each of which is understandable, debuggable, and composable, and how to combine the stages to solve your data-processing problem. As a result, your Ruby coding will become more efficient and your programs will be more elegant and robust.

Discover Ruby data processing and how to do it using the map, reduce, and select functions; Develop complex solutions including debugging, randomizing, sorting, grouping, and more; Reverse engineer complex data-processing solutions.




5 5 2

Similar Books


Fast Data Processing with Spark

Fast Data Processing with Spark

Spark is a framework for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and inbuilt tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and ...
Storm Blueprints: Patterns for Distributed Real-time Computation

Storm Blueprints: Patterns for Distributed Real-time Computation

Storm is the most popular framework for real-time stream processing. Storm provides the fundamental primitives and guarantees required for fault-tolerant distributed computing in high-volume, mission critical applications. It is both an integration technology as well as a data flow and control mechanism, making it the core of many big dat...
Mastering Hadoop

Mastering Hadoop

Hadoop is synonymous with Big Data processing. Its simple programming model, "code once and deploy at any scale" paradigm, and an ever-growing ecosystem makes Hadoop an all-encompassing platform for programmers with different levels of expertise.This book explores the industry guidelines to optimize MapReduce jobs and hi...
Fast Data Processing with Spark, 2nd Edition

Fast Data Processing with Spark, 2nd Edition

Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (G...
Storm Real-time Processing Cookbook

Storm Real-time Processing Cookbook

Storm is a free and open source distributed real-time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun to use!Storm Real Time Processing Cookbook will ha...
Enterprise Data Workflows with Cascading

Enterprise Data Workflows with Cascading

There is an easier way to build Hadoop applications. With this hands-on book, you'll learn how to use Cascading, the open source abstraction framework for Hadoop that lets you easily create and manage powerful enterprise-grade data processing applications - without having to learn the intricacies of MapReduce.Working with sample apps base...