by Edward Capriolo, Dean Wampler, Jason Rutherglen
Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop's data warehouse infrastructure. You'll quickly learn how to use Hive's SQL dialect - HiveQL - to summarize, query, and analyze large datasets stored in Hadoop's distributed filesystem. This example-driven guide shows you how to set up and configure Hive in your environ...
Price: $24.98 | Publisher: O'Reilly Media | Release: 2012
by Nick Dimiduk, Amandeep Khurana
HBase is a NoSQL storage system designed for fast, random access to large volumes of data. It runs on commodity hardware and scales smoothly from modest datasets to billions of rows and millions of columns. HBase in Action is an experience-driven guide that shows you how to design, build, and run applications using HBase. First, it introduces you to the fundamentals of handling big data. Then, you'll ex...
Price: $8.99 | Publisher: Manning | Release: 2012
Ruby and MongoDB Web Development
by Gautam Rege
Step-by-step instructions and practical examples for creating web applications with Ruby and MongoDB. Learn to design the object model in a NoSQL way. Create objects in Ruby and map them to MongoDB. Learn about Mongoid and MongoMapper for mapping Ruby objects to MongoDB documents. Process large datasets with MapReduce. Create geo-spatial indexes or 2D indexes....
Price: $26.99 | Publisher: Packt Publishing | Release: 2012
Writing and Querying MapReduce Views in CouchDB
by Bradley Holt
Learn how to create MapReduce views in CouchDB that let you query the document-oriented database for meaningful data. With this short and concise ebook, you'll get step-by-step instructions and lots of sample code to create and explore several MapReduce views, using an example database you construct....
Price: $16.99 | Publisher: O'Reilly Media | Release: 2011
by Pete Warden
To help you navigate the large number of new data tools available, this guide describes 60 of the most recent innovations, from NoSQL databases and MapReduce approaches to machine learning and visualization tools. Descriptions are based on first-hand experience with these tools in a production environment. This handy glossary also includes a chapter of key terms that help define many of these tool categories...
Price: $14.99 | Publisher: O'Reilly Media | Release: 2011
by J. Chris Anderson, Jan Lehnardt, Noah Slater
Three of CouchDB's creators show you how to use this document-oriented database as a standalone application framework or with high-volume, distributed applications. With its simple model for storing, processing, and accessing data, CouchDB is ideal for web applications that handle huge amounts of loosely structured data. You'll learn how to work with CouchDB through its RESTful web interface, and ...
Price: $28.74 | Publisher: O'Reilly Media | Release: 2010
Hadoop: The Definitive Guide, 2nd Edition
by Tom White
Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework - an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoo...
Price: $4.22 | Publisher: O'Reilly Media | Release: 2010
by Jason Venner
You've heard the hype about Hadoop: it runs petabyte-scale data mining tasks insanely fast, it runs gigantic tasks on clouds for absurdly cheap, it's been heavily committed to by tech giants like IBM, Yahoo!, and the Apache Project, and it's completely open-source. But what exactly is it, and more importantly, how do you even get a Hadoop cluster up and running? From Apress, the name you...
Price: $29.99 | Publisher: Apress | Release: 2009