Programming Hive

Data Warehouse and Query Language for Hadoop



Bookstore > Books > Programming Hive

Price$24.98 - $33.99
Rating
AuthorsEdward Capriolo, Dean Wampler, Jason Rutherglen
PublisherO'Reilly Media
Published2012
Pages352
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-101449319335
ISBN-139781449319335
EBook Hardcover Paperback

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop's data warehouse infrastructure. You'll quickly learn how to use Hive's SQL dialect - HiveQL - to summarize, query, and analyze large datasets stored in Hadoop's distributed filesystem.

This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You'll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.
  • Use Hive to create, alter, and drop databases, tables, views, functions, and indexes;
  • Customize data formats and storage options, from files to external databases;
  • Load and extract data from tables - and use queries, grouping, filtering, joining, and other conventional query methods;
  • Gain best practices for creating user defined functions (UDFs);
  • Learn Hive patterns you should use and anti-patterns you should avoid;
  • Integrate Hive with other data processing programs;
  • Use storage handlers for NoSQL databases and other datastores;
  • Learn the pros and cons of running Hive on Amazon's Elastic MapReduce.


  1. (4 books)


4 5 163

Similar Books


Options and Derivatives Programming in C++20, 2nd Edition

Options and Derivatives Programming in C++20, 2nd Edition

by Carlos Oliveira

Master the features of C++ that are frequently used to write financial software for options and derivatives, including the STL, templates, functional programming, and numerical libraries. This book also covers new features introduced in C++20 and other recent standard releases: modules, concepts, spaceship operators, and smart pointers. Y...

Price:  $34.37  |  Publisher:  Apress  |  Release:  2020

Learn Scala Programming

Learn Scala Programming

by Slava Schmidt

The second version of Scala has undergone multiple changes to support features and library implementations. Scala 2.13, with its main focus on modularizing the standard library and simplifying collections, brings with it a host of updates.Learn Scala Programming addresses both technical and architectural changes to the redesigned standard...

Price:  $37.37  |  Publisher:  Packt Publishing  |  Release:  2018

Java-Based Real-Time Programming

Java-Based Real-Time Programming

by Klas Nilsson

Development of embedded software has for some years mainly been carried out by hardware-aware programming using the C-language, and in some cases even in assembly languages. This works well in simple cases when the application demands and the hardware are known at design time, and the size of the (statically defined) software is small. Wh...

Free ebook  |  Publisher:  Self-publishing  |  Release:  2016

Is Parallel Programming Hard, And, If So, What Can You Do About It?

Is Parallel Programming Hard, And, If So, What Can You Do About It?

by Paul McKenney

The purpose of this book is to help you program shared-memory parallel systems without risking your sanity. Nevertheless, you should think of the information in this book as a foundation on which to build, rather than as a completed cathedral. Your mission, if you choose to accept, is to help make further progress in the exciting field of...

Free ebook  |  Publisher:  Self-publishing  |  Release:  2021

Beej's Guide to C Programming

Beej's Guide to C Programming

by Brian Hall

This is an intro to C for folks who already know how to program in another language. The first half of the book is written in a tutorial style, while the second half is a reference section complete with examples (inspired by the incomparable Turbo C Bible). The goal is to keep this up-to-date with the latest C standards.This guide assumes...

Free ebook  |  Publisher:  Self-publishing  |  Release:  2022

Multiplayer Game Programming

Multiplayer Game Programming

by Josh Glazer, Sanjay Madhav

Networked multiplayer games are a multibillion dollar business: some games now attract tens of millions of players. In this practical, code-rich guide, Joshua Glazer and Sanjay Madhav guide you through every aspect of engineering them. Drawing on their immense experience as both game developers and instructors, the authors lead you throug...

Price:  $33.99  |  Publisher:  Addison-Wesley  |  Release:  2015

Learning Swift 2 Programming, 2nd Edition

Learning Swift 2 Programming, 2nd Edition

by Jacob Schatz

Learning Swift 2 Programming is a fast-paced, hands-on introduction to writing production-quality iOS and OS X apps with Apple's programming language. Written for developers with experience in any modern language, this book explains Swift simply and clearly, using relevant examples that solve realistic problems.Author Jacob Schatz�...

Price:  $28.32  |  Publisher:  Addison-Wesley  |  Release:  2015

A Practical Guide to Linux Commands, Editors, and Shell Programming, 4th Edition

A Practical Guide to Linux Commands, Editors, and Shell Programming, 4th Edition

by Mark G. Sobell, Matthew Helmke

Linux is today's dominant Internet server platform. System administrators and Web developers need deep Linux fluency, including expert knowledge of shells and the command line. This is the only guide with everything you need to achieve that level of Linux mastery. Renowned Linux expert Mark Sobell has brought together comprehensive, ...

Price:  $28.99  |  Publisher:  Addison-Wesley  |  Release:  2017