by Sandeep Karanth
Hadoop is synonymous with Big Data processing. Its simple programming model, "code once and deploy at any scale" paradigm, and an ever-growing ecosystem makes Hadoop an all-encompassing platform for programmers with different levels of expertise.This book explores the industry guidelines to optimize MapReduce jobs and higher-level abstractions such as Pig and Hive in Hadoop 2.0. Then, it d...
Price: $49.99 | Publisher: Packt Publishing | Release: 2014
by Debarchan Sarkar
Pro Microsoft HDInsight is a complete guide to deploying and using Apache Hadoop on the Microsoft Windows Azure Platforms. The information in this book enables you to process enormous volumes of structured as well as non-structured data easily using HDInsight, which is Microsoft's own distribution of Apache Hadoop. Furthermore, the blend of Infrastructure as a Service (IaaS) and Platform as a Service (...
Price: $43.14 | Publisher: Apress | Release: 2014
by Michael Frampton
Many corporations are finding that the size of their data sets are outgrowing the capability of their systems to store and process them. The data is becoming too big to manage and use with traditional tools. The solution: implementing a big data system.The problem is that the Internet offers IT pros wading into big data many versions of the truth and some outright falsehoods born of ignorance. What is neede...
Price: $44.52 | Publisher: Apress | Release: 2014
Hadoop in Practice, 2nd Edition
by Alex Holmes
Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and...
Price: $30.77 | Publisher: Manning | Release: 2014
Introducing Windows Azure Hdinsight
by Avkash Chauhan, Valentine Fontama, Michele Hart, Wee Hyong Tok, Buck Woody
In Introducing Microsoft Azure HDInsight, we cover what big data really means, how you can use it to your advantage in your company or organization, and one of the services you can use to do that quickly-specifically, Microsoft's HDInsight service. We start with an overview of big data and Hadoop, but we don't emphasize only concepts in this book-we want you to jump in and get your hands dirty wor...
Publisher: Microsoft Press | Release: 2014
by Jordan Tigani, Siddartha Naidu
Google BigQuery Analytics is the perfect guide for business and data analysts who want the latest tips on running complex queries and writing code to communicate with the BigQuery API. The book uses real-world examples to demonstrate current best practices and techniques, and also explains and demonstrates streaming ingestion, transformation via Hadoop in Google Compute engine, AppEngine datastore integrati...
Price: $33.42 | Publisher: Wiley | Release: 2014
by Bahaaldine Azarmi
Talend, a successful Open Source Data Integration Solution, accelerates the adoption of new big data technologies and efficiently integrates them into your existing IT infrastructure. It is able to do this because of its intuitive graphical language, its multiple connectors to the Hadoop ecosystem, and its array of tools for data integration, quality, management, and governance.This is a concise, pragmatic ...
Price: $20.99 | Publisher: Packt Publishing | Release: 2014
by Pradeep Pasupuleti
Pig Design Patterns is a comprehensive guide that will enable readers to readily use design patterns that simplify the creation of complex data pipelines in various stages of data management. This book focuses on using Pig in an enterprise context, bridging the gap between theoretical understanding and practical implementation. Each chapter contains a set of design patterns that pose and then solve technica...
Price: $28.26 | Publisher: Packt Publishing | Release: 2014
by Ankit Jain, Anand Nalya
Starting with the very basics of Storm, you will learn how to set up Storm on a single machine and move on to deploying Storm on your cluster. You will understand how Kafka can be integrated with Storm using the Kafka spout.You will then proceed to explore the Trident abstraction tool with Storm to perform stateful stream processing, guaranteeing single message processing in every topology. You will move ah...
Price: $8.99 | Publisher: Packt Publishing | Release: 2014
Scaling Big Data with Hadoop and Solr
by Hrishikesh Vijay Karambelkar
As data grows exponentially day-by-day, extracting information becomes a tedious activity in itself. Technologies like Hadoop are trying to address some of the concerns, while Solr provides high-speed faceted search. Bringing these two technologies together is helping organizations resolve the problem of information extraction from Big Data by providing excellent distributed faceted search capabilities.Scal...
Price: $26.99 | Publisher: Packt Publishing | Release: 2013