by Janek Bogucki, Alessandro Lacava, Aliaksandr Bedrytski, Matthew de Detrich, Benjamin Neil
Professional Scala provides experienced programmers with fast track coverage aimed at supporting the use of Scala in professional production applications. Skipping over the basics and fundamentals of programming, the discussion launches directly into practical Scala topics with the most up-to-date coverage of the rapidly-expanding language and related tools. Scala bridges the gap between functional and obje...
Price: $16.28 | Publisher: Wrox | Release: 2016
by Scott Shaw, Andreas Francois Vermeulen, Ankur Gupta, David Kjerrumgaard
Dive into the world of SQL on Hadoop and get the most out of your Hive data warehouses. This book is your go-to resource for using Hive: authors Scott Shaw, Ankur Gupta, David Kjerrumgaard, and Andreas Francois Vermeulen take you through learning HiveQL, the SQL-like language specific to Hive, to analyze, export, and massage the data stored across your Hadoop environment. From deploying Hive on your hardwar...
Price: $35.66 | Publisher: Apress | Release: 2016
by Bhushan Lakhe
Re-architect relational applications to NoSQL, integrate relational database management systems with the Hadoop ecosystem, and transform and migrate relational data to and from Hadoop components. This book covers the best-practice design approaches to re-architecting your relational applications and transforming your relational data for usage with the Hadoop ecosystem while considering concurrency, security...
Price: $44.99 | Publisher: Apress | Release: 2016
by Zubair Nabi
Learn the right cutting-edge skills and knowledge to leverage Spark Streaming to implement a wide array of real-time, streaming applications. This book walks you through end-to-end real-time application development using real-world applications, data, and code. Taking an application-first approach, each chapter introduces use cases from a specific industry and uses publicly available datasets from that doma...
Price: $21.98 | Publisher: Apress | Release: 2016
Kubernetes Microservices with Docker
by Deepak Vohra
This book on Kubernetes, a container cluster manager, discusses all aspects of using Kubernetes in today's complex big data and enterprise applications, with Docker containers.Starting with installing Kubernetes on a single node, Kubernetes Microservices with Docker introduces Kubernetes with a simple Hello example and discusses using environment variables in Kubernetes.Next, the book discusses using K...
Price: $49.99 | Publisher: Apress | Release: 2016
by Michael Nash, Wade Waldron
When it comes to big data processing, we can no longer ignore concurrency or try to add it in after the fact. Fortunately, the solution is not a new paradigm of development, but rather an old one. With this hands-on guide, Java and Scala developers will learn how to embrace concurrent and distributed applications with the open source Akka toolkit. You'll learn how to put the actor model and its associa...
Price: $19.07 | Publisher: O'Reilly Media | Release: 2016
Learning Probabilistic Graphical Models in R
by David Bellot
Probabilistic graphical models (PGM, also known as graphical models) are a marriage between probability theory and graph theory. Generally, PGMs use a graph-based representation. Two branches of graphical representations of distributions are commonly used, namely Bayesian networks and Markov networks. R has many packages to implement graphical models.We'll start by showing you how to transform a classi...
Price: $34.99 | Publisher: Packt Publishing | Release: 2016
by Dan Noble
ElasticSearch is a distributed search server similar to Apache Solr with a focus on large datasets, a schema-less setup, and high availability. This schema-free architecture allows ElasticSearch to index and search unstructured content, making it perfectly suited for both small projects and large big data warehouses with petabytes of unstructured data.This book is your toolkit to teach you how to keep your ...
Price: $34.99 | Publisher: Packt Publishing | Release: 2016
by Matei Zaharia, Holden Karau, Andy Konwinski, Patrick Wendell
Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java, and Scala. This edition includes new information on Spark SQL, Spark Streaming, set...
Price: $32.23 | Publisher: O'Reilly Media | Release: 2015
Beginning Big Data with Power BI and Excel 2013
by Neil Dunlop
In Beginning Big Data with Power BI and Excel 2013, you will learn to solve business problems by tapping the power of Microsoft's Excel and Power BI to import data from NoSQL and SQL databases and other sources, create relational data models, and analyze business problems through sophisticated dashboards and data-driven maps.While Beginning Big Data with Power BI and Excel 2013 covers prominent tools s...
Price: $18.76 | Publisher: Apress | Release: 2015