Apache Spark Books



Bookstore > Books > Apache Spark

Jupyter Cookbook

Jupyter Cookbook

by Dan Toomey

Jupyter has garnered a strong interest in the data science community of late, as it makes common data processing and analysis tasks much simpler. This book is for data science professionals who want to master various tasks related to Jupyter to create efficient, easy-to-share, scientific applications.The book starts with recipes on installing and running the Jupyter Notebook system on various platforms and ...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2018

Machine Learning with Apache Spark Quick Start Guide

Machine Learning with Apache Spark Quick Start Guide

by Jillur Quddus

Every person and every organization in the world manages data, whether they realize it or not. Data is used to describe the world around us and can be used for almost any purpose, from analyzing consumer habits to fighting disease and serious organized crime. Ultimately, we manage data in order to derive value from it, and many organizations around the world have traditionally invested in technology to help...

Price:  $29.99  |  Publisher:  Packt Publishing  |  Release:  2018

Complete Guide to Open Source Big Data Stack

Complete Guide to Open Source Big Data Stack

by Mike Frampton

See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together.In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he uses each chapter to introduce one ...

Price:  $30.77  |  Publisher:  Apress  |  Release:  2018

PySpark Recipes

PySpark Recipes

by Raju Kumar Mishra

Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved!PySpark Recipes covers Hadoop and its shortcomings. The architecture of Spark, PySpark, and RDD are presented. You will ...

Price:  $35.10  |  Publisher:  Apress  |  Release:  2018

Advanced Data Analytics Using Python

Advanced Data Analytics Using Python

by Sayan Mukhopadhyay

Gain a broad foundation of advanced data analytics concepts and discover the recent revolution in databases such as Neo4j, Elasticsearch, and MongoDB. This book discusses how to implement ETL techniques including topical crawling, which is applied in domains such as high-frequency algorithmic trading and goal-oriented dialog systems. You'll also see examples of machine learning concepts such as semi-su...

Price:  $29.01  |  Publisher:  Apress  |  Release:  2018

The Business Value of Developer Relations

The Business Value of Developer Relations

by Mary Thengvall

Discover the true value of Developer Relations as you learn to build and maintain positive relationships with your developer community. Use the principles laid out in this book to walk through your company goals and discover how you can formulate a plan tailored to your specific needs.First you will understand the value of a technical community: why you need to foster a community and how to do it. Then you ...

Price:  $19.21  |  Publisher:  Apress  |  Release:  2018

Applied Text Analysis with Python

Applied Text Analysis with Python

by Benjamin Bengfort, Tony Ojeda, Rebecca Bilbro

From news and speeches to informal chatter on social media, natural language is one of the richest and most underutilized sources of data. Not only does it come in a constant stream, always changing and adapting in context; it also contains information that is not conveyed by traditional data sources. The key to unlocking natural language is through the creative application of text analytics. This practical...

Price:  $40.44  |  Publisher:  O'Reilly Media  |  Release:  2018

Data Science on the Google Cloud Platform

Data Science on the Google Cloud Platform

by Valliappa Lakshmanan

Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, you'll work through a sampl...

Price:  $42.33  |  Publisher:  O'Reilly Media  |  Release:  2018

Getting Started with Kudu

Getting Started with Kudu

by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart

Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator - either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data....

Price:  $40.44  |  Publisher:  O'Reilly Media  |  Release:  2018

Microsoft Excel 2019 VBA and Macros

Microsoft Excel 2019 VBA and Macros

by Bill Jelen, Tracy Syrstad

Renowned Excel experts Bill Jelen (MrExcel) and Tracy Syrstad explain how to build more powerful, reliable, and efficient Excel spreadsheets.Use this guide to automate virtually any routine Excel task: save yourself hours, days, maybe even weeks. Make Excel do things you thought were impossible, discover macro techniques you won't find anywhere else, and create automated reports that are amazingly powe...

Price:  $16.77  |  Publisher:  Microsoft Press  |  Release:  2018

Pages: ←Previous | 1, 2, 3, 4, 5, 6 ... 9 | Next→

Subscribe to Newsletter

Be the first to know about new IT books, upcoming releases, exclusive offers and more.