Beginning Apache Spark Using Azure Databricks
by Robert Ilijason
Analyze vast amounts of data in record time using Apache Spark with Databricks in the Cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fraction of what classical analytics solutions cost, while at the same time getting the results you need, incremental...
Price: $32.32 | Publisher: Apress | Release: 2020
by Atri Sharma
Gain a thorough knowledge of Lucene's capabilities and use it to develop your own search applications. This book explores the Java-based, high-performance text search engine library used to build search capabilities in your applications. Starting with the basics of Lucene and searching, you will learn about the types of queries used in it and also take a look at scoring models. Applying this basic know...
Price: $31.61 | Publisher: Apress | Release: 2020
Using and Administering Linux: Volume 3
by David Both
Manage complex systems with ease and equip yourself for a new career. This book builds upon the skills you learned in Volumes 1 and 2 of this course and it depends upon the virtual network and virtual machine you created there. However, more experienced Linux users can begin with this volume and download an assigned script that will set up the VM for the start of Volume 3. Instructions with the script will ...
Price: $30.86 | Publisher: Apress | Release: 2020
by Benjamin Weissman, Enrico van de Laar
Use this guide to one of SQL Server 2019's most impactful features - Big Data Clusters. You will learn about data virtualization and data lakes for this complete artificial intelligence (AI) and machine learning (ML) platform within the SQL Server database engine. You will know how to use Big Data Clusters to combine large volumes of streaming data for analysis along with data stored in a traditional d...
Price: $33.67 | Publisher: Apress | Release: 2020
by Burr Sutter, Kamesh Sampath
Enterprise developers face several challenges when it comes to building serverless applications, such as integrating applications and building container images from source. With more than 60 practical recipes, this cookbook helps you solve these issues with Knative - the first serverless platform natively designed for Kubernetes. Each recipe contains detailed examples and exercises, along with a discussion ...
Price: $38.50 | Publisher: O'Reilly Media | Release: 2020
by Jean-Georges Perrin
The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, 2nd Edition, you'll learn to take advantage of Spark's core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in ente...
Price: $35.89 | Publisher: Manning | Release: 2020
by Mark Needham, Amy Hodler
Learn how graph algorithms can help you leverage relationships within your data to develop intelligent solutions and enhance your machine learning models. With this practical guide,developers and data scientists will discover how graph analytics deliver value, whether they're used for building dynamic network models or forecasting real-world behavior.Mark Needham and Amy Hodler from Neo4j explain how g...
Price: $45.63 | Publisher: O'Reilly Media | Release: 2019
Programmer's Guide to Apache Thrift
by Randy Abernethy
Programmer's Guide to Apache Thrift provides comprehensive coverage of the Apache Thrift framework along with a developer's-eye view of modern distributed application architecture.Thrift-based distributed software systems are built out of communicating components that use different languages, protocols, and message types. Sitting between them is Thrift, which handles data serialization, transport,...
Price: $53.61 | Publisher: Manning | Release: 2019
by Tommaso Teofili
Deep Learning for Search teaches you how to improve the effectiveness of your search by implementing neural network-based techniques. By the time you're finished with the book, you'll be ready to build amazing search engines that deliver the results your users need and that get better as time goes on!Deep learning handles the toughest search challenges, including imprecise search terms, badly inde...
Price: $39.99 | Publisher: Manning | Release: 2019
Apache Kafka Quick Start Guide
by Raul Estrada
Apache Kafka is a great open source platform for handling your real-time data pipeline to ensure high-speed filtering and pattern matching on the fly. In this book, you will learn how to use Apache Kafka for efficient processing of distributed applications and will get familiar with solving everyday problems in fast data and processing pipelines.This book focuses on programming rather than the configuratio...
Price: $29.99 | Publisher: Packt Publishing | Release: 2018