Data Science from Scratch, 2nd Edition
by Joel Grus
To really learn data science, you should not only master the tools - data science libraries, frameworks, modules, and toolkits - but also understand the ideas and principles underlying them. Updated for Python 3.6, this second edition of Data Science from Scratch shows you how these tools and algorithms work by implementing them from scratch.If you have an aptitude for mathematics and some programming skill...
Price: $22.31 | Publisher: O'Reilly Media | Release: 2019
by Hillel Wayne
Learn how to design complex, correct programs and fix problems before writing a single line of code. This book is a practical, comprehensive resource on TLA+ programming with rich, complex examples. Practical TLA+ shows you how to use TLA+ to specify a complex system and test the design itself for bugs.You'll learn how even a short TLA+ spec can find critical bugs. Start by getting your feet wet with a...
Price: $23.93 | Publisher: Apress | Release: 2018
Seven Databases in Seven Weeks, 2nd Edition
by Luc Perkins, Jim Wilson, Eric Redmond
Data is getting bigger and more complex by the day, and so are your choices in handling it. Explore some of the most cutting-edge databases available - from a traditional relational database to newer NoSQL approaches - and make informed decisions about challenging data storage problems. This is the only comprehensive guide to the world of NoSQL databases, with in-depth practical and conceptual introductions...
Price: $28.50 | Publisher: The Pragmatic Programmers | Release: 2018
Apache Hadoop 3 Quick Start Guide
by Hrishikesh Karambelkar
Apache Hadoop is a widely used distributed data platform. It enables large datasets to be efficiently processed instead of using one large computer to store and process the data. This book will get you started with the Hadoop ecosystem, and introduce you to the main technical topics, including MapReduce, YARN, and HDFS.The book begins with an overview of big data and Apache Hadoop. Then, you will set up a p...
Price: $29.99 | Publisher: Packt Publishing | Release: 2018
Sams Teach Yourself Hadoop in 24 Hours
by Jeffrey Aven
Apache Hadoop is the technology at the heart of the Big Data revolution, and Hadoop skills are in enormous demand. Now, in just 24 lessons of one hour or less, you can learn all the skills and techniques you'll need to deploy each key component of a Hadoop platform in your local environment or in the cloud, building a fully functional Hadoop cluster and using it with real programs and datasets. Each sh...
Price: $31.99 | Publisher: SAMS Publishing | Release: 2017
by Michael Brzustowicz
Data Science is booming thanks to R and Python, but Java brings the robustness, convenience, and ability to scale critical to today's data science applications. With this practical book, Java software engineers looking to add data science skills will take a logical journey through the data science pipeline. Author Michael Brzustowicz explains the basic math theory behind each step of the data science p...
Price: $27.33 | Publisher: O'Reilly Media | Release: 2017
by Scott Shaw, Andreas Francois Vermeulen, Ankur Gupta, David Kjerrumgaard
Dive into the world of SQL on Hadoop and get the most out of your Hive data warehouses. This book is your go-to resource for using Hive: authors Scott Shaw, Ankur Gupta, David Kjerrumgaard, and Andreas Francois Vermeulen take you through learning HiveQL, the SQL-like language specific to Hive, to analyze, export, and massage the data stored across your Hadoop environment. From deploying Hive on your hardwar...
Price: $35.66 | Publisher: Apress | Release: 2016
Hadoop MapReduce v2 Cookbook, 2nd Edition
by Thilina Gunarathne
Starting with installing Hadoop YARN, MapReduce, HDFS, and other Hadoop ecosystem components, with this book, you will soon learn about many exciting topics such as MapReduce patterns, using Hadoop to solve analytics, classifications, online marketing, recommendations, and data indexing and searching. You will learn how to take advantage of Hadoop ecosystem projects including Hive, HBase, Pig, Mahout, Nutch...
Price: $39.08 | Publisher: Packt Publishing | Release: 2015
by Mahmoud Parsian
If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step through the algorithms and tools you need to build distributed MapReduce applications with Apache Hadoop or Apache Spark. Each chapter provides a recipe for solving a massive computational problem, such as building a recommendation system. You'll learn how to implement the app...
Price: $54.57 | Publisher: O'Reilly Media | Release: 2015
by Chandramani Tiwary
In the past few years the generation of data and our capability to store and process it has grown exponentially. There is a need for scalable analytics frameworks and people with the right skills to get the information needed from this Big Data. Apache Mahout is one of the first and most prominent Big Data machine learning platforms. It implements machine learning algorithms on top of distributed processing...
Price: $44.99 | Publisher: Packt Publishing | Release: 2015