by Dan Toomey
Jupyter has garnered a strong interest in the data science community of late, as it makes common data processing and analysis tasks much simpler. This book is for data science professionals who want to master various tasks related to Jupyter to create efficient, easy-to-share, scientific applications.The book starts with recipes on installing and running the Jupyter Notebook system on various platforms and ...
Price: $39.99 | Publisher: Packt Publishing | Release: 2018
Machine Learning with Apache Spark Quick Start Guide
by Jillur Quddus
Every person and every organization in the world manages data, whether they realize it or not. Data is used to describe the world around us and can be used for almost any purpose, from analyzing consumer habits to fighting disease and serious organized crime. Ultimately, we manage data in order to derive value from it, and many organizations around the world have traditionally invested in technology to help...
Price: $29.99 | Publisher: Packt Publishing | Release: 2018
Complete Guide to Open Source Big Data Stack
by Mike Frampton
See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together.In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he uses each chapter to introduce one ...
Price: $30.77 | Publisher: Apress | Release: 2018
by Raju Kumar Mishra
Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved!PySpark Recipes covers Hadoop and its shortcomings. The architecture of Spark, PySpark, and RDD are presented. You will ...
Price: $35.10 | Publisher: Apress | Release: 2018
Advanced Data Analytics Using Python
by Sayan Mukhopadhyay
Gain a broad foundation of advanced data analytics concepts and discover the recent revolution in databases such as Neo4j, Elasticsearch, and MongoDB. This book discusses how to implement ETL techniques including topical crawling, which is applied in domains such as high-frequency algorithmic trading and goal-oriented dialog systems. You'll also see examples of machine learning concepts such as semi-su...
Price: $29.01 | Publisher: Apress | Release: 2018
The Business Value of Developer Relations
by Mary Thengvall
Discover the true value of Developer Relations as you learn to build and maintain positive relationships with your developer community. Use the principles laid out in this book to walk through your company goals and discover how you can formulate a plan tailored to your specific needs.First you will understand the value of a technical community: why you need to foster a community and how to do it. Then you ...
Price: $19.21 | Publisher: Apress | Release: 2018
Applied Text Analysis with Python
by Benjamin Bengfort, Tony Ojeda, Rebecca Bilbro
From news and speeches to informal chatter on social media, natural language is one of the richest and most underutilized sources of data. Not only does it come in a constant stream, always changing and adapting in context; it also contains information that is not conveyed by traditional data sources. The key to unlocking natural language is through the creative application of text analytics. This practical...
Price: $40.44 | Publisher: O'Reilly Media | Release: 2018
Data Science on the Google Cloud Platform
by Valliappa Lakshmanan
Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, you'll work through a sampl...
Price: $42.33 | Publisher: O'Reilly Media | Release: 2018
by Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bosshart
Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator - either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data....
Price: $40.44 | Publisher: O'Reilly Media | Release: 2018
Microsoft Excel 2019 VBA and Macros
by Bill Jelen, Tracy Syrstad
Renowned Excel experts Bill Jelen (MrExcel) and Tracy Syrstad explain how to build more powerful, reliable, and efficient Excel spreadsheets.Use this guide to automate virtually any routine Excel task: save yourself hours, days, maybe even weeks. Make Excel do things you thought were impossible, discover macro techniques you won't find anywhere else, and create automated reports that are amazingly powe...
Price: $15.00 | Publisher: Microsoft Press | Release: 2018