Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale

Paperback | December 16, 2016

by Ofer Mendelevitch, Casey Stella, Douglas Eadline

As adoption of Hadoop accelerates in the enterprise and beyond, there's soaring demand for those who can solve real-world problems by applying advanced data science techniques in Hadoop environments. Practical Data Science with Hadoop® and Spark provides a complete and up-to-date guide to data science with Hadoop: high-level concepts, deep-dive techniques, practical applications, hands-on tutorials, and real-world use cases. Drawing on their extensive experience with Hadoop in enterprise Big Data environments, the authors bring together the practical knowledge you'll need to do real, useful data science with Hadoop. Coverage includes:

  • What data science is, what data scientists do, and how to build or join a data science team
  • Core data science applications in retail, healthcare, insurance, banking, education, and beyond
  • How Hadoop has evolved into an outstanding environment for doing data science
  • A day in the life of a data scientist: exploration, iteration, and more
  • Getting your data into Hadoop: data lakes, Sqoop, Flume, Falcon, and more
  • Preparing your data, from start to finish
  • Data modeling and machine learning
  • Visualization: how (and how not) to use it
  • Start-to-finish case studies: recommender systems, customer segmentation, sentiment analysis, and predictive risk modeling
  • The future: Storm online scoring, Giraph graph algorithms, Solr/Elasticsearch, and more

Pricing and Purchase Info

$55.99

From the Publisher

Ofer Mendelevitch is Vice President of Data Science at Lendup, where he is responsible for Lendup’s machine learning and advanced analytics group. Prior to joining Lendup, Ofer was Director of Data Science at Hortonworks, where he was responsible for helping Hortonworks’ customers apply data science with Hadoop and Spark to big data ...

Other books by Ofer Mendelevitch

Data Munging with Hadoop

Kobo ebook | Nov 20, 2015

$10.09 online | $13.11 list price (save 23%)

Format: Paperback

Dimensions: 256 pages, 8.75 × 6.35 × 0.68 in

Published: December 16, 2016

Publisher: Pearson Education

Language: English

The following ISBNs are associated with this title:

ISBN-10: 0134024141

ISBN-13: 9780134024141

Table of Contents

Foreword

Preface

Acknowledgments

About the Authors

Part I: Data Science with Hadoop—An Overview

Chapter 1: Introduction to Data Science

Chapter 2: Use Cases for Data Science

Chapter 3: Hadoop and Data Science

Part II: Preparing and Visualizing Data with Hadoop

Chapter 4: Getting the Data into Hadoop

Chapter 5: Data Munging with Hadoop

Chapter 6: Exploring and Visualizing Data

Part III: Applying Data Modeling with Hadoop

Chapter 7: Machine Learning with Hadoop

Chapter 8: Predictive Modeling

Chapter 9: Clustering

Chapter 10: Anomaly Detection with Hadoop

Chapter 11: Natural Language Processing

Chapter 12: Data Science—The Next Frontier

Appendix A: Book Webpage and Code Download

Appendix B: HDFS Quick Start

Appendix C: Additional Background on Data Science and Apache Hadoop

Index