Mining The Social Web: Data Mining Facebook, Twitter, Linkedin, Google+, Github, And More by Matthew A. RussellMining The Social Web: Data Mining Facebook, Twitter, Linkedin, Google+, Github, And More by Matthew A. Russell

Mining The Social Web: Data Mining Facebook, Twitter, Linkedin, Google+, Github, And More

byMatthew A. Russell

Paperback | October 20, 2013

Pricing and Purchase Info

$58.50

Earn 293 plum® points

Prices and offers may vary in store

Out of stock online

Not available in stores

about

How can you tap into the wealth of social web data to discover who’s making connections with whom, what they’re talking about, and where they’re located? With this expanded and thoroughly revised edition, you’ll learn how to acquire, analyze, and summarize data from all corners of the social web, including Facebook, Twitter, LinkedIn, Google+, GitHub, email, websites, and blogs.

  • Employ the Natural Language Toolkit, NetworkX, and other scientific computing tools to mine popular social web sites
  • Apply advanced text-mining techniques, such as clustering and TF-IDF, to extract meaning from human language data
  • Bootstrap interest graphs from GitHub by discovering affinities among people, programming languages, and coding projects
  • Build interactive visualizations with D3.js, an extraordinarily flexible HTML5 and JavaScript toolkit
  • Take advantage of more than two-dozen Twitter recipes, presented in O’Reilly’s popular "problem/solution/discussion" cookbook format

The example code for this unique data science book is maintained in a public GitHub repository. It’s designed to be easily accessible through a turnkey virtual machine that facilitates interactive learning with an easy-to-use collection of IPython Notebooks.

Matthew Russell, Chief Technology Officer at Digital Reasoning, Principal at Zaffra, and author of several books on technology including Mining the Social Web (O'Reilly, 2013), now in its second edition. He is passionate about open source software development, data mining, and creating technology to amplify human intelligence. Matthew ...
Loading
Title:Mining The Social Web: Data Mining Facebook, Twitter, Linkedin, Google+, Github, And MoreFormat:PaperbackDimensions:448 pages, 9.19 × 7 × 0.94 inPublished:October 20, 2013Publisher:O'Reilly MediaLanguage:English

The following ISBNs are associated with this title:

ISBN - 10:1449367615

ISBN - 13:9781449367619

Reviews

Table of Contents

  • Preface
  • A Guided Tour of the Social Web
    • Prelude
    • Chapter 1: Mining Twitter: Exploring Trending Topics, Discovering What People Are Talking About, and More
    • Chapter 2: Mining Facebook: Analyzing Fan Pages, Examining Friendships, and More
    • Chapter 3: Mining LinkedIn: Faceting Job Titles, Clustering Colleagues, and More
    • Chapter 4: Mining Google+: Computing Document Similarity, Extracting Collocations, and More
    • Chapter 5: Mining Web Pages: Using Natural Language Processing to Understand Human Language, Summarize Blog Posts, and More
    • Chapter 6: Mining Mailboxes: Analyzing Who's Talking to Whom About What, How Often, and More
    • Chapter 7: Mining GitHub: Inspecting Software Collaboration Habits, Building Interest Graphs, and More
    • Chapter 8: Mining the Semantically Marked-Up Web: Extracting Microformats, Inferencing over RDF, and More
  • Twitter Cookbook
    • Chapter 9: Twitter Cookbook
  • Appendixes
    • Information About This Book's Virtual Machine Experience
    • OAuth Primer
    • Python and IPython Notebook Tips & Tricks
  • Colophon

Editorial Reviews

Mining the social web, again When we first published Mining the Social Web, I thought it was one of the most important books I worked on that year. Now that we’re publishing a second edition (which I didn’t work on), I find that I agree with myself. With this new edition, Mining the Social Web is more important than ever.   While we’re seeing more and more cynicism about the value of data, and particularly “big data,” that cynicism isn’t shared by most people who actually work with data. Data has undoubtedly been overhyped and oversold, but the best way to arm yourself against the hype machine is to start working with data yourself, to find out what you can and can’t learn. And there’s no shortage of data around. Everything we do leaves a cloud of data behind it: Twitter, Facebook, Google+ — to say nothing of the thousands of other social sites out there, such as Pinterest, Yelp, Foursquare, you name it. Google is doing a great job of mining your data for value. Why shouldn’t you? There are few better ways to learn about mining social data than by starting with Twitter; Twitter is really a ready-made laboratory for the new data scientist. And this book is without a doubt the best and most thorough approach to mining Twitter data out there. But that’s only a starting point. We hear a lot in the press about sentiment analysis and mining unstructured text data; this book shows you how to do it. If you need to mine the data in web pages or email archives, this book shows you how. And if you want to understand how to people collaborate on projects, Mining the Social Web is the only place I’ve seen that analyzes GitHub data. All of the examples in the book are available on Github. In addition to the example code, which is bundled into IPython notebooks, Matthew has provided a VirtualBox VM that installs Python, all the libraries you need to run the examples, the examples themselves, and an IPython server. Checking out the examples is as simple as installing Virtual Box, installing Vagrant, cloning the 2nd edition’s Github archive, and typing “vagrant up.”  You can execute the examples for yourself in the virtual machine; modify them; and use the virtual machine for your own projects, since it’s a fully functional Linux system with Python, Java, MongoDB, and other necessities pre-installed. You can view this as a book with accompanying examples in a particularly nice package, or you can view the book as “premium support” for an open source project that consists of the examples and the VM.   If you want to engage with the data that’s surrounding you, Mining the Social Web is the best place to start. Use it to learn, to experiment, and to build your own data projects.   -- Mike Loukides Vice President of Content Strategy for O'Reilly Media, Inc.