Processing and visualizing the data in tweets

  • Authors:
  • Adam Marcus;Michael S. Bernstein;Osama Badar;David R. Karger;Samuel Madden;Robert C. Miller

  • Affiliations:
  • MIT CSAIL;MIT CSAIL;MIT CSAIL;MIT CSAIL;MIT CSAIL;MIT CSAIL

  • Venue:
  • ACM SIGMOD Record
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Microblogs such as Twitter provide a valuable stream of diverse user-generated data. While the data extracted from Twitter is generally timely and accurate, the process by which developers extract structured data from the tweet stream is ad-hoc and requires reimplementation of common data manipulation primitives. In this paper, we present two systems for querying and extracting structure from Twitter-embedded data. The first, TweeQL, provides a streaming SQL-like interface to the Twitter API, making common tweet processing tasks simpler. The second, TwitInfo, shows how end-users can interact with and understand aggregated data from the tweet stream, in addition to showcasing the power of the TweeQL language. Together these systems show the richness of content that can be extracted from Twitter.