Fast, Scalable, and Context-Sensitive Detection of Trending Topics in Microblog Post Streams

  • Authors:
  • Nargis Pervin;Fang Fang;Anindya Datta;Kaushik Dutta;Debra Vandermeer

  • Affiliations:
  • National University of Singapore;National University of Singapore;National University of Singapore;National University of Singapore;Florida International University

  • Venue:
  • ACM Transactions on Management Information Systems (TMIS)
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Social networks, such as Twitter, can quickly and broadly disseminate news and memes across both real-world events and cultural trends. Such networks are often the best sources of up-to-the-minute information, and are therefore of considerable commercial and consumer interest. The trending topics that appear first on these networks represent an answer to the age-old query “what are people talking about?” Given the incredible volume of posts (on the order of 45,000 or more per minute), and the vast number of stories about which users are posting at any given time, it is a formidable problem to extract trending stories in real time. In this article, we describe a method and implementation for extracting trending topics from a high-velocity real-time stream of microblog posts. We describe our approach and implementation, and a set of experimental results that show that our system can accurately find “hot” stories from high-rate Twitter-scale text streams.