Discovering hot topics using Twitter streaming data: social topic detection and geographic clustering

  • Authors:
  • Hwi-Gang Kim;Seongjoo Lee;Sunghyon Kyeong

  • Affiliations:
  • National Institute for Mathematical Sciences, Daejeon, Korea;Yonsei University Seoul, Korea;National Institute for Mathematical Sciences, Daejeon, Korea

  • Venue:
  • Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

There has been an increasing interest in analyzing social network services data. However, detecting social topics in the era of information explosion requires state-of-the-art analytics techniques. The geographic clustering analysis based on social topics across provinces, i.e., states, has rarely been studied. Using the Twitter data collected in the United States (US), we detected the social hot topic by using the ratio of word frequency. Also, we found geographic communities by correlating the time series for a set of topic words across US states. The result of the geographic clustering was visualized using the Google Fusion Table. In conclusion, the ratio of word frequency properly detects social topics or breaking news while suppressing daily tweeted small talks or emotional words such as lol, like, and love. We have also demonstrated that a clustering algorithm based on a social topic can be useful in classifying social communities.