Social event detection on twitter

  • Authors:
  • Elena Ilina;Claudia Hauff;Ilknur Celik;Fabian Abel;Geert-Jan Houben

  • Affiliations:
  • Web Information Systems, Delft University of Technology, The Netherlands;Web Information Systems, Delft University of Technology, The Netherlands;Middle East Technical University, Turkey;Web Information Systems, Delft University of Technology, The Netherlands;Web Information Systems, Delft University of Technology, The Netherlands

  • Venue:
  • ICWE'12 Proceedings of the 12th international conference on Web Engineering
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Various applications are developed today on top of microblogging services like Twitter. In order to engineer Web applications which operate on microblogging data, there is a need for appropriate filtering techniques to identify messages. In this paper, we focus on detecting Twitter messages (tweets) that report on social events. We introduce a filtering pipeline that exploits textual features and n-grams to classify messages into event related and non-event related tweets. We analyze the impact of preprocessing techniques, achieving accuracies higher than 80%. Further, we present a strategy to automate labeling of training data, since our proposed filtering pipeline requires training data. When testing on our dataset, this semi-automated method achieves an accuracy of 79% and results comparable to the manual labeling approach.