Comparative study of clustering techniques for short text documents

  • Authors:
  • Aniket Rangrej;Sayali Kulkarni;Ashish V. Tendulkar

  • Affiliations:
  • IIT Madras, Chennai, India;Self, Pune, India;IIT Madras, Chennai, India

  • Venue:
  • Proceedings of the 20th international conference companion on World wide web
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We compare various document clustering techniques including K-means, SVD-based method and a graph-based approach and their performance on short text data collected from Twitter. We define a measure for evaluating the cluster error with these techniques. Observations show that graph-based approach using affinity propagation performs best in clustering short text data with minimal cluster error.