A general bio-inspired method to improve the short-text clustering task

  • Authors:
  • Diego Ingaramo;Marcelo Errecalde;Paolo Rosso

  • Affiliations:
  • LIDIC, Universidad Nacional de San Luis, Argentina;LIDIC, Universidad Nacional de San Luis, Argentina;Natural Language Eng. Lab. ELiRF, DSIC, Universidad Politécnica de Valencia, Spain

  • Venue:
  • CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

“Short-text clustering” is a very important research field due to the current tendency for people to use very short documents, e.g. blogs, text-messaging and others. In some recent works, new clustering algorithms have been proposed to deal with this difficult problem and novel bio-inspired methods have reported the best results in this area. In this work, a general bio-inspired method based on the AntTree approach is proposed for this task. It takes as input the results obtained by arbitrary clustering algorithms and refines them in different stages. The proposal shows an interesting improvement in the results obtained with different algorithms on several short-text collections.