ACONS: a new algorithm for clustering documents

  • Authors:
  • Andrés Gago Alonso; Airel Pérez Suárez; José E. Medina Pagola

  • Affiliations:
  • Advanced Technologies Application Center, CENATAV, La Habana, Cuba; Advanced Technologies Application Center, CENATAV, La Habana, Cuba; Advanced Technologies Application Center, CENATAV, La Habana, Cuba

  • Venue:
  • CIARP'07 Proceedings of the Congress on pattern recognition 12th Iberoamerican conference on Progress in pattern recognition, image analysis and applications
  • Year:
  • 2007

Abstract

In this paper we present a new document clustering algorithm called Condensed Star (ACONS). The algorithm is a natural evolution of the Star algorithm proposed by Aslam et al. and later improved by them and by other researchers. We introduce a new concept of star that allows a different star-shaped form; in this way we retain the strengths of previous algorithms while addressing their shortcomings. Evaluation experiments on standard document collections show that the proposed algorithm outperforms previously defined methods and obtains a smaller number of clusters. Because ACONS is relatively simple to implement and efficient, we advocate its use for tasks that require clustering, such as information organization, browsing, topic tracking, and new topic detection.
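
For orientation, the baseline that ACONS builds on can be sketched in a few lines. The code below is a minimal illustration of the original offline Star algorithm of Aslam et al., not of ACONS itself, since the abstract does not specify the condensed-star definition. It assumes documents are given as term-weight dictionaries, uses cosine similarity, and takes a similarity threshold named `sigma`; the names `cosine`, `star_clusters`, and `SAMPLE_DOCS` are hypothetical and chosen only for this example.

```python
import math
from typing import Dict, List, Tuple

Doc = Dict[str, float]  # term -> weight (assumed representation)


def cosine(a: Doc, b: Doc) -> float:
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def star_clusters(docs: List[Doc], sigma: float) -> List[Tuple[int, List[int]]]:
    """Greedy star cover of the sigma-thresholded similarity graph.

    Returns (center, satellites) pairs. Clusters may overlap, because a
    satellite can be adjacent to more than one center.
    """
    n = len(docs)
    # Build the thresholded graph: an edge joins documents whose
    # similarity is at least sigma.
    adj = [set() for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if cosine(docs[i], docs[j]) >= sigma:
                adj[i].add(j)
                adj[j].add(i)

    unmarked = set(range(n))
    clusters = []
    # Visit vertices by decreasing degree (ties broken by index). Each
    # vertex that is still unmarked becomes a star center, and it and
    # all of its neighbours are then marked.
    for v in sorted(range(n), key=lambda v: (-len(adj[v]), v)):
        if v in unmarked:
            clusters.append((v, sorted(adj[v])))
            unmarked.discard(v)
            unmarked -= adj[v]
    return clusters


# Toy collection, invented purely for illustration.
SAMPLE_DOCS: List[Doc] = [
    {"star": 1.0, "cluster": 1.0},
    {"star": 1.0, "cluster": 1.0, "graph": 1.0},
    {"star": 1.0, "graph": 1.0},
    {"image": 1.0, "pixel": 1.0},
    {"image": 1.0},
]

if __name__ == "__main__":
    for center, satellites in star_clusters(SAMPLE_DOCS, sigma=0.6):
        print(center, satellites)
```

In this baseline the number of clusters equals the number of star centers. The abstract reports that ACONS obtains fewer clusters than earlier Star variants, which is consistent with it changing which vertices qualify as centers, though the exact rule is not stated here.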