BUAP: performance of K-Star at the INEX'09 clustering task

  • Authors:
  • David Pinto;Mireya Tovar;Darnes Vilariño;Beatriz Beltrán;Héctor Jiménez-Salazar;Basilia Campos

  • Affiliations:
  • Faculty of Computer Science, B. Autonomous University of Puebla, Mexico;Faculty of Computer Science, B. Autonomous University of Puebla, Mexico;Faculty of Computer Science, B. Autonomous University of Puebla, Mexico;Faculty of Computer Science, B. Autonomous University of Puebla, Mexico;Department of Information Technologies, Autonomous Metropolitan University, Mexico;Faculty of Computer Science, B. Autonomous University of Puebla, Mexico

  • Venue:
  • INEX'09 Proceedings of the Focused retrieval and evaluation, and 8th international conference on Initiative for the evaluation of XML retrieval
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The aim of this paper is to use unsupervised classification techniques in order to group the documents of a given huge collection into clusters. We approached this challenge by using a simple clustering algorithm (K-Star) in a recursive clustering process over subsets of the complete collection. The presented approach is a scalable algorithm which may automatically discover the number of clusters. The obtained results outperformed different baselines presented in the INEX 2009 clustering task.