Feature diversity in cluster ensembles for robust document clustering

  • Authors:
  • Xavier Sevillano;Germán Cobo;Francesc Alías;Joan Claudi Socoró

  • Affiliations:
  • Ramon Llull University, Barcelona, Spain;Ramon Llull University, Barcelona, Spain;Ramon Llull University, Barcelona, Spain;Ramon Llull University, Barcelona, Spain

  • Venue:
  • SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The performance of document clustering systems depends on employing optimal text representations, which are not only difficult to determine beforehand, but also may vary from one clustering problem to another. As a first step towards building robust document clusterers, a strategy based on feature diversity and cluster ensembles is presented in this work. Experiments conducted on a binary clustering problem show that our method is robust to near-optimal model order selection and able to detect constructive interactions between different document representations in the test bed.