Ensembles based on random projections to improve the accuracy of clustering algorithms

  • Authors: Alberto Bertoni; Giorgio Valentini
  • Affiliations: DSI, Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Milano, Italia (both authors)
  • Venue: WIRN'05 Proceedings of the 16th Italian conference on Neural Nets
  • Year: 2005

Abstract

We present an algorithmic scheme for unsupervised cluster ensembles, based on randomized projections between metric spaces, by which a substantial dimensionality reduction is obtained. Multiple clusterings are performed on random subspaces, approximately preserving the distances between the projected data, and then they are combined using a pairwise similarity matrix; in this way the accuracy of each “base” clustering is maintained, and the diversity between them is improved. The proposed approach is effective for clustering problems characterized by high dimensional data, as shown by our preliminary experimental results.
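
The scheme summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Gaussian projection matrix, the plain Lloyd's k-means used as base clusterer, the 0.5 threshold, and the connected-components consensus step are all assumptions chosen for brevity, and the function names (`random_projection`, `kmeans`, `coassociation`, `consensus_labels`) are hypothetical. Only the overall shape follows the abstract: cluster several random low-dimensional projections, then combine the results through a pairwise similarity matrix.

```python
import numpy as np

def random_projection(X, d_sub, rng):
    """Project the rows of X (n x d) onto d_sub dimensions.
    Assumption: a scaled Gaussian matrix, which approximately
    preserves pairwise distances (Johnson-Lindenstrauss style)."""
    d = X.shape[1]
    R = rng.normal(size=(d, d_sub)) / np.sqrt(d_sub)
    return X @ R

def kmeans(X, k, rng, n_iter=50):
    """Plain Lloyd's k-means used as the base clusterer (an assumption)."""
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):  # skip empty clusters
                centers[j] = X[labels == j].mean(axis=0)
    return labels

def coassociation(X, k, d_sub, n_runs, seed=0):
    """Pairwise similarity matrix: M[i, j] is the fraction of base
    clusterings that place points i and j in the same cluster."""
    rng = np.random.default_rng(seed)
    n = len(X)
    M = np.zeros((n, n))
    for _ in range(n_runs):
        labels = kmeans(random_projection(X, d_sub, rng), k, rng)
        M += labels[:, None] == labels[None, :]
    return M / n_runs

def consensus_labels(M, threshold=0.5):
    """Consensus step (an assumption): connected components of the
    graph linking points whose similarity reaches the threshold."""
    n = len(M)
    labels = -np.ones(n, dtype=int)
    current = 0
    for i in range(n):
        if labels[i] >= 0:
            continue
        labels[i] = current
        stack = [i]
        while stack:
            u = stack.pop()
            for v in np.flatnonzero((M[u] >= threshold) & (labels < 0)):
                labels[v] = current
                stack.append(v)
        current += 1
    return labels

# Toy high-dimensional data: two well-separated groups in 1000 dimensions.
data_rng = np.random.default_rng(42)
X = np.vstack([
    data_rng.normal(0.0, 1.0, size=(20, 1000)),
    data_rng.normal(10.0, 1.0, size=(20, 1000)),
])

M = coassociation(X, k=2, d_sub=50, n_runs=10)
labels = consensus_labels(M)
```

Here each base clustering runs in 50 dimensions instead of 1000, and because every run draws a fresh projection, the base clusterings differ from one another while the entries of `M` average out their disagreements.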