Clustering by random projections

  • Authors:
  • Thierry Urruty;Chabane Djeraba;Dan A. Simovici

  • Affiliations:
  • LIFL-UMR CNRS, Laboratoire d'Informatique Fondamentale de Lille, Université de Lille 1, France;LIFL-UMR CNRS, Laboratoire d'Informatique Fondamentale de Lille, Université de Lille 1, France;University of Massachusetts Boston, Department of Computer Science, Boston, Massachusetts

  • Venue:
  • ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering algorithms for multidimensional numerical data must overcome special difficulties due to the irregularities of data distribution. We present a clustering algorithm for numerical data that combines ideas from random projection techniques and density-based clustering. The algorithm consists of two phases: the first phase that entails the use of random projections to detect clusters, and the second phase that consists of certain post-processing techniques of clusters obtained by several random projections. Experiments were performed on synthetic data consisting of randomly-generated points in Rn, synthetic images containing colored regions randomly distributed, and, finally, real images. Our results suggest the potential of our algorithm for image segmentation.