Random direction divisive clustering

  • Authors:
  • S. K. Tasoulis; D. K. Tasoulis; V. P. Plagianakos

  • Affiliations:
  • Computer Science and Biomedical Informatics, University of Central Greece, Papassiopoulou 2-4, Lamia 35100, Greece; Winton Capital Management, 1-5 St. Mary Abbot's Place, SW8 6LS London, United Kingdom; Computer Science and Biomedical Informatics, University of Central Greece, Papassiopoulou 2-4, Lamia 35100, Greece

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2013

Quantified Score

Hi-index 0.10

Abstract

Projection methods for dimension reduction have enabled the discovery of otherwise unattainable structure in ultra-high-dimensional data. More recently, one such method, Random Projection, has been shown to produce high-quality data representations with minimal computational effort, even when the data dimension is in the hundreds of thousands or millions. Here, we couple this dimension reduction technique with data clustering algorithms specially designed for high-dimensional data. First, we show that the theoretical properties of both components can be combined in a sound manner, promising an effective clustering framework. The experimental analysis then shows that, across a series of simulated and real ultra-high-dimensional data scenarios, the resulting algorithms achieve high-quality data partitions orders of magnitude faster.
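
As a rough illustration of the idea described in the abstract, the sketch below reduces the dimension with a Gaussian random projection and then performs a single divisive split along one random direction. This is not the authors' algorithm: the function names `random_projection` and `split_on_random_direction`, the Gaussian projection matrix, the largest-gap split rule, and the synthetic data are all assumptions made for illustration only.

```python
import numpy as np


def random_projection(X, k, rng):
    """Project the rows of X (n x d) onto k random Gaussian directions.

    Scaling by 1/sqrt(k) keeps squared norms unchanged in expectation.
    """
    d = X.shape[1]
    R = rng.standard_normal((d, k)) / np.sqrt(k)
    return X @ R


def split_on_random_direction(X, rng):
    """Bisect X along one random unit direction.

    Assumed split rule (illustrative only): cut at the midpoint of the
    largest gap between consecutive sorted 1-D projections.
    """
    d = X.shape[1]
    u = rng.standard_normal(d)
    u /= np.linalg.norm(u)
    p = (X - X.mean(axis=0)) @ u          # centered 1-D projections
    order = np.argsort(p)
    gaps = np.diff(p[order])              # gaps between sorted neighbours
    cut = int(np.argmax(gaps))
    threshold = 0.5 * (p[order[cut]] + p[order[cut + 1]])
    return p <= threshold                 # boolean mask: True = first part


rng = np.random.default_rng(0)

# Hypothetical data: two synthetic groups in 10,000 dimensions.
A = rng.standard_normal((50, 10_000))
B = rng.standard_normal((50, 10_000)) + 5.0
X = np.vstack([A, B])

Z = random_projection(X, k=20, rng=rng)    # 100 x 20 representation
labels = split_on_random_direction(Z, rng)  # one divisive step
```

A full divisive procedure would apply such a split recursively to the resulting parts, with a stopping rule. A single random direction is not guaranteed to separate the two groups, so in practice one would likely try several directions or use a more careful split criterion.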