Mode seeking clustering by KNN and mean shift evaluated

  • Authors:
  • Robert P. W. Duin;Ana L. N. Fred;Marco Loog;Elżbieta Pękalska

  • Affiliations:
  • Pattern Recognition Laboratory, Delft University of Technology, The Netherlands;Department of Electrical and Computer Engineering, Instituto Superior Técnico (IST - Technical University of Lisbon), Portugal;Pattern Recognition Laboratory, Delft University of Technology, The Netherlands;School of Computer Science, University of Manchester, United Kingdom

  • Venue:
  • SSPR'12/SPR'12 Proceedings of the 2012 Joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering by mode seeking is most popular using the mean shift algorithm. A less well known alternative with different properties on the computational complexity is kNN mode seeking, based on the nearest neighbor rule instead of the Parzen kernel density estimator. It is faster and allows for much higher dimensionalities. We compare the performances of both procedures using a number of labeled datasets. The retrieved clusters are compared with the given class labels. In addition, the properties of the procedures are investigated for prototype selection. It is shown that kNN mode seeking is well performing and is feasible for large scale problems with hundreds of dimensions and up to a hundred thousand data points. The mean shift algorithm may perform better than kNN mode seeking for smaller dataset sizes.