A Mean Shift-Based Initialization Method for K-means

  • Authors:
  • Ivan Cabria;Iker Gondra

  • Affiliations:
  • -;-

  • Venue:
  • CIT '12 Proceedings of the 2012 IEEE 12th International Conference on Computer and Information Technology
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Because of its conceptual simplicity, k-means is one of the most commonly used clustering algorithms. However, its performance in terms of global optimality depends heavily on both the selection of k and the selection of the initial cluster centers. On the other hand, Mean Shift clustering does not rely upon a priori knowledge of the number of clusters. Furthermore, it finds the modes of the underlying probability density function of the observations, which would be a good choice of initial cluster centers for k-means. We present a Mean Shift-based initialization method for k-means. A comparative study of the proposed and other initialization methods is performed on two real-life problems with very large amounts of data: Facility Location and Molecular Dynamics. In the study, the proposed initialization method outperforms the other methods in terms of clustering performance.