K-Harmonic Means - A Spatial Clustering Algorithm with Boosting

  • Authors:
  • Bin Zhang;Meichun Hsu;Umeshwar Dayal

  • Affiliations:
  • -;-;-

  • Venue:
  • TSDM '00 Proceedings of the First International Workshop on Temporal, Spatial, and Spatio-Temporal Data Mining-Revised Papers
  • Year:
  • 2000

Quantified Score

Hi-index 0.01

Visualization

Abstract

We propose a new center-based iterative clustering algorithm, KHarmonic Means (KHM), which is essentially insensitive to the initialization of the centers, demonstrated through a set of experiments. The dependency of the K-Means performance on the initialization of the centers has been a major problem; a similar issue exists for an alternative algorithm, Expectation Maximization (EM). Many have tried to generate good initializations to solve the sensitivity problem. KHM addresses the intrinsic problem by replacing the minimum distance from a data point to the centers, used in K-means, by the Harmonic Averages of the distances from the data point to all centers. KHM significantly improves the quality of clustering results comparing with both K-Means and EM. The KHM algorithm has been implemented in both sequential and parallel languages and tested on hundreds of randomly generated datasets with different data distribution and clustering characteristics.