An efficient hybrid data clustering method based on K-harmonic means and Particle Swarm Optimization

  • Authors:
  • Fengqin Yang;Tieli Sun;Changhai Zhang

  • Affiliations:
  • College of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China and College of Computer Science and Technology, Jilin University, Changchun, Jilin 130012, China;College of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China;College of Computer Science and Technology, Jilin University, Changchun, Jilin 130012, China

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2009

Quantified Score

Hi-index 12.06

Visualization

Abstract

Clustering is the process of grouping data objects into set of disjoint classes called clusters so that objects within a class are highly similar with one another and dissimilar with the objects in other classes. K-means (KM) algorithm is one of the most popular clustering techniques because it is easy to implement and works fast in most situations. However, it is sensitive to initialization and is easily trapped in local optima. K-harmonic means (KHM) clustering solves the problem of initialization using a built-in boosting function, but it also easily runs into local optima. Particle Swarm Optimization (PSO) algorithm is a stochastic global optimization technique. A hybrid data clustering algorithm based on PSO and KHM (PSOKHM) is proposed in this research, which makes full use of the merits of both algorithms. The PSOKHM algorithm not only helps the KHM clustering escape from local optima but also overcomes the shortcoming of the slow convergence speed of the PSO algorithm. The performance of the PSOKHM algorithm is compared with those of the PSO and the KHM clustering on seven data sets. Experimental results indicate the superiority of the PSOKHM algorithm.