Using genetic algorithm for selection of initial cluster centers for the K-means method

  • Authors:
  • Wojciech Kwedlo;Piotr Iwanowicz

  • Affiliations:
  • Faculty of Computer Science, Białystok University of Technology, Białystok, Poland;Faculty of Computer Science, Białystok University of Technology, Białystok, Poland

  • Venue:
  • ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part II
  • Year:
  • 2010

Abstract

The K-means algorithm is one of the most widely used clustering methods. However, the solutions it produces depend strongly on the initialization of the cluster centers. This paper proposes a novel genetic algorithm, called GAKMI (Genetic Algorithm for the K-Means Initialization), for selecting the initial cluster centers. In contrast to most approaches described in the literature, which encode the coordinates of cluster centers directly in a chromosome, our method uses binary encoding: bits set to one select elements of the learning set as initial cluster centers. Since not every binary chromosome encodes a feasible solution in this approach, we propose two repair algorithms that convert infeasible chromosomes into feasible ones. GAKMI was tested on three datasets with varying numbers of clusters. The experimental results are encouraging.
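
The binary encoding described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that a chromosome is feasible when exactly K bits are set to one, and it uses a simple random repair as a stand-in, since the abstract does not specify the two repair algorithms. The names `repair` and `decode` and the toy data are hypothetical.

```python
import random


def repair(chromosome, k, rng):
    """Return a copy of `chromosome` with exactly k bits set to one.

    Assumed random repair (not taken from the paper): surplus ones are
    cleared and missing ones are set at randomly chosen positions.
    """
    bits = list(chromosome)
    ones = [i for i, b in enumerate(bits) if b == 1]
    zeros = [i for i, b in enumerate(bits) if b == 0]
    if len(ones) > k:
        # Too many centers selected: clear randomly chosen surplus bits.
        for i in rng.sample(ones, len(ones) - k):
            bits[i] = 0
    elif len(ones) < k:
        # Too few centers selected: set randomly chosen zero bits.
        for i in rng.sample(zeros, k - len(ones)):
            bits[i] = 1
    return bits


def decode(chromosome, data):
    """Pick the learning-set elements whose bit is one as initial centers."""
    return [point for bit, point in zip(chromosome, data) if bit == 1]


# Toy learning set of five 2-D points (illustrative values only).
data = [(0.0, 0.0), (0.1, 0.2), (5.0, 5.1), (5.2, 4.9), (9.0, 0.5)]
rng = random.Random(0)

infeasible = [1, 1, 0, 1, 1]  # four bits set, but K = 2 centers are wanted
feasible = repair(infeasible, k=2, rng=rng)
centers = decode(feasible, data)
```

Each chromosome has one bit per element of the learning set, so the decoded centers are always actual data points. These centers would then seed an ordinary K-means run, whose resulting clustering quality could serve as the fitness of the chromosome.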