To combine steady-state genetic algorithm and ensemble learning for data clustering

  • Authors: Yi Hong; Sam Kwong
  • Affiliations: Department of Computer Science, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong (both authors)
  • Venue: Pattern Recognition Letters
  • Year: 2008

Abstract

This paper proposes a data clustering algorithm that combines a steady-state genetic algorithm with ensemble learning, termed the genetic-guided clustering algorithm with ensemble learning operator (GCEL). GCEL uses the steady-state genetic algorithm to perform the search, but replaces its traditional recombination operator with an ensemble learning operator. GCEL thereby avoids the clustering invalidity and context insensitivity problems of the traditional recombination operator of genetic algorithms. In addition, GCEL generates its initial population of candidate clustering solutions using the random subspaces method, so fewer fitness evaluations are required to converge. GCEL is tested on one synthetic and several real data sets. Experimental results show that GCEL achieves a comparable or better clustering solution with fewer fitness evaluations than several existing genetic-guided clustering algorithms.
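
The overall loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the fitness function, the consensus method inside the ensemble learning operator, or any parameter values. The sketch assumes negative within-cluster sum of squares as fitness, a co-association matrix clustered with k-means as the ensemble operator, and k-means on random feature subsets for the random subspaces initialization. All function names (`kmeans`, `fitness`, `random_subspace_init`, `ensemble_operator`, `gcel_sketch`) and default parameters are hypothetical.

```python
import numpy as np

def kmeans(X, k, rng, iters=10):
    """Plain Lloyd's k-means; returns one integer label per row of X."""
    centers = X[rng.choice(len(X), size=k, replace=False)]  # copy, not a view
    labels = np.zeros(len(X), dtype=int)
    for _ in range(iters):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            if np.any(labels == c):  # keep the old center if a cluster empties
                centers[c] = X[labels == c].mean(axis=0)
    return labels

def fitness(X, labels):
    """Assumed fitness: negative within-cluster sum of squares (higher is better)."""
    total = 0.0
    for c in np.unique(labels):
        pts = X[labels == c]
        total += ((pts - pts.mean(axis=0)) ** 2).sum()
    return -total

def random_subspace_init(X, k, pop_size, rng):
    """Each individual is a partition found on a random subset of features."""
    n_feat = X.shape[1]
    pop = []
    for _ in range(pop_size):
        m = rng.integers(1, n_feat + 1)  # subspace size between 1 and n_feat
        feats = rng.choice(n_feat, size=m, replace=False)
        pop.append(kmeans(X[:, feats], k, rng))
    return pop

def ensemble_operator(parents, k, rng):
    """Assumed consensus step: cluster the rows of the co-association matrix."""
    n = len(parents[0])
    co = np.zeros((n, n))
    for labels in parents:
        # co[i, j] counts the parents that put points i and j in the same cluster
        co += labels[:, None] == labels[None, :]
    co /= len(parents)
    return kmeans(co, k, rng)

def gcel_sketch(X, k, pop_size=8, n_parents=3, steps=20, seed=0):
    rng = np.random.default_rng(seed)
    pop = random_subspace_init(X, k, pop_size, rng)
    fit = [fitness(X, p) for p in pop]
    history = [max(fit)]  # best fitness after initialization and after each step
    for _ in range(steps):
        idx = rng.choice(pop_size, size=n_parents, replace=False)
        child = ensemble_operator([pop[i] for i in idx], k, rng)
        child_fit = fitness(X, child)
        worst = int(np.argmin(fit))
        if child_fit > fit[worst]:  # steady-state: replace only the worst member
            pop[worst], fit[worst] = child, child_fit
        history.append(max(fit))
    return pop[int(np.argmax(fit))], history

if __name__ == "__main__":
    demo_rng = np.random.default_rng(1)
    # Two well-separated Gaussian blobs in four dimensions (synthetic demo data)
    X = np.vstack([demo_rng.normal(0.0, 0.3, (30, 4)),
                   demo_rng.normal(3.0, 0.3, (30, 4))])
    labels, history = gcel_sketch(X, k=2)
    print(len(np.unique(labels)), history[0], history[-1])
```

Because the child is built from the parents' co-association structure and not by swapping label positions, it is always a valid partition of all points, and it does not depend on how each parent happens to number its clusters. This is one plausible reading of how an ensemble operator sidesteps the invalidity and context-insensitivity problems mentioned above.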