A genetic algorithm for cluster analysis

  • Authors:
  • Eduardo R. Hruschka;Nelson F. f. Ebecken

  • Affiliations:
  • COPPE / Federal University of Rio de Janeiro, Brasil. E-mail: erh@onda.com.br;E-mail: nelson@ntt.ufrj.br

  • Venue:
  • Intelligent Data Analysis
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a new approach to find the right clustering of a dataset. We have developed a genetic algorithm to perform this task. A simple encoding scheme that yields to constant-length chromosomes is used. The objective function maximizes both the homogeneity within each cluster and the heterogeneity among clusters. Besides, the clustering genetic algorithm also finds the right number of clusters according to the Average Silhouette Width criterion. We have also developed specific genetic operators that are context-sensitive. Four examples are presented to illustrate the efficacy of the proposed method.