A genetic rule-based data clustering toolkit

  • Authors:
  • I. Sarafis;A. M. S. Zalzala;P. W. Trinder

  • Affiliations:
  • Dept. of Comput. & Electr. Eng., Heriot-Watt Univ., Edinburgh, UK;Dept. of Comput. & Electr. Eng., Heriot-Watt Univ., Edinburgh, UK;Dept. of Comput. & Electr. Eng., Heriot-Watt Univ., Edinburgh, UK

  • Venue:
  • CEC '02 Proceedings of the Evolutionary Computation on 2002. CEC '02. Proceedings of the 2002 Congress - Volume 02
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Clustering is a hard combinatorial problem and is defined as the unsupervised classification of patterns. The formation of clusters is based on the principle of maximizing the similarity between objects of the same cluster while simultaneously minimizing the similarity between objects belonging to distinct clusters. This paper presents a tool for database clustering using a rule-based genetic algorithm (RBCGA). RBCGA evolves individuals consisting of a fixed set of clustering rules, where each rule includes d non-binary intervals, one for each feature. The investigations attempt to alleviate certain drawbacks related to the classical minimization of square-error criterion by suggesting a flexible fitness function which takes into consideration, cluster asymmetry, density, coverage and homogeny.