Feature selection using genetic algorithm and cluster validation

  • Authors:
  • Yi-Leh Wu;Cheng-Yuan Tang;Maw-Kae Hor;Pei-Fen Wu

  • Affiliations:
  • Dept. of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan;Dept. of Information Management, Huafan University, Taipei, Taiwan;Dept. of Computer Science, National Chengchi University, Taipei, Taiwan;Dept. of Information Management, Huafan University, Taipei, Taiwan

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2011

Quantified Score

Hi-index 12.05

Visualization

Abstract

Feature selection plays an important role in image retrieval systems. The better selection of features usually results in higher retrieval accuracy. This work tries to select the best feature set from a total of 78 low level image features, including regional, color, and textual features, using the genetic algorithms (GA). However, the GA is known to be slow to converge. In this work we propose two directions to improve the convergence time of the GA. First we employ the Taguchi method to reduce the number of necessary offspring to be tested in every generation in the GA. Second we propose to use an alternative measure, the Hubert's @C statistics, to evaluate the fitness of each offspring instead of evaluating the retrieval accuracy directly. The experiment results show that the proposed techniques improve the feature selection results by using the GA in both time and accuracy.