Supervised feature selection by clustering using conditional mutual information-based distances

  • Authors:
  • José Martínez Sotoca;Filiberto Pla

  • Affiliations:
  • Institute of New Imaging Technologies, Dept. Llenguatges i Sistemes Informátics, Universitat Jaume I, Campus de Riu Sec, 12071 Castellón, Spain;Institute of New Imaging Technologies, Dept. Llenguatges i Sistemes Informátics, Universitat Jaume I, Campus de Riu Sec, 12071 Castellón, Spain

  • Venue:
  • Pattern Recognition
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, a supervised feature selection approach is presented, which is based on metric applied on continuous and discrete data representations. This method builds a dissimilarity space using information theoretic measures, in particular conditional mutual information between features with respect to a relevant variable that represents the class labels. Applying a hierarchical clustering, the algorithm searches for a compression of the information contained in the original set of features. The proposed technique is compared with other state of art methods also based on information measures. Eventually, several experiments are presented to show the effectiveness of the features selected from the point of view of classification accuracy.