Low bias histogram-based estimation of mutual information for feature selection

  • Authors:
  • Abdenour Hacine-Gharbi;Philippe Ravier;Rachid Harba;Tayeb Mohamadi

  • Affiliations:
  • PRISME laboratory, BP 6744 Orléans Cedex 2, University of Orléans, France and LMSE Laboratory, Electronics Department, The University Center of Bordj Bou Arréridj, Elanasser-Bordj B ...;PRISME laboratory, BP 6744 Orléans Cedex 2, University of Orléans, France;PRISME laboratory, BP 6744 Orléans Cedex 2, University of Orléans, France;Electronics Department, Faculty of technology, Ferhat Abbas University, Setif 19000, Algeria

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2012

Quantified Score

Hi-index 0.10

Visualization

Abstract

This paper presents a low bias histogram-based estimation of mutual information and its application to feature selection problems. By canceling the first order bias, the estimation avoids the bias accumulation problem that affects classical methods. As a consequence, on a synthetic feature selection problem, only the proposed method results in the exact number of features to be chosen in the Gaussian case when compared to four other approaches. In a speech recognition application, the proposed method and the Sturges method are the only ones that lead to a correct number of selected features in the noise free case. In the reduced data case, only the proposed method points out the optimal number of features to select. Finally, in the noisy case, only the proposed method leads to results of high quality; other methods show severely underestimated numbers of selected features.