Stable and Accurate Feature Selection

  • Authors: Gokhan Gulgezen; Zehra Cataltepe; Lei Yu

  • Affiliations: Computer Engineering Department, Istanbul Technical University, Istanbul, Turkey; Computer Engineering Department, Istanbul Technical University, Istanbul, Turkey; Computer Science Department, Binghamton University, Binghamton, USA

  • Venue: ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
  • Year: 2009

Abstract

In addition to accuracy, stability is a measure of success for a feature selection algorithm. Stability can be a particular concern when the number of samples in a data set is small and the dimensionality is high. In this study, we introduce a stability measure and measure both the accuracy and the stability of the MRMR (Minimum Redundancy Maximum Relevance) feature selection algorithm on different data sets. The two feature evaluation criteria used by MRMR, MID (Mutual Information Difference) and MIQ (Mutual Information Quotient), result in similar accuracies, but MID is more stable. We also introduce a new feature selection criterion, MIDα, where the redundancy and relevance of selected features are controlled by the parameter α.
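
To make the criteria named in the abstract concrete, the following is a minimal sketch of greedy MRMR selection on discrete features. It is an illustration under stated assumptions, not the authors' implementation. MID scores a candidate as relevance minus redundancy and MIQ as relevance divided by redundancy, which is the usual reading of the two names. The weighted variant is written here as alpha * relevance - (1 - alpha) * redundancy; that form, the name alpha, and the helper names mutual_information, mrmr, and jaccard are assumptions made for this sketch. The abstract does not define the paper's stability measure, so the Jaccard overlap at the end is only a generic stand-in for comparing two selected subsets.

```python
import numpy as np


def mutual_information(x, y):
    """Mutual information (in nats) between two discrete 1-D arrays."""
    _, xi = np.unique(np.asarray(x), return_inverse=True)
    _, yi = np.unique(np.asarray(y), return_inverse=True)
    xi, yi = xi.ravel(), yi.ravel()
    # Joint distribution estimated from co-occurrence counts.
    joint = np.zeros((xi.max() + 1, yi.max() + 1))
    np.add.at(joint, (xi, yi), 1)
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)
    py = joint.sum(axis=0, keepdims=True)
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log(joint[nz] / (px @ py)[nz])))


def mrmr(X, y, k, criterion="MID", alpha=0.5, eps=1e-12):
    """Greedy MRMR: return the indices of k selected feature columns.

    criterion is "MID", "MIQ", or "MID_alpha" (assumed weighted form).
    """
    X = np.asarray(X)
    n_features = X.shape[1]
    relevance = np.array(
        [mutual_information(X[:, j], y) for j in range(n_features)]
    )
    # Start with the single most relevant feature.
    selected = [int(np.argmax(relevance))]
    while len(selected) < min(k, n_features):
        best_j, best_score = None, -np.inf
        for j in range(n_features):
            if j in selected:
                continue
            # Redundancy: mean MI with the features chosen so far.
            redundancy = np.mean(
                [mutual_information(X[:, j], X[:, s]) for s in selected]
            )
            if criterion == "MID":
                score = relevance[j] - redundancy
            elif criterion == "MIQ":
                score = relevance[j] / (redundancy + eps)
            elif criterion == "MID_alpha":
                # Assumed form: alpha trades relevance against redundancy.
                score = alpha * relevance[j] - (1 - alpha) * redundancy
            else:
                raise ValueError(f"unknown criterion: {criterion}")
            if score > best_score:
                best_j, best_score = j, score
        selected.append(best_j)
    return selected


def jaccard(a, b):
    """Jaccard overlap of two feature subsets (stand-in stability proxy)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if (a | b) else 1.0


# Toy data: column 1 duplicates column 0; column 2 is independent of both.
f0 = np.array([0, 0, 0, 0, 1, 1, 1, 1])
f2 = np.array([0, 1, 0, 1, 0, 1, 0, 1])
X_toy = np.column_stack([f0, f0, f2])
y_toy = 2 * f0 + f2
subset_mid = mrmr(X_toy, y_toy, k=2, criterion="MID")
subset_miq = mrmr(X_toy, y_toy, k=2, criterion="MIQ")
```

On the toy data, both criteria should avoid picking the duplicated column twice, since its redundancy with the already selected copy cancels its relevance. A stability estimate in this spirit would rerun mrmr on resampled versions of the data and average jaccard over pairs of resulting subsets.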