A minority class feature selection method

  • Authors:
  • German Cuaya;Angélica Muñoz-Meléndez;Eduardo F. Morales

  • Affiliations:
  • Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Tonantzintla, México;Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Tonantzintla, México;Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Tonantzintla, México

  • Venue:
  • CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In many classification problems, and in particular in medical domains, it is common to have an unbalanced class distribution. This pose problems to classifiers as they tend to perform poorly in the minority class which is often the class of interest. One commonly used strategy that to improve the classification performance is to select a subset of relevant features. Feature selection algorithms, however, have not been designed to favour the classification performance of the minority class. In this paper, we present a novel filter feature selection algorithm, called FSMC, for unbalanced data sets. FSMC selects attributes that have minority class distributions significantly different from the majority class distributions. FSMC is fast, simple, selects a small number of features and outperforms in most cases other feature selection algorithms in terms of global accuracy and in terms of performance measures for the minority class such as precision, recall, F-measure and ROC values.