Classification of high dimensional and imbalanced hyperspectral imagery data
IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Robust classification of imbalanced data using one-class and two-class SVM-based multiclassifiers
Intelligent Data Analysis - Business Analytics and Intelligent Optimization
Hi-index | 0.00 |
Feature selection is an important topic in data mining, especially for high dimensional datasets. Filtering techniques in particular have received much attention, but detailed comparisons of their performance is lacking. This work considers three filters using classifier performance metrics and six commonly-used filters. All nine filtering techniques are compared and contrasted using five different microarray expression datasets. In addition, given that these datasets exhibit an imbalance between the number of positive and negative examples, the utilization of sampling techniques in the context of feature selection is examined.