Probabilistic reasoning in intelligent systems: networks of plausible inference
Probabilistic reasoning in intelligent systems: networks of plausible inference
Elements of information theory
Elements of information theory
C4.5: programs for machine learning
C4.5: programs for machine learning
Wrappers for feature subset selection
Artificial Intelligence - Special issue on relevance
Making large-scale support vector machine learning practical
Advances in kernel methods
Feature selection for high-dimensional genomic microarray data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
An introduction to variable and feature selection
The Journal of Machine Learning Research
Feature selection for classifying high-dimensional numerical data
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
The Impact of Gene Selection on Imbalanced Microarray Expression Data
BICoB '09 Proceedings of the 1st International Conference on Bioinformatics and Computational Biology
Parallel Selection of Informative Genes for Classification
BICoB '09 Proceedings of the 1st International Conference on Bioinformatics and Computational Biology
Expert Systems with Applications: An International Journal
The fuzzy gene filter: an adaptive fuzzy inference system for expression array feature selection
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part III
A dynamic over-sampling procedure based on sensitivity for multi-class problems
Pattern Recognition
Efficient semi-supervised learning on locally informative multiple graphs
Pattern Recognition
A two-stage evolutionary algorithm based on sensitivity and accuracy for multi-class problems
Information Sciences: an International Journal
A Top-r Feature Selection Algorithm for Microarray Gene Expression Data
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A multi-index ROC-based methodology for high throughput experiments in gene discovery
International Journal of Data Mining and Bioinformatics
Pattern Recognition
Hi-index | 0.01 |
We present a new method based on the ROC (Receiver Operating Characteristic) curve to efficiently select a feature subset in classifying a high-dimensional microarray dataset with a limited number of observations. Our method has two steps: (1) selecting the most relevant features to the target label using the ROC curve and (2) iteratively eliminating a redundant feature using the ROC curves. The ROC curve is strongly related with a non-parametric hypothesis testing, which must be effective for a dataset with small numerical observations. Experiments with real datasets revealed the significant performance advantage of our method over two competing feature subset selection methods.