Minimum Redundancy Feature Selection from Microarray Gene Expression Data
CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
An introduction to variable and feature selection
The Journal of Machine Learning Research
ISBMDA'05 Proceedings of the 6th International conference on Biological and Medical Data Analysis
OVA scheme vs. single machine approach in feature selection for microarray datasets
ICDM'06 Proceedings of the 6th Industrial Conference on Data Mining conference on Advances in Data Mining: applications in Medicine, Web Mining, Marketing, Image and Signal Mining
Hi-index | 0.00 |
Feature selection is crucial to tumor classification due to the high dimensionality of microarray datasets. With the aid of the degree of differential prioritization (DDP) between relevance and antiredundancy, our proposed DDP-based feature selection technique is capable of achieving better accuracies than those reported in previous studies, while using fewer genes in the predictor set. Additionally, we discovered a strong correlation between the DDP parameter in our feature selection technique and the number of classes in the dataset. This leads us to question if the measure of relevance in our feature selection technique becomes less efficient at capturing the class-specific relevance for each individual class of the dataset as the number of classes increases. In this study, we analyze the class-specific relevance of the predictor sets found using our feature selection technique. The analysis ultimately lays down the theoretical foundation for a beneficial improvement to our feature selection technique.