MaskedPainter: Feature selection for microarray data analysis

Authors:
Daniele Apiletti; Elena Baralis; Giulia Bruno; Alessandro Fiori
Affiliations:
Dipartimento di Automatica e Informatica, Politecnico di Torino, Torino, Italy (all authors)
Venue:
Intelligent Data Analysis
Year:
2012

Abstract

Selecting a small number of discriminative genes from among thousands is a fundamental task in microarray data analysis. Effective feature selection lets biologists investigate a subset of genes rather than the entire set, filtering out insignificant, noisy, and redundant features. This paper presents MaskedPainter, a feature selection method for gene expression data. The method measures each gene's ability to separate samples belonging to different classes and ranks genes by an overlap score. A density-based technique smooths the effect of outliers on the overlap score computation. As in other approaches, the user can set the number of selected genes. Alternatively, the algorithm can automatically detect the minimum set of genes that yields the best classification coverage of the training samples. The effectiveness of the approach is demonstrated through an empirical study on public microarray datasets with different characteristics. Experimental results show that it yields higher classification accuracy than widely used feature selection techniques.
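
The abstract does not give the exact formulas, so the following Python sketch only illustrates the general idea and should not be read as the authors' implementation. It assumes that each gene gets one expression interval per class, taken here as median ± 2 standard deviations (a crude stand-in for the density-based outlier smoothing), that the overlap score is the summed pairwise interval overlap divided by the total span, and that a sample is "covered" by a gene when its value lies only in its own class interval. The minimum gene set is approximated with a greedy set cover. All function names and the `width` parameter are hypothetical.

```python
import numpy as np

def class_intervals(x, idx, n_classes, width=2.0):
    """One [lo, hi] interval per class for a single gene.
    ASSUMPTION: median +/- width * std replaces the paper's density-based step."""
    out = np.empty((n_classes, 2))
    for c in range(n_classes):
        v = x[idx == c]
        m, s = np.median(v), v.std()
        out[c] = (m - width * s, m + width * s)
    return out

def overlap_score(iv):
    """ASSUMED score: summed pairwise overlap / total span (lower is better)."""
    lo, hi = iv[:, 0], iv[:, 1]
    total = 0.0
    for i in range(len(iv)):
        for j in range(i + 1, len(iv)):
            total += max(0.0, min(hi[i], hi[j]) - max(lo[i], lo[j]))
    span = hi.max() - lo.min()
    # A gene with zero span is constant and carries no information.
    return total / span if span > 0 else np.inf

def coverage_mask(x, idx, iv):
    """True where a sample falls in its own class interval and in no other."""
    inside = (x[:, None] >= iv[:, 0]) & (x[:, None] <= iv[:, 1])
    own = inside[np.arange(len(x)), idx]
    return own & (inside.sum(axis=1) == 1)

def select_genes(X, y, k=None):
    """X: samples x genes, y: class labels.
    With k set, return the k genes with the lowest overlap score.
    Otherwise greedily pick genes until training-sample coverage stops growing."""
    classes, idx = np.unique(y, return_inverse=True)
    n, g = X.shape
    scores = np.empty(g)
    masks = np.zeros((g, n), dtype=bool)
    for j in range(g):
        iv = class_intervals(X[:, j], idx, len(classes))
        scores[j] = overlap_score(iv)
        masks[j] = coverage_mask(X[:, j], idx, iv)
    ranking = np.argsort(scores, kind="stable")
    if k is not None:
        return [int(j) for j in ranking[:k]]
    covered = np.zeros(n, dtype=bool)
    chosen = []
    while True:
        gains = (masks & ~covered).sum(axis=1)
        # max() keeps the first maximum, so ties go to the lower overlap score.
        best = max(ranking, key=lambda j: gains[j])
        if gains[best] == 0:
            break
        chosen.append(int(best))
        covered |= masks[best]
    return chosen
```

Calling `select_genes(X, y, k=50)` mirrors the user-defined gene count mentioned in the abstract, while `select_genes(X, y)` mirrors the automatic mode. The greedy cover is a common approximation and is not guaranteed to find the true minimum set.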