Gene Selection for Microarray Data by a LDA-Based Genetic Algorithm
PRIB '08 Proceedings of the Third IAPR International Conference on Pattern Recognition in Bioinformatics
A memetic algorithm for gene selection and molecular classification of cancer
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
An Iterative GASVM-Based Method: Gene Selection and Classification of Microarray Data
IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part II: Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living
On the effectiveness of gene selection for microarray classification methods
ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part II
A new combined filter-wrapper framework for gene subset selection with specialized genetic operators
MCPR'10 Proceedings of the 2nd Mexican conference on Pattern recognition: Advances in pattern recognition
ICIC'11 Proceedings of the 7th international conference on Intelligent Computing: bio-inspired computing and applications
ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories and Technology
Hi-index | 0.00 |
In this paper, we present a gene selection method based on genetic algorithm (GA) and support vector machines (SVM) for cancer classification. First, the Wilcoxon rank sum test is used to filter noisy and redundant genes in high dimensional microarray data. Then, the different highly informative genes subsets are selected by GA/SVM using different training sets. The final subset, consisting of highly discriminating genes, is obtained by analyzing the frequency of appearance of each gene in the different gene subsets. The proposed method is tested on three open datasets: leukemia, breast cancer, and colon cancer data. The results show that the proposed method has excellent selection and classification performance, especially for breast cancer data, which can yield 100% classification accuracy using only four genes.