Analysis of gene expression profiles: class discovery and leaf ordering
Proceedings of the sixth annual international conference on Computational biology
Feature selection for high-dimensional genomic microarray data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Redundancy based feature selection for microarray data
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
A multi-population χ2 test approach to informative gene selection
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
A cost-function approach to rival penalized competitive learning (RPCL)
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Hi-index | 0.00 |
This paper presents an informative gene set selection approach to tumor diagnosis based on the Distance Sensitive Rival Penalized Competitive Learning (DSRPCL) algorithm and redundancy analysis. Since the DSRPCL algorithm can allocate an appropriate number of clusters for an input dataset automatically, we can utilize it to classify the genes (expressed by the gene expression levels of all the samples) into certain basic clusters. Then, we apply the post-filtering algorithm to each basic gene cluster to get the typical and independent informative genes. In this way we can obtain a compact set of informative genes. To test the effectiveness of the selected informative gene set, we utilize the support vector machine (SVM) to construct a tumor diagnosis system based on the express profiles of its genes. It is shown by the experiments that the proposed method can achieve a higher diagnosis accuracy with a smaller number of informative genes and less computational complexity in comparison with the previous ones.