Instance-Based Learning Algorithms
Machine Learning
The nature of statistical learning theory
The nature of statistical learning theory
Statistical Pattern Recognition: A Review
IEEE Transactions on Pattern Analysis and Machine Intelligence
Proceedings of the 2002 ACM symposium on Applied computing
Machine Learning
P-tree classification of yeast gene deletion data
ACM SIGKDD Explorations Newsletter
Guest editorial: research on machine learning issues in biomedical informatics modeling
Journal of Biomedical Informatics - Special issue: Biomedical machine learning
Prediction of cis/trans isomerization using feature selection and support vector machines
Journal of Biomedical Informatics
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Artificial Intelligence in Medicine
A novel cancer classifier based on differentially expressed gene network
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
TC-VGC: A Tumor Classification System using Variations in Genes' Correlation
Computer Methods and Programs in Biomedicine
Gene expression classification using binary rule majority voting genetic programming classifier
International Journal of Advanced Intelligence Paradigms
Journal of Biomedical Informatics
Hi-index | 0.00 |
Classification analysis of microarray gene expression data has been widely used to uncover biological features and to distinguish closely related cell types that often appear in the diagnosis of cancer. However, the number of dimensions of gene expression data is often very high, e.g., in the hundreds or thousands. Accurate and efficient classification of such high-dimensional data remains a contemporary challenge. In this paper, we propose a comprehensive vertical sample-based KNN/LSVM classification approach with weights optimized by genetic algorithms for high-dimensional data. Experiments on common gene expression datasets demonstrated that our approach can achieve high accuracy and efficiency at the same time. The improvement of speed is mainly related to the vertical data representation, P-tree, and its optimized logical algebra. The high accuracy is due to the combination of a KNN majority voting approach and a local support vector machine approach that makes optimal decisions at the local level. As a result, our approach could be a powerful tool for high-dimensional gene expression data analysis.