The bias-variance tradeoff and the randomized GACV
Proceedings of the 1998 conference on Advances in neural information processing systems II
Classification of gene functions using support vector machine for time-course gene expression data
Computational Statistics & Data Analysis
Computational Statistics & Data Analysis
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Bayesian classification for bivariate normal gene expression
Computational Statistics & Data Analysis
Non-parametric detection of meaningless distances in high dimensional data
Statistics and Computing
A fuzzy intelligent approach to the classification problem in gene expression data analysis
Knowledge-Based Systems
Hi-index | 0.03 |
Due to recent interest in the analysis of DNA microarray data, new methods have been considered and developed in the area of statistical classification. In particular, according to the gene expression profile of existing data, the goal is to classify the sample into a relevant diagnostic category. However, when classifying outcomes into certain cancer types, it is often the case that some genes are not important, while some genes are more important than others. A novel algorithm is presented for selecting such relevant genes referred to as marker genes for cancer classification. This algorithm is based on the Support Vector Machine (SVM) and Supervised Weighted Kernel Clustering (SWKC). To investigate the performance of this algorithm, the methods were applied to a simulated data set and some real data sets. For comparison, some other well-known methods such as Prediction Analysis of Microarrays (PAM), Support Vector Machine-Recursive Feature Elimination (SVM-RFE), and a Structured Polychotomous Machine (SPM) were considered. The experimental results indicate that the proposed SWKC/SVM algorithm is conceptually much simpler and performs more efficiently than other existing methods used in identifying marker genes for cancer classification. Furthermore, the SWKC/SVM algorithm has the advantage that it requires much less computing time compared with the other existing methods.