C4.5: programs for machine learning
C4.5: programs for machine learning
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss
Machine Learning - Special issue on learning with probabilistic representations
Gene identification and survival prediction with Lp Cox regression and novel similarity measure
International Journal of Data Mining and Bioinformatics
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
An analysis of Bayesian classifiers
AAAI'92 Proceedings of the tenth national conference on Artificial intelligence
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Multi-platform gene-expression mining and marker gene analysis
International Journal of Data Mining and Bioinformatics
ML-Flex: a flexible toolbox for performing classification analyses in parallel
The Journal of Machine Learning Research
Hi-index | 0.00 |
Glioblastoma multiforme GBM, a highly aggressive form of brain cancer, results in a median survival of 12-15 months. For decades, researchers have explored the effects of clinical and molecular factors on this disease and have identified several candidate prognostic markers. In this study, we evaluated the use of multivariate classification models for differentiating between subsets of patients who survive a relatively long or short time. Data for this study came from The Cancer Genome Atlas TCGA, a public repository containing clinical, treatment, histological and biomolecular variables for hundreds of patients. We applied variable-selection and classification algorithms in a cross-validated design and observed that predictive performance of the resulting models varied substantially across the algorithms and categories of data. The best-performing models were based on age, treatments and global DNA methylation. In this paper, we summarise our findings, discuss lessons learned in analysing TCGA data and offer recommendations for performing such analyses.