Survival prediction using gene expression data: A review and comparison

Authors:
Wessel N. van Wieringen;David Kun;Regina Hampel;Anne-Laure Boulesteix
Affiliations:
Department of Mathematics, Vrije Universiteit, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands;Department of Mathematics, Vrije Universiteit, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands;Institute for Medical Statistics and Epidemiology, Technical University of Munich, Ismaningerstr. 22, D-81675 Munich, Germany and Institute of Epidemiology, Helmholtz Zentrum München, German ...;Institute for Medical Statistics and Epidemiology, Technical University of Munich, Ismaningerstr. 22, D-81675 Munich, Germany and Sylvia Lawry Centre for Multiple Sclerosis Research, Hohenlindener ...
Venue:
Computational Statistics & Data Analysis
Year:
2009

Citing 13
Cited 4

Bagging predictors

Machine Learning
Dimension reduction methods for microarrays with application to censored survival data

Bioinformatics
Partial Cox regression analysis for high-dimensional microarray gene expression data

Bioinformatics
A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis

Bioinformatics
Testing association of a pathway with survival using gene expression data

Bioinformatics
Use of extreme patient samples for outcome prediction from gene expression data

Bioinformatics
Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data

Bioinformatics
CASPAR: a hierarchical bayesian approach to predict survival times in cancer from gene expression data

Bioinformatics
Survival analysis of longitudinal microarrays

Bioinformatics
An overview on the shrinkage properties of partial least squares regression

Computational Statistics
WilcoxCV

Bioinformatics
Survival analysis of microarray expression data by transformation models

Computational Biology and Chemistry
Comparison of tree-based methods for prognostic stratification of survival data

Artificial Intelligence in Medicine

Editorial: Statistical genetics & statistical genomics: Where biology, epistemology, statistics, and computation collide

Computational Statistics & Data Analysis
A two-component Weibull mixture to model early and late mortality in a Bayesian framework

Computational Statistics & Data Analysis
Gene Selection Using Iterative Feature Elimination Random Forests for Survival Outcomes

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A new variable selection approach using Random Forests

Computational Statistics & Data Analysis

Quantified Score

Hi-index	0.03

Visualization

Abstract

Knowledge of transcription of the human genome might greatly enhance our understanding of cancer. In particular, gene expression may be used to predict the survival of cancer patients. Microarray data are characterized by their high-dimensionality: the number of covariates (p~1000) greatly exceeds the number of samples (n~100), which is a considerable challenge in the context of survival prediction. An inventory of methods that have been used to model survival using gene expression is given. These methods are critically reviewed and compared in a qualitative way. Next, these methods are applied to three real-life data sets for a quantitative comparison. The choice of the evaluation measure of predictive performance is crucial for the selection of the best method. Depending on the evaluation measure, either the L"2-penalized Cox regression or the random forest ensemble method yields the best survival time prediction using the considered gene expression data sets. Consensus on the best evaluation measure of predictive performance is needed.