Bootstrap and Cross-Validation to Assess Complexity of Data-Driven Regression Models

  • Authors:
  • Willi Sauerbrei;Martin Schumacher

  • Affiliations:
  • -;-

  • Venue:
  • ISMDA '00 Proceedings of the First International Symposium on Medical Data Analysis
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

The number of potential variables included into a regression model is often too large and a more parsimonious model may be preferable. Selection strategies are widely used, but there are few analytical results about their properties. To investigate problems as replication stability, model complexity and selection bias we use bootstrap and cross-validation methods. For stepwise strategies, we discuss the importance of the predefined selection level. The methods are illustrated by investigating prognostic factors for survival time of patients with malignant glioma in the framework of a Cox regression model.