On properties of predictors derived with a two-step bootstrap model averaging approach-A simulation study in the linear regression model

Authors:
Anika Buchholz;Norbert Holländer;Willi Sauerbrei
Affiliations:
Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg, Stefan-Meier-Strasse 26, 79104 Freiburg, Germany;Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg, Stefan-Meier-Strasse 26, 79104 Freiburg, Germany;Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg, Stefan-Meier-Strasse 26, 79104 Freiburg, Germany
Venue:
Computational Statistics & Data Analysis
Year:
2008

Citing 2
Cited 3

Bagging predictors

Machine Learning
Stock and bond return predictability: the discrimination power of model selection criteria

Computational Statistics & Data Analysis

Investigation about a screening step in model selection

Statistics and Computing
Frequentist Model Averaging with missing observations

Computational Statistics & Data Analysis
Weighted average least squares estimation with nonspherical disturbances and an application to the Hong Kong housing market

Computational Statistics & Data Analysis

Quantified Score

Hi-index	0.03

Visualization

Abstract

In many applications of model selection there is a large number of explanatory variables and thus a large set of candidate models. Selecting one single model for further inference ignores model selection uncertainty. Often several models fit the data equally well. However, these models may differ in terms of the variables included and might lead to different predictions. To account for model selection uncertainty, model averaging procedures have been proposed. Recently, an extended two-step bootstrap model averaging approach has been proposed. The first step of this approach is a screening step. It aims to eliminate variables with negligible effect on the outcome. In the second step the remaining variables are considered in bootstrap model averaging. A large simulation study is performed to compare the MSE and coverage rate of models derived with bootstrap model averaging, the full model, backward elimination using Akaike and Bayes information criterion and the model with the highest selection probability in bootstrap samples. In a data example, these approaches are also compared with Bayesian model averaging. Finally, some recommendations for the development of predictive models are given.