Model Selection for Small Sample Regression

  • Authors:
  • Olivier Chapelle;Vladimir Vapnik;Yoshua Bengio

  • Affiliations:
  • LIP6, 15 rue du Capitaine Scott, 75015 Paris, France. olivier.chapelle@liple.fr;AT&T Research Labs, 200 Laurel Avenue, Middletown, NJ 07748, USA. vlad@research.att.com;Dept. IRO, CP 6128, Université de Montréal, Succ. Centre-Ville, 2920 Chemin de la tour, Montréal, Québec, Canada, H3C 3J7. bengioy@IRO.UMontreal.CA

  • Venue:
  • Machine Learning
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Model selection is an important ingredient of many machine learning algorithms, in particular when the sample size in small, in order to strike the right trade-off between overfitting and underfitting. Previous classical results for linear regression are based on an asymptotic analysis. We present a new penalization method for performing model selection for regression that is appropriate even for small samples. Our penalization is based on an accurate estimator of the ratio of the expected training error and the expected generalization error, in terms of the expected eigenvalues of the input covariance matrix.