Model Selection for Small Sample Regression

Authors:
Olivier Chapelle;Vladimir Vapnik;Yoshua Bengio
Affiliations:
LIP6, 15 rue du Capitaine Scott, 75015 Paris, France. olivier.chapelle@lip6.fr;AT&T Research Labs, 200 Laurel Avenue, Middletown, NJ 07748, USA. vlad@research.att.com;Dept. IRO, CP 6128, Université de Montréal, Succ. Centre-Ville, 2920 Chemin de la tour, Montréal, Québec, Canada, H3C 3J7. bengioy@IRO.UMontreal.CA
Venue:
Machine Learning
Year:
2002

Abstract

Model selection is an important ingredient of many machine learning algorithms, particularly when the sample size is small, because it strikes the right trade-off between overfitting and underfitting. Classical results for linear regression are based on an asymptotic analysis. We present a new penalization method for model selection in regression that is appropriate even for small samples. Our penalization is based on an accurate estimator of the ratio of the expected training error to the expected generalization error, expressed in terms of the expected eigenvalues of the input covariance matrix.
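
To make the idea of penalized model selection concrete, the following minimal sketch multiplies the training error of each candidate model by a penalty factor and keeps the model with the smallest product. It does not implement the estimator proposed in the paper, whose formula is not given in this abstract. As a stand-in it uses a classical asymptotic penalty, Akaike's final prediction error factor (1 + d/n) / (1 - d/n), for a model with d parameters fitted on n samples. The polynomial model family, the synthetic data, and the names fpe_factor and select_dimension are assumptions made purely for illustration.

```python
import numpy as np


def fpe_factor(d, n):
    """Akaike's final prediction error factor for d parameters and n samples."""
    if d >= n:
        # The factor is undefined once the model has as many parameters as samples.
        return float("inf")
    return (1.0 + d / n) / (1.0 - d / n)


def select_dimension(x, y, max_dim):
    """Return (best_d, scores); scores[d - 1] is the penalized training error
    of a least-squares polynomial fit with d coefficients."""
    n = len(x)
    scores = []
    for d in range(1, max_dim + 1):
        # Design matrix with columns 1, x, ..., x^(d-1).
        design = np.vander(x, d, increasing=True)
        coef, *_ = np.linalg.lstsq(design, y, rcond=None)
        train_mse = float(np.mean((design @ coef - y) ** 2))
        scores.append(train_mse * fpe_factor(d, n))
    best_d = int(np.argmin(scores)) + 1
    return best_d, scores


# Hypothetical small-sample problem: 20 noisy observations of a smooth target.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=20)
y = np.sin(3.0 * x) + 0.2 * rng.normal(size=20)

best_d, scores = select_dimension(x, y, max_dim=10)
print("selected number of coefficients:", best_d)
```

A small-sample penalty of the kind the abstract describes would replace fpe_factor with a different estimate of the ratio between the two errors, while the selection loop would stay the same.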