Coaching variables for regression and classification

Authors:
Robert Tibshirani;Geoffrey Hinton
Affiliations:
Department of Public Health Sciences and Department of Statistics, University of Toronto, Toronto, Ontario, Canada;Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
Venue:
Statistics and Computing
Year:
1998

Citing 1
Cited 3

Hierarchical mixtures of experts and the EM algorithm

Neural Computation

Generalised indirect classifiers

Computational Statistics & Data Analysis
Frequentist Model Averaging with missing observations

Computational Statistics & Data Analysis
Video summarization via transferrable structured learning

Proceedings of the 20th international conference on World wide web

Quantified Score

Hi-index	0.00

Visualization

Abstract

In a regression or classification setting where we wish to predict Y from x1,x2,…, xp, we suppose that an additional set of ’coaching‘ variables z1,z2,…, zm are available in our training sample. These might be variables that are difficult to measure, and they will not be available when we predict Y from x1,x2,…, xp in the future. We consider two methods of making use of the coaching variables in order to improve the prediction of Y from x1,x2,…, xp. The relative merits of these approaches are discussed and compared in a number of examples.