Variance and Bias for General Loss Functions

Authors:
Gareth M. James
Affiliations:
Marshall School of Business, University of Southern California, USA. gareth@usc.edu
Venue:
Machine Learning
Year:
2003

Cited by 20:

A Bias-Variance Analysis of a Real World Learning Problem: The CoIL Challenge 2000. Machine Learning
Bias-Variance Analysis of Support Vector Machines for the Development of SVM-Based Ensemble Methods. The Journal of Machine Learning Research
Extremely randomized trees. Machine Learning
Swarm bias-variance analysis of an evolutionary neural network classifier. AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
RotBoost: A technique for combining Rotation Forest and AdaBoost. Pattern Recognition Letters
A lazy bagging approach to classification. Pattern Recognition
A bias/variance decomposition for models using collective inference. Machine Learning
Feature Ranking Ensembles for Facial Action Unit Classification. ANNPR '08 Proceedings of the 3rd IAPR workshop on Artificial Neural Networks in Pattern Recognition
Practical Bias Variance Decomposition. AI '08 Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Bayesian classifiers based on kernel density estimation: Flexible classifiers. International Journal of Approximate Reasoning
The Bias Variance Trade-Off in Bootstrapped Error Correcting Output Code Ensembles. MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
Zone analysis: a visualization framework for classification problems. Artificial Intelligence Review
Spectral coefficients and classifier correlation. MCS'03 Proceedings of the 4th international conference on Multiple classifier systems
Statistical inference of minimum BD estimators and classifiers for varying-dimensional models. Journal of Multivariate Analysis
Stop wasting time: on predicting the success or failure of learning for industrial applications. IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Grid-based retargeting with transformation consistency smoothing. MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
A comparison of random forest with ECOC-based classifiers. MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
DEA based data preprocessing for maximum decisional efficiency linear case valuation models. Expert Systems with Applications: An International Journal
Two New Prediction-Driven Approaches to Discrete Choice Prediction. ACM Transactions on Management Information Systems (TMIS)
An analysis of how ensembles of collective classifiers improve predictions in graphs. Proceedings of the 21st ACM international conference on Information and knowledge management

Abstract

When using squared error loss, bias and variance and their decomposition of prediction error are well understood and widely used concepts. However, there is no universally accepted definition for other loss functions. Numerous attempts have been made to extend these concepts beyond squared error loss. Most approaches have focused solely on 0-1 loss functions and have produced significantly different definitions. These differences stem from disagreement as to the essential characteristics that variance and bias should display. This paper suggests an explicit list of rules that we feel any “reasonable” set of definitions should satisfy. Using this framework, bias and variance definitions are produced which generalize to any symmetric loss function. We illustrate these statistics on several loss functions with particular emphasis on 0-1 loss. We conclude with a discussion of the various definitions that have been proposed in the past as well as a method for estimating these quantities on real data sets.
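
As a point of reference for the squared error case that the abstract calls well understood, the following is a minimal simulation sketch and is not taken from the paper. It assumes a toy setting chosen purely for illustration: a Gaussian target, a deliberately shrunken sample-mean predictor, and a test response drawn independently of the training sample. The function name and all parameter values are hypothetical. Under those assumptions the expected squared prediction error splits into noise, squared bias, and variance, and the code checks this numerically. It does not implement the paper's general definitions for other loss functions.

```python
import random
import statistics


def squared_error_decomposition(n_trials=20000, n_train=10, seed=0):
    """Monte Carlo check of E[(Y - Yhat)^2] = Var(Y) + bias^2 + Var(Yhat).

    Illustrative assumptions (not from the paper): Y ~ N(2, 1), and the
    predictor is 0.8 times the mean of a fresh training sample, so it is
    biased on purpose. Y and Yhat are independent in each trial.
    """
    rng = random.Random(seed)
    true_mean, noise_sd, shrink = 2.0, 1.0, 0.8

    predictions, targets = [], []
    for _ in range(n_trials):
        # Draw a new training set and fit the (shrunken) mean predictor.
        train = [rng.gauss(true_mean, noise_sd) for _ in range(n_train)]
        predictions.append(shrink * statistics.fmean(train))
        # Draw an independent test response from the same distribution.
        targets.append(rng.gauss(true_mean, noise_sd))

    squared_errors = [(y - p) ** 2 for y, p in zip(targets, predictions)]
    return {
        "mse": statistics.fmean(squared_errors),
        "noise": statistics.pvariance(targets),        # Var(Y)
        "variance": statistics.pvariance(predictions),  # Var(Yhat)
        "bias_sq": (statistics.fmean(targets)
                    - statistics.fmean(predictions)) ** 2,
    }


if __name__ == "__main__":
    parts = squared_error_decomposition()
    total = parts["noise"] + parts["bias_sq"] + parts["variance"]
    print(parts)
    print("sum of components:", total)
```

The two printed totals should agree up to Monte Carlo error, since the only term dropped from the empirical identity is the sample covariance between the independent target and prediction. For a loss such as 0-1 loss no such additive identity holds automatically, which is the gap the abstract describes.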