The purpose of this paper is to study loss functions for multiclass classification. In classification problems, a decision function is estimated by minimizing an empirical loss function, and the output label is then predicted from the estimated decision function. We propose a class of loss functions obtained by deforming the log-likelihood loss function. There are four main reasons to focus on deformed log-likelihood loss functions: (1) this class of loss functions has not been deeply investigated so far; (2) computationally, a boosting algorithm with a pseudo-loss is available to minimize the proposed loss functions; (3) the proposed loss functions provide a clear correspondence between decision functions and the conditional probabilities of the output labels; and (4) the proposed loss functions are statistically consistent with respect to the classification error rate, a desirable property in classification problems. Based on (3), we show that the deformed log-likelihood loss yields a model of mislabeling that is useful as a statistical model for medical diagnosis. We also propose a loss function for multiclass classification that is robust against outliers, constructed with the same approach; it is a natural extension of an existing robust loss function for binary classification. Both the mislabeling model and the robust loss function help cope with noisy data. Numerical studies are presented to demonstrate the robustness of the proposed loss function, and a mathematical characterization of the deformed log-likelihood loss function is also given.
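To make the idea of a "deformed" log-likelihood loss concrete, the sketch below assumes the deformation is the Tsallis q-logarithm, a standard choice in the robust-statistics literature; the paper's actual deformation and its boosting algorithm are not reproduced here, and the names q_log and deformed_log_loss are hypothetical.

import numpy as np

def q_log(x, q):
    # q-logarithm: (x**(1-q) - 1) / (1 - q); recovers log(x) as q -> 1.
    if np.isclose(q, 1.0):
        return np.log(x)
    return (x ** (1.0 - q) - 1.0) / (1.0 - q)

def deformed_log_loss(probs, label, q):
    # Negative deformed log-likelihood of the observed label.
    return -q_log(probs[label], q)

# A confidently misclassified (possibly mislabeled) example:
probs = np.array([0.98, 0.01, 0.01])  # model's conditional probabilities
label = 2                              # observed (perhaps noisy) label

print(deformed_log_loss(probs, label, q=1.0))  # ordinary log loss: ~4.61, unbounded as p -> 0
print(deformed_log_loss(probs, label, q=0.5))  # deformed loss: 1.8, bounded above by 1/(1-q) = 2

Under this assumed deformation, for q < 1 the loss is bounded above by 1/(1-q), so a single mislabeled example cannot dominate the empirical risk; this boundedness is the intuition behind robustness to outliers.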