We consider on-line density estimation with a parameterized density from the exponential family. The on-line algorithm receives one example at a time and maintains a parameter that is essentially an average of the past examples. After receiving an example, the algorithm incurs a loss equal to the negative log-likelihood of the example with respect to the algorithm's past parameter. An off-line algorithm may choose the best parameter based on all of the examples. We prove bounds on the additional total loss of the on-line algorithm over the total loss of the off-line algorithm. These relative loss bounds hold for an arbitrary sequence of examples. The goal is to design algorithms with the best possible relative loss bounds. We use a certain divergence to derive and analyze the algorithms; this divergence is a relative entropy between two distributions from the exponential family.
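The following is a minimal illustrative sketch of the protocol described above, not the paper's algorithm: it instantiates the exponential family as a unit-variance Gaussian, for which the maintained parameter (the mean) is exactly the running average of past examples, and compares the on-line total loss against the best off-line parameter chosen in hindsight. All names (`online_relative_loss`, `theta0`, etc.) are assumptions made for the example.

```python
import math

def neg_log_likelihood(x, theta):
    """Negative log-likelihood of x under a Gaussian N(theta, 1)."""
    return 0.5 * math.log(2 * math.pi) + 0.5 * (x - theta) ** 2

def online_relative_loss(examples, theta0=0.0):
    """Total loss of the on-line protocol minus the total loss of the
    best single off-line parameter, for a unit-variance Gaussian."""
    theta = theta0                 # parameter before any example is seen
    total_online_loss = 0.0
    for t, x in enumerate(examples, start=1):
        # Loss is charged against the PAST parameter, before the update.
        total_online_loss += neg_log_likelihood(x, theta)
        # Update: keep theta equal to the average of the examples so far.
        theta += (x - theta) / t

    # Off-line comparator: the best parameter in hindsight, which for a
    # unit-variance Gaussian is the sample mean of all the examples.
    best_theta = sum(examples) / len(examples)
    total_offline_loss = sum(neg_log_likelihood(x, best_theta) for x in examples)
    return total_online_loss - total_offline_loss

if __name__ == "__main__":
    data = [0.3, -1.2, 0.8, 0.1, 1.5]
    print("additional on-line loss over off-line:", online_relative_loss(data))
```

In this Gaussian instance, the divergence mentioned in the abstract specializes to the relative entropy between N(theta1, 1) and N(theta2, 1), which equals (theta1 - theta2)^2 / 2; the general exponential-family analysis in the paper works with the corresponding divergence between two members of the family.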