We study some stability properties of algorithms that minimize (or almost-minimize) empirical error over Donsker classes of functions. We show that, as the number n of samples grows, the L_2-diameter of the set of almost-minimizers of empirical error with tolerance ξ(n) = o(n^{-1/2}) converges to zero in probability. Hence, even when the expected error has multiple minimizers, it becomes less and less likely, as n increases, that adding a sample (or a number of samples) to the training set will result in a large jump to a new hypothesis. Moreover, under some assumptions on the entropy of the class, together with an assumption of Komlós-Major-Tusnády type, we derive a power rate of decay for the diameter of almost-minimizers. This rate, through an application of a uniform ratio limit inequality, is shown to govern the closeness of the expected errors of the almost-minimizers. In fact, under the above assumptions, the expected errors of almost-minimizers become closer at a rate strictly faster than n^{-1/2}.
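The main claim of the abstract can be written out more explicitly as follows. This is a minimal LaTeX sketch: the symbols F (the Donsker class), P and P_n (the underlying distribution and the empirical measure), and the set M_{ξ(n)} of almost-minimizers are notation assumed here for illustration, since the abstract itself does not fix any symbols.

\documentclass{article}
\usepackage{amsmath,amssymb}
\begin{document}
% Assumed notation: $\mathcal{F}$ is the Donsker class, $P$ the underlying
% distribution, $P_n$ the empirical measure on $n$ i.i.d. samples, and
% $Pf$, $P_n f$ the expected and empirical errors of a hypothesis $f$.
% Set of empirical almost-minimizers with tolerance $\xi(n)$:
\[
  \mathcal{M}_{\xi(n)}
  = \Bigl\{ f \in \mathcal{F} :
      P_n f \le \inf_{g \in \mathcal{F}} P_n g + \xi(n) \Bigr\},
  \qquad \xi(n) = o\bigl(n^{-1/2}\bigr).
\]
% Statement from the abstract: the $L_2(P)$-diameter of this set of
% almost-minimizers vanishes in probability as the sample size grows.
\[
  \operatorname{diam}_{L_2(P)}\bigl(\mathcal{M}_{\xi(n)}\bigr)
  \;\xrightarrow{\ \mathbb{P}\ }\; 0
  \qquad (n \to \infty).
\]
\end{document}

Under the additional entropy and Komlós-Major-Tusnády-type assumptions mentioned above, the abstract states that this diameter in fact decays at a power rate, and that the expected errors Pf of the almost-minimizers agree up to a term of order strictly smaller than n^{-1/2}.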