Generalization performance of regularization networks and support vector machines via entropy numbers of compact operators

  • Authors:
  • R. C. Williamson;A. J. Smola;B. Scholkopf

  • Affiliations:
  • Res. Sch. of Inf. Sci., Australian Nat. Univ., Canberra, ACT;-;-

  • Venue:
  • IEEE Transactions on Information Theory
  • Year:
  • 2001

Quantified Score

Hi-index 754.84

Visualization

Abstract

We derive new bounds for the generalization error of kernel machines, such as support vector machines and related regularization networks by obtaining new bounds on their covering numbers. The proofs make use of a viewpoint that is apparently novel in the field of statistical learning theory. The hypothesis class is described in terms of a linear operator mapping from a possibly infinite-dimensional unit ball in feature space into a finite-dimensional space. The covering numbers of the class are then determined via the entropy numbers of the operator. These numbers, which characterize the degree of compactness of the operator can be bounded in terms of the eigenvalues of an integral operator induced by the kernel function used by the machine. As a consequence, we are able to theoretically explain the effect of the choice of kernel function on the generalization performance of support vector machines