Accuracy of suboptimal solutions to kernel principal component analysis

  • Authors:
  • Giorgio Gnecco;Marcello Sanguineti

  • Affiliations:
  • Department of Communications, Computer and System Sciences (DIST), University of Genova, Genoa, Italy 16145 and Department of Mathematics (DIMA), University of Genova, Genoa, Italy 16146;Department of Communications, Computer and System Sciences (DIST), University of Genova, Genoa, Italy 16145

  • Venue:
  • Computational Optimization and Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

For Principal Component Analysis in Reproducing Kernel Hilbert Spaces (KPCA), optimization over sets containing only linear combinations of all n-tuples of kernel functions is investigated, where n is a positive integer smaller than the number of data. Upper bounds on the accuracy in approximating the optimal solution, achievable without restrictions on the number of kernel functions, are derived. The rates of decrease of the upper bounds for increasing number n of kernel functions are given by the summation of two terms, one proportional to n 驴1/2 and the other to n 驴1, and depend on the maximum eigenvalue of the Gram matrix of the kernel with respect to the data. Primal and dual formulations of KPCA are considered. The estimates provide insights into the effectiveness of sparse KPCA techniques, aimed at reducing the computational costs of expansions in terms of kernel units.