Predictive active set selection methods for Gaussian processes

Authors:
Ricardo Henao;Ole Winther
Affiliations:
DTU Informatics, Technical University of Denmark, Denmark;Bioinformatics Centre, University of Copenhagen, Denmark
Venue:
Neurocomputing
Year:
2012

Citing 10
Cited 0

Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond

Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
Training Invariant Support Vector Machines

Machine Learning
Gaussian Processes for Classification: Mean-Field Algorithms

Neural Computation
Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)

Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)
Assessing Approximate Inference for Binary Gaussian Process Classification

The Journal of Machine Learning Research
A Unifying View of Sparse Approximate Gaussian Process Regression

The Journal of Machine Learning Research
Building Support Vector Machines with Reduced Classifier Complexity

The Journal of Machine Learning Research
Sparse kernel SVMs via cutting-plane training

Machine Learning
Gaussian Processes for Object Categorization

International Journal of Computer Vision
Bayesian Generalized Kernel Mixed Models

The Journal of Machine Learning Research

Quantified Score

Hi-index	0.01

Visualization

Abstract

We propose an active set selection framework for Gaussian process classification for cases when the dataset is large enough to render its inference prohibitive. Our scheme consists of a two step alternating procedure of active set update rules and hyperparameter optimization based upon marginal likelihood maximization. The active set update rules rely on the ability of the predictive distributions of a Gaussian process classifier to estimate the relative contribution of a data point when being either included or removed from the model. This means that we can use it to include points with potentially high impact to the classifier decision process while removing those that are less relevant. We introduce two active set rules based on different criteria, the first one prefers a model with interpretable active set parameters whereas the second puts computational complexity first, thus a model with active set parameters that directly control its complexity. We also provide both theoretical and empirical support for our active set selection strategy being a good approximation of a full Gaussian process classifier. Our extensive experiments show that our approach can compete with state-of-the-art classification techniques with reasonable time complexity. Source code publicly available at http://cogsys.imm.dtu.dk/passgp.