Reversible jump MCMC simulated annealing for neural networks

  • Authors:
  • Christophe Andrieu; Nando de Freitas; Arnaud Doucet

  • Affiliations:
  • Engineering Department, Cambridge University, Cambridge, UK; UC Berkeley Computer Science Division, Berkeley, CA; Engineering Department, Cambridge University, Cambridge, UK

  • Venue:
  • UAI'00: Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence
  • Year:
  • 2000

Abstract

We propose a novel reversible jump Markov chain Monte Carlo (MCMC) simulated annealing algorithm to optimize radial basis function (RBF) networks. This algorithm enables us to maximize the joint posterior distribution of the network parameters and the number of basis functions. It performs a global search in the joint space of the parameters and number of parameters, thereby surmounting the problem of local minima. We also show that by calibrating a Bayesian model, we can obtain the classical AIC, BIC and MDL model selection criteria within a penalized likelihood framework. Finally, we show theoretically and empirically that the algorithm converges to the modes of the full posterior distribution in an efficient way.
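
The penalized likelihood criteria named in the abstract can be illustrated with a short sketch. It is not the authors' algorithm: it only scores candidate network sizes using the standard textbook forms AIC = -2 log L + 2k and BIC = -2 log L + k log N (the common asymptotic form of MDL equals BIC/2), with a Gaussian noise model assumed. The function names and the parameter count `k * (dim + 1) + 2` (one weight and `dim` center coordinates per basis function, plus a bias and a noise variance) are hypothetical choices made for illustration.

```python
import math


def gaussian_max_log_lik(rss, n_obs):
    """Gaussian log-likelihood maximized over the noise variance (sigma^2 = rss / n_obs)."""
    return -0.5 * n_obs * (math.log(2.0 * math.pi * rss / n_obs) + 1.0)


def penalized_criteria(log_lik, n_params, n_obs):
    """Return (AIC, BIC) in their standard forms; smaller is better."""
    aic = -2.0 * log_lik + 2.0 * n_params
    bic = -2.0 * log_lik + n_params * math.log(n_obs)
    return aic, bic


def select_num_basis(rss_by_k, n_obs, dim, criterion="BIC"):
    """Pick the number of basis functions k that minimizes the chosen criterion.

    rss_by_k maps k to the residual sum of squares of a fitted network with
    k basis functions (the fitting itself is outside this sketch).
    """
    best_k, best_score = None, float("inf")
    for k, rss in sorted(rss_by_k.items()):
        # Assumed parameter count: k weights, k * dim center coordinates,
        # one bias term, and one noise variance.
        n_params = k * (dim + 1) + 2
        aic, bic = penalized_criteria(gaussian_max_log_lik(rss, n_obs), n_params, n_obs)
        score = bic if criterion == "BIC" else aic
        if score < best_score:
            best_k, best_score = k, score
    return best_k
```

Scoring a fixed list of candidates like this is an exhaustive comparison. The abstract instead describes a stochastic global search over the parameters and the number of basis functions jointly, which avoids fitting every candidate size separately.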