PEGASUS: A policy search method for large MDPs and POMDPs
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Monte Carlo Strategies in Scientific Computing
Many inference and optimization tasks in machine learning can be solved by sampling approaches such as Markov Chain Monte Carlo (MCMC) and simulated annealing. These methods can be slow if a single target density query requires many runs of a simulation (or a complete sweep of a training data set). We introduce a hierarchy of MCMC samplers that allow most steps to be taken in the solution space using only a small sample of simulation runs (or training examples). This is shown to accelerate learning in a policy search optimization task.
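The core idea in the abstract, taking MCMC steps using only a small subsample of simulation runs or training examples rather than a full sweep, can be illustrated with a minimal sketch. This is not the paper's algorithm; it is a generic random-walk Metropolis sampler in which each step evaluates a subsampled, rescaled log-likelihood. All names, the Gaussian model, and the subsampling scheme are illustrative assumptions.

```python
import math
import random

# Hypothetical setting (not from the paper): a scalar parameter `theta`
# whose full log-likelihood requires a sweep over all data points.
random.seed(0)
DATA = [random.gauss(2.0, 1.0) for _ in range(10_000)]

def subsampled_log_density(theta, sample):
    """Unnormalized Gaussian log-likelihood of `theta`, estimated on a
    small subsample and rescaled to the full data-set size."""
    scale = len(DATA) / len(sample)
    return scale * sum(-0.5 * (x - theta) ** 2 for x in sample)

def metropolis_subsampled(n_steps, subsample_size, step=0.5):
    """Random-walk Metropolis where each accept/reject decision uses a
    fresh subsample instead of a complete sweep of the data."""
    theta = 0.0
    for _ in range(n_steps):
        sample = random.sample(DATA, subsample_size)
        proposal = theta + random.gauss(0.0, step)
        # Evaluate current and proposed points on the SAME subsample so
        # the comparison is not dominated by subsampling noise.
        log_alpha = (subsampled_log_density(proposal, sample)
                     - subsampled_log_density(theta, sample))
        if math.log(random.random()) < log_alpha:
            theta = proposal
    return theta
```

With only 100 of the 10,000 points examined per step, the chain still concentrates near the data mean; the trade-off is extra noise in each accept/reject decision, which is the cost the abstract's hierarchical construction is designed to control.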