Monte-Carlo expectation maximization for decentralized POMDPs

  • Authors:
  • Feng Wu;Shlomo Zilberstein;Nicholas R. Jennings

  • Affiliations:
  • School of Electronics and Computer Science, University of Southampton, UK;School of Computer Science, University of Massachusetts Amherst;School of Electronics and Computer Science, University of Southampton, UK

  • Venue:
  • IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We address two significant drawbacks of state-of-the-art solvers of decentralized POMDPs (DECPOMDPs): the reliance on complete knowledge of the model and limited scalability as the complexity of the domain grows. We extend a recently proposed approach for solving DEC-POMDPs via a reduction to the maximum likelihood problem, which in turn can be solved using EM. We introduce a model-free version of this approach that employs Monte-Carlo EM (MCEM). While a naïve implementation of MCEM is inadequate in multiagent settings, we introduce several improvements in sampling that produce high-quality results on a variety of DEC-POMDP benchmarks, including large problems with thousands of agents.