An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs

  • Authors:
  • Daniel Szer;François Charpillet

  • Affiliations:
  • MAIA Group, INRIA Lorraine – LORIA, Vandœuvre-lès-Nancy, France;MAIA Group, INRIA Lorraine – LORIA, Vandœuvre-lès-Nancy, France

  • Venue:
  • ECML'05 Proceedings of the 16th European conference on Machine Learning
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the domain of decentralized Markov decision processes, we develop the first complete and optimal algorithm that is able to extract deterministic policy vectors based on finite state controllers for a cooperative team of agents. Our algorithm applies to the discounted infinite horizon case and extends best-first search methods to the domain of decentralized control theory. We prove the optimality of our approach and give some first experimental results for two small test problems. We believe this to be an important step forward in learning and planning in stochastic multi-agent systems.