Influence diagrams with memory states: representation and algorithms

  • Authors:
  • Xiaojian Wu;Akshat Kumar;Shlomo Zilberstein

  • Affiliations:
  • Computer Science Department, University of Massachusetts, Amherst, MA;Computer Science Department, University of Massachusetts, Amherst, MA;Computer Science Department, University of Massachusetts, Amherst, MA

  • Venue:
  • ADT'11 Proceedings of the Second international conference on Algorithmic decision theory
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Influence diagrams (IDs) offer a powerful framework for decision making under uncertainty, but their applicability has been hindered by the exponential growth of runtime and memory usage--largely due to the no-forgetting assumption. We present a novel way to maintain a limited amount of memory to inform each decision and still obtain near-optimal policies. The approach is based on augmenting the graphical model with memory states that represent key aspects of previous observations--a method that has proved useful in POMDP solvers. We also derive an efficient EM-based message-passing algorithm to compute the policy. Experimental results show that this approach produces highquality approximate polices and offers better scalability than existing methods.