Improving approximate value iteration using memories and predictive state representations

  • Authors:
  • Michael R. James;Ton Wessling;Nikos Vlassis

  • Affiliations:
  • AI and Robotics Group, Technical Research Dept., Toyota Technical Center;Intelligent Autonomous Systems Group, Informatics Institute, University of Amsterdam;Intelligent Autonomous Systems Group, Informatics Institute, University of Amsterdam

  • Venue:
  • AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Planning in partially-observable dynamical systems is a challenging problem, and recent developments in point-based techniques, such as Perseus significantly improve performance as compared to exact techniques. In this paper, we show how to apply these techniques to new models for non-Markovian dynamical systems called Predictive State Representatiolls (PSRs) and Memory-PSRs (mPSRs). PSRs and mPSRs are models of non-Markovian decision processes that differ from latent-variable models (e.g. HMMs, POMDPs) by representing state using only observable quantities. Further, mPSRs explicitly represent certain structural properties of the dynamical system that are also relevant to planning. We show how planning techniques can be adapted to leverage this structure to improve performance both in terms of execution time as well as quality of the resulting policy.