Fast value iteration for goal-directed Markov decision processes

  • Authors:
  • Nevin L. Zhang; Weihong Zhang

  • Affiliations:
  • Department of Computer Science, Hong Kong University of Science & Technology;Department of Computer Science, Hong Kong University of Science & Technology

  • Venue:
  • UAI'97 Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence
  • Year:
  • 1997

Abstract

Planning problems where effects of actions are non-deterministic can be modeled as Markov decision processes. Planning problems are usually goal-directed. This paper proposes several techniques for exploiting the goal-directedness to accelerate value iteration, a standard algorithm for solving Markov decision processes. Empirical studies have shown that the techniques can bring about significant speedups.
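
For orientation, the following is a minimal sketch of plain value iteration on a small goal-directed problem. It shows only the textbook baseline that the abstract refers to, not the acceleration techniques the paper proposes. The three-state chain, the step reward of -1, the 0.8 success probability, and all function and variable names are illustrative assumptions, not taken from the paper.

```python
def value_iteration(states, actions, transition, reward, goal, gamma=1.0, tol=1e-8):
    """Textbook value iteration with in-place (Gauss-Seidel style) updates.

    transition(s, a) returns a dict {next_state: probability}.
    reward(s, a, s2) returns the immediate reward of that transition.
    The goal state is treated as absorbing with value 0 (an assumption).
    """
    values = {s: 0.0 for s in states}
    while True:
        delta = 0.0
        for s in states:
            if s == goal:
                continue  # goal value stays fixed at 0
            # Bellman backup: best expected one-step reward plus future value.
            best = max(
                sum(p * (reward(s, a, s2) + gamma * values[s2])
                    for s2, p in transition(s, a).items())
                for a in actions
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values


# Hypothetical example: a chain 0 -> 1 -> 2 where state 2 is the goal.
STATES = [0, 1, 2]
ACTIONS = ["go", "stay"]
GOAL = 2


def transition(s, a):
    # "go" moves one step right with probability 0.8, otherwise stays put.
    if a == "go":
        return {s + 1: 0.8, s: 0.2}
    return {s: 1.0}


def reward(s, a, s2):
    # Every step taken before reaching the goal costs 1.
    return -1.0


values = value_iteration(STATES, ACTIONS, transition, reward, GOAL)
```

In this sketch each value is the negated expected number of steps to reach the goal, so states closer to the goal end up with values closer to zero.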