Towards analysis of semi-Markov decision processes

  • Authors:
  • Taolue Chen;Jian Lu

  • Affiliations:
  • FMT, University of Twente, The Netherlands;State Key Laboratory, Novel Software Technology, Nanjing University, China

  • Venue:
  • AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We investigate Semi-Markov Decision Processes (SMDPs). Two problems are studied, namely, the time-bounded reachability problem and the long-run average fraction of time problem. The former aims to compute the maximal (or minimum) probability to reach a certain set of states within a given time bound. We obtain a Bellman equation to characterize the maximal time-bounded reachability probability, and suggest two approaches to solve it based on discretization and randomized techniques respectively. The latter aims to compute the maximal (or minimum) average amount of time spent in a given set of states during the long run. We exploit a graph-theoretic decomposition of the given SMDP based on maximal end components and reduce it to linear programming problems.