Reasoning about MDPs as Transformers of Probability Distributions

  • Authors:
  • Vijay Anand Korthikanti, Mahesh Viswanathan, Gul Agha, YoungMin Kwon

  • Venue:
  • QEST '10: Proceedings of the 2010 Seventh International Conference on the Quantitative Evaluation of Systems
  • Year:
  • 2010

Abstract

We consider Markov Decision Processes (MDPs) as transformers of probability distributions: once a scheduler resolves the nondeterminism, the behavior of the MDP is a sequence of probability distributions over its states. Defining propositions by linear inequalities over distributions, one can reason about executions in the space of probability distributions. In this framework, one can analyze properties that cannot be expressed in logics like PCTL$^*$, such as bounds on transient rewards and on the expected values of random variables, and comparisons between the probability of being in one set of states at a given time and the probability of being in another set. We show that model checking MDPs with this semantics against $\omega$-regular properties is in general undecidable. We then identify special classes of propositions and schedulers for which the model checking problem becomes decidable. We demonstrate the potential usefulness of our results through an example.
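
To make the semantics concrete, the following small Python sketch (our illustration, not code from the paper; the 3-state chain, the particular proposition, and all names are assumptions) shows a distribution being pushed forward step by step through the Markov chain obtained after a memoryless scheduler fixes the choices, and a proposition given by a linear inequality being evaluated along the resulting sequence of distributions.

```python
# A minimal sketch of the distribution-transformer view (illustrative only).
import numpy as np

# Transition matrix of the Markov chain induced by fixing a memoryless
# scheduler for a hypothetical 3-state MDP (each row sums to 1).
P = np.array([
    [0.5, 0.5, 0.0],
    [0.0, 0.2, 0.8],
    [0.3, 0.0, 0.7],
])

# A proposition as a linear inequality over distributions:
# "the probability of being in {s0, s1} is at least that of being in {s2}",
# i.e.  mu[0] + mu[1] - mu[2] >= 0.
c, r = np.array([1.0, 1.0, -1.0]), 0.0
holds = lambda mu: float(c @ mu) >= r

# Under this scheduler the behavior is the sequence mu_0, mu_1, ... with
# mu_{t+1} = mu_t P; we record at which steps the proposition holds.
mu = np.array([1.0, 0.0, 0.0])      # initial distribution
trace = []
for t in range(10):
    trace.append(holds(mu))
    mu = mu @ P                     # one distribution-transformer step
print(trace)
```

The truth values recorded along such a sequence of distributions are what an $\omega$-regular property would be evaluated over in this semantics.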