Reasoning about MDPs as Transformers of Probability Distributions

  • Authors:
  • Vijay Anand Korthikanti, Mahesh Viswanathan, Gul Agha, YoungMin Kwon

  • Venue:
  • QEST '10: Proceedings of the 2010 Seventh International Conference on the Quantitative Evaluation of Systems
  • Year:
  • 2010

Abstract

We consider Markov Decision Processes (MDPs) as transformers of probability distributions: once a scheduler resolves the nondeterminism, the behavior of the MDP is a sequence of probability distributions over its states. Defining propositions by linear inequalities over distributions, one can reason about executions in the space of probability distributions. In this framework, one can analyze properties that cannot be expressed in logics like PCTL$^*$, such as bounds on transient rewards and on the expected values of random variables, and comparisons between the probability of being in one set of states at a given time and the probability of being in another set. We show that model checking MDPs with this semantics against $\omega$-regular properties is in general undecidable. We then identify special classes of propositions and schedulers for which the model checking problem becomes decidable. We demonstrate the potential usefulness of our results through an example.
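
To make the semantics concrete, the following small Python sketch (our illustration, not code from the paper; the 3-state chain, the particular proposition, and all names are assumptions) shows a distribution being pushed forward step by step through the Markov chain obtained after a memoryless scheduler fixes the choices, and a proposition given by a linear inequality being evaluated along the resulting sequence of distributions.

```python
# A minimal sketch of the distribution-transformer view (illustrative only).
import numpy as np

# Transition matrix of the Markov chain induced by fixing a memoryless
# scheduler for a hypothetical 3-state MDP (each row sums to 1).
P = np.array([
    [0.5, 0.5, 0.0],
    [0.0, 0.2, 0.8],
    [0.3, 0.0, 0.7],
])

# A proposition as a linear inequality over distributions:
# "the probability of being in {s0, s1} is at least that of being in {s2}",
# i.e.  mu[0] + mu[1] - mu[2] >= 0.
c, r = np.array([1.0, 1.0, -1.0]), 0.0
holds = lambda mu: float(c @ mu) >= r

# Under this scheduler the behavior is the sequence mu_0, mu_1, ... with
# mu_{t+1} = mu_t P; we record at which steps the proposition holds.
mu = np.array([1.0, 0.0, 0.0])      # initial distribution
trace = []
for t in range(10):
    trace.append(holds(mu))
    mu = mu @ P                     # one distribution-transformer step
print(trace)
```

The truth values recorded along such a sequence of distributions are what an $\omega$-regular property would be evaluated over in this semantics.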