Qualitative analysis of partially-observable Markov decision processes

  • Authors: Krishnendu Chatterjee, Laurent Doyen, Thomas A. Henzinger
  • Affiliations: Institute of Science and Technology Austria; LSV, ENS Cachan & CNRS, France; Institute of Science and Technology Austria
  • Venue: MFCS'10: Proceedings of the 35th International Conference on Mathematical Foundations of Computer Science
  • Year: 2010

Abstract

We study observation-based strategies for partially-observable Markov decision processes (POMDPs) with parity objectives. An observation-based strategy relies on partial information about the history of a play, namely, on the past sequence of observations. We consider qualitative analysis problems: given a POMDP with a parity objective, decide whether there exists an observation-based strategy to achieve the objective with probability 1 (almost-sure winning), or with positive probability (positive winning). Our main results are twofold. First, we present a complete picture of the computational complexity of the qualitative analysis problem for POMDPs with parity objectives and its subclasses: safety, reachability, Büchi, and coBüchi objectives. We establish several upper and lower bounds that were not known in the literature. Second, we give optimal bounds (matching upper and lower bounds) for the memory required by pure and randomized observation-based strategies for each class of objectives.
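To make the objects in the abstract concrete, the following is a minimal Python sketch, not taken from the paper: the representation and all names are illustrative. It shows a POMDP with per-state parity priorities, the belief-support update that an observation-based strategy can track (the set of states that have positive probability given the observation history), and, as one example of a qualitative analysis, the standard subset-construction check for almost-sure safety. The parity priorities are carried along only to match the abstract; the check below handles safety objectives only.

```python
from typing import Dict, FrozenSet, List


class POMDP:
    """Illustrative POMDP: states, actions, a transition function
    (state, action) -> {state: probability}, a deterministic observation
    map state -> observation, and a parity priority for each state."""

    def __init__(self, states, actions, transitions, obs, priority):
        self.states = set(states)
        self.actions = set(actions)
        self.transitions = transitions  # dict: (state, action) -> {state: prob}
        self.obs = obs                  # dict: state -> observation
        self.priority = priority        # dict: state -> non-negative priority

    def support_successors(self, support: FrozenSet, action) -> List[FrozenSet]:
        """Split the states reachable with positive probability from `support`
        under `action` by the observation they produce; this is the
        belief-support update available to an observation-based strategy."""
        reachable = set()
        for s in support:
            reachable.update(t for t, p in self.transitions[(s, action)].items() if p > 0)
        by_obs: Dict[object, set] = {}
        for t in reachable:
            by_obs.setdefault(self.obs[t], set()).add(t)
        return [frozenset(v) for v in by_obs.values()]


def almost_sure_safe(pomdp: POMDP, safe, initial_support) -> bool:
    """Greatest-fixpoint check on belief supports: a support B is 'good' if it
    contains only safe states and some action keeps every observation-compatible
    successor support good.  Almost-sure safety holds iff the initial support
    is good (the standard subset construction for safety under partial
    observation)."""
    safe = set(safe)
    init = frozenset(initial_support)

    # Collect all belief supports reachable from the initial one.
    reachable_supports = {init}
    frontier = [init]
    while frontier:
        b = frontier.pop()
        for a in pomdp.actions:
            for succ in pomdp.support_successors(b, a):
                if succ not in reachable_supports:
                    reachable_supports.add(succ)
                    frontier.append(succ)

    # Greatest fixpoint: start from all safe supports, remove any support for
    # which no action keeps all successor supports good.
    good = {b for b in reachable_supports if b <= safe}
    changed = True
    while changed:
        changed = False
        for b in list(good):
            if not any(all(succ in good for succ in pomdp.support_successors(b, a))
                       for a in pomdp.actions):
                good.discard(b)
                changed = True
    return init in good
```

This sketch is only meant to illustrate the setting; the paper's contribution is the complexity and memory bounds for the full hierarchy of objectives (safety, reachability, Büchi, coBüchi, parity), for which the plain support construction above is not the whole story.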