Observable subspace solution for irreducible POMDPs with infinite horizon

Authors:
Lu Yu;Richard Brooks
Affiliations:
Clemson University;Clemson University
Venue:
Proceedings of the Seventh Annual Workshop on Cyber Security and Information Intelligence Research
Year:
2011

Citing 3
Cited 0

Competitive Markov decision processes

Competitive Markov decision processes
Improving POMDP tractability via belief compression and clustering

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Partially Observable Markov Decision Processes and Performance Sensitivity Analysis

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Cyber--attacks on critical infrastructure have two main properties: (i) they are adversarial processes, and (ii) defenders will not have access to all the information they need. We, therefore, model these problems as discrete-time infinite-horizon partially observable Markov decision processes (POMDPs) with undiscounted average payoff. Our solution creates a model of the observable subspace of the original POMDP, and then finds the control policy of the observable system. This requires neither a priori information nor belief state update. We compare our approach with the currently used value iteration approximation methods. The proposed approach reduces computational overhead, and provides better solutions under certain conditions. Furthermore, the long-run average payoff obtained by our method is predictable, which helps the controller to determine whether the algorithm performance at an acceptable level before implementation.