On the convexity of policy regions in partially observed systems
Operations Research
Some monotonicity results for partially observed Markov decision processes
Operations Research
Introduction to Stochastic Search and Optimization
Introduction to Stochastic Search and Optimization
Partially Observed Markov Decision Process Multiarmed Bandits---Structural Results
Mathematics of Operations Research
IEEE Transactions on Signal Processing
Algorithms for optimal scheduling and management of hidden Markovmodel sensors
IEEE Transactions on Signal Processing
Hi-index | 0.00 |
This paper deals with the solving multivariate partially observed Markov decision process (POMDPs). We give sufficient conditions on the cost function, dynamics of the Markov chain target and observation probabilities so that the optimal scheduling policy has a threshold structure with respect to the multivariate TP2 ordering. We present stochastic approximation algorithms to estimate the parameterized threshold policy.