Constrained Markov decision models with weighted discounted rewards
Mathematics of Operations Research
Piecewise linear dynamic programming for constrained POMDPs
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Opportunistic Fair Scheduling in Wireless Networks: An Approximate Dynamic Programming Approach
Mobile Networks and Applications
Point-based value iteration for constrained POMDPs
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Hi-index | 0.00 |
We consider semicontinuous controlled Markov models in discrete time with total expected losses. Only control strategies which meet a set of given constraint inequalities are admissible. One has to build an optimal admissible strategy. The main result consists in the constructive development of optimal strategy with the help of the dynamic programming method. The model studied covers the case of a finite horizon and the case of a homogeneous discounted model with different discount factors.