Vector Valued Markov Decision Process for robot platooning
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
We study the multichain case of a vector-valued Markov decision process under the average-reward criterion. We characterize optimal deterministic stationary policies via systems of linear inequalities and discuss a policy iteration algorithm for finding all optimal deterministic stationary policies.
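
As a rough illustration of what "all optimal deterministic stationary policies" can mean in a multichain, vector-valued setting, the sketch below enumerates every deterministic stationary policy of a tiny model and keeps those whose average-reward vectors are not Pareto-dominated. It is brute-force enumeration, not the policy iteration algorithm or the linear-inequality characterization mentioned in the abstract. The toy model (`ACTIONS`), the restriction to deterministic transitions, and the choice of Pareto dominance taken over all start states as the optimality notion are assumptions made for this example only.

```python
from fractions import Fraction
from itertools import product

# Hypothetical toy model (not from the paper): 3 states, deterministic
# transitions, 2 reward components.
# ACTIONS[s] maps an action name to (next_state, reward_vector).
ACTIONS = {
    0: {"stay": (0, (1, 0)), "go": (1, (0, 0))},
    1: {"stay": (1, (0, 1)), "back": (0, (0, 0))},
    2: {"left": (0, (0, 0)), "right": (1, (0, 0))},
}


def gain(policy):
    """Average reward vector for each start state under a deterministic policy.

    With deterministic transitions every trajectory ends in a cycle, so the
    long-run average equals the mean reward vector over that cycle. Different
    start states may reach different cycles (the multichain case).
    """
    gains = {}
    for start in ACTIONS:
        seen, path, s = {}, [], start
        while s not in seen:
            seen[s] = len(path)          # position at which state s was visited
            nxt, reward = ACTIONS[s][policy[s]]
            path.append(reward)
            s = nxt
        cycle = path[seen[s]:]           # rewards collected along the cycle
        k = len(cycle[0])
        gains[start] = tuple(
            Fraction(sum(r[i] for r in cycle), len(cycle)) for i in range(k)
        )
    return gains


def dominates(g, h):
    """True if g is at least as good as h everywhere and strictly better somewhere."""
    pairs = [(a, b) for s in g for a, b in zip(g[s], h[s])]
    return all(a >= b for a, b in pairs) and any(a > b for a, b in pairs)


def pareto_optimal_policies():
    """Enumerate all deterministic stationary policies and keep the undominated ones."""
    states = sorted(ACTIONS)
    policies = [
        dict(zip(states, choice))
        for choice in product(*(ACTIONS[s] for s in states))
    ]
    gains = [gain(p) for p in policies]
    return [
        p for p, g in zip(policies, gains)
        if not any(dominates(h, g) for h in gains)
    ]


if __name__ == "__main__":
    for p in pareto_optimal_policies():
        print(p, gain(p))
```

Enumeration grows exponentially with the number of states, which is one reason a policy iteration scheme is of interest for models of realistic size.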