Continuity of Optimal Values and Solutions for Control of Markov Chains with Constraints

Authors:
Mabel M. Tidball;Ariel Lombardi;Odile Pourtallier;Eitan Altman
Affiliations:
-;-;-;-
Venue:
SIAM Journal on Control and Optimization
Year:
2000

Citing 0
Cited 5

Convergence of population dynamics in symmetric routing games with a finite number of players

GameNets'09 Proceedings of the First ICST international conference on Game Theory for Networks
Brief paper: Towards the optimal control of Markov chains with constraints

Automatica (Journal of IFAC)
Methods to design optimal control of Markov process with finite state set in the presence of constraints

Automation and Remote Control
Multiscale fairness and its application to resource allocation in wireless networks

NETWORKING'11 Proceedings of the 10th international IFIP TC 6 conference on Networking - Volume Part II
Multiscale fairness and its application to resource allocation in wireless networks

Computer Communications

Quantified Score

Hi-index	0.00

Visualization

Abstract

We consider in this paper constrained Markov decision processes. This type of control model has many applications in telecommunications and other fields [E. Altman and A. Shwartz, IEEE Trans. Automat. Control, 34 (1989), pp. 1089--1102, E. A. Feinberg and M. I. Reiman, Probab. Engrg. Inform. Sci., 8 (1994), pp. 463--489, A. Hordijk and F. Spieksma, Adv. in Appl. Probab., 21 (1989), pp. 409--431, A. Lazar, IEEE Trans. Automat. Control, 28 (1983), pp. 1001--1007, P. Nain and K. W. Ross, IEEE Trans. Automat. Control, 31 (1986), pp. 883--888, K. W. Ross and B. Chen, IEEE Trans. Automat. Control, 33 (1988), pp. 261--267]. We address the issue of the convergence of the value and optimal policies of the problem with discounted costs, to the ones for the problem with expected average cost. We consider the general multichain ergodic structure. We present two stability results in this paper. We establish the continuity of optimal values and solutions of as well as some type of robustness of some suboptimal solutions in the discount factor. Our proof relies on same general theory on continuity of values and solutions in convex optimization that relies on well-known notions of $\Gamma$-convergence.