Adaptive control of constrained finite Markov chains

Authors:
A. S. Poznyak;K. Najim
Affiliations:
CINVESTAV-IPN, A.P. 14-740, Seccion de Control Automatico, 07000 México D.F., Mexico;Process Control Laboratory E.N.S.I.G.C., Chemin de la loge, 31078 Toulouse cedex, France
Venue:
Automatica (Journal of IFAC)
Year:
1999

Citing 15
Cited 1

Sensitivity of constrained Markov decision processes

Annals of Operations Research
Adaptive control of constrained Markov chains: criteria and policies

Annals of Operations Research
Discrete-time controlled Markov processes with average cost criterion: a survey

SIAM Journal on Control and Optimization
Learning automata: theory and applications

Learning automata: theory and applications
Constrained discounted dynamic programming

Mathematics of Operations Research
Asymptotically Efficient Adaptive Choice of Control Laws inControlled Markov Chains

SIAM Journal on Control and Optimization
Optimal adaptive policies for Markov decision processes

Mathematics of Operations Research
Adaptive Markov Control Processes

Adaptive Markov Control Processes
Dynamic Programming and Stochastic Control

Dynamic Programming and Stochastic Control
Random Iterative Models

Random Iterative Models
Learning Automata and Stochastic Optimization

Learning Automata and Stochastic Optimization
Finite State Markovian Decision Processes

Finite State Markovian Decision Processes
Stochastic approximation of constrained systems with system and constraint noise

Automatica (Journal of IFAC)
An adaptive automaton controller for discrete-time markov processes

Automatica (Journal of IFAC)
On constrained Markov decision processes

Operations Research Letters

Adaptive policy for two finite Markov chains zero-sum stochastic game with unknown transition matrices and average payoffs

Automatica (Journal of IFAC)

Quantified Score

Hi-index	22.15

Visualization

Abstract

An adaptive control algorithm is presented for constrained finite controlled Markov chains with unknown transition probabilities. A finite set of algebraic constraints has been considered. The Lagrange multipliers approach is used to solve this constrained optimization problem. This scheme is such that at each time n estimates the control policy on the basis on Bush-Mosteller scheme which is related to stochastic approximation procedures. We present the asymptotic properties (convergence and order of convergence rate) of the algorithm. They follow from the law of dependent large numbers, martingales theory and Lyapunov function analysis approaches.