The Relations Among Potentials, Perturbation Analysis,and Markov Decision Processes

Authors:
Xi-Ren Cao
Affiliations:
The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong. E-mail eecao@ee.ust.hk
Venue:
Discrete Event Dynamic Systems
Year:
1998

Citing 6
Cited 13

Using the QR factorization and group inversion to compute, differentiate ,and estimate the sensitivity of stationary probabilities for markov chains

SIAM Journal on Algebraic and Discrete Methods
Feature-based methods for large scale dynamic programming

Machine Learning - Special issue on reinforcement learning
Dynamic Programming and Optimal Control, Two Volume Set

Dynamic Programming and Optimal Control, Two Volume Set
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Introduction to Stochastic Dynamic Programming: Probability and Mathematical

Introduction to Stochastic Dynamic Programming: Probability and Mathematical
Neuro-Dynamic Programming

Neuro-Dynamic Programming

Unity in Diversity, Diversity in Unity: Retrospective and Prospective Views on Control of Discrete Event Systems

Discrete Event Dynamic Systems
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning

Discrete Event Dynamic Systems
CONVERGENCE OF SIMULATION-BASED POLICY ITERATION

Probability in the Engineering and Informational Sciences
An Algorithmic Approach for Sensitivity Analysis of Perturbed Quasi-Birth-and-Death Processes

Queueing Systems: Theory and Applications
Basic Ideas for Event-Based Optimization of Markov Systems

Discrete Event Dynamic Systems
The optimal robust control policy for uncertain semi-Markov control processes

International Journal of Systems Science
STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents

dg.o '07 Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains
Error bounds of optimization algorithms for semi-Markov decision processes

International Journal of Systems Science
Learning Representation and Control in Markov Decision Processes: New Frontiers

Foundations and Trends® in Machine Learning
Technical Communique: A unified approach to Markov decision problems and performance sensitivity analysis

Automatica (Journal of IFAC)
A time aggregation approach to Markov decision processes

Automatica (Journal of IFAC)
A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases

Automatica (Journal of IFAC)
The control of a two-level Markov decision process by time aggregation

Automatica (Journal of IFAC)

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper provides an introductory discussion for an importantconcept, the performance potentials of Markov processes, and its relationswith perturbation analysis (PA), average-cost Markov decision processes(MDP), Poisson equations, &agr;-potentials, the fundamentalmatrix, and the group inverse of the transition matrix (or the infinitesimalgenerators). Applications to single sample path-based performancesensitivity estimation and performance optimization are also discussed.On-line algorithms for performance sensitivity estimates and on-line schemesfor policy iteration methods are presented. The approach is closely relatedto reinforcement learning algorithms.