Continuous Time Discounted Jump Markov Decision Processes: A Discrete-Event Approach

Authors:
Eugene A. Feinberg
Affiliations:
-
Venue:
Mathematics of Operations Research
Year:
2004

Citing 0
Cited 12

Optimality Of Randomized Trunk Reservation For A Problem With Multiple Constraints

Probability in the Engineering and Informational Sciences
Continuous-Time Markov Decision Processes with Discounted Rewards: The Case of Polish Spaces

Mathematics of Operations Research
Brief paper: Towards the optimal control of Markov chains with constraints

Automatica (Journal of IFAC)
Average Continuous Control of Piecewise Deterministic Markov Processes

SIAM Journal on Control and Optimization
Discounted Continuous-Time Markov Decision Processes with Constraints: Unbounded Transition and Loss Rates

Mathematics of Operations Research
Discounted Continuous-Time Markov Decision Processes with Unbounded Rates: The Convex Analytic Approach

SIAM Journal on Control and Optimization
A characterization of meaningful schedulers for continuous-time markov decision processes

FORMATS'06 Proceedings of the 4th international conference on Formal Modeling and Analysis of Timed Systems
Stochastic reasoning about channel-based component connectors

COORDINATION'06 Proceedings of the 8th international conference on Coordination Models and Languages
Linear Programming and Constrained Average Optimality for General Continuous-Time Markov Decision Processes in History-Dependent Policies

SIAM Journal on Control and Optimization
Continuous-Time Markov Decision Processes with State-Dependent Discount Factors

Acta Applicandae Mathematicae: an international survey journal on applying mathematics and mathematical applications
Bisimulation and logical preservation for continuous-time markov decision processes

CONCUR'07 Proceedings of the 18th international conference on Concurrency Theory
Optimal time-abstract schedulers for CTMDPs and continuous-time Markov games

Theoretical Computer Science

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper introduces and develops a new approach to the theory of continuous time jump Markov decision processes (CTJMDP). This approach reduces discounted CTJMDPs to discounted semi-Markov decision processes (SMDPs) and eventually to discrete-time Markov decision processes (MDPs). The reduction is based on the equivalence of strategies that change actions between jumps and the randomized strategies that change actions only at jump epochs. This holds both for one-criterion problems and for multiple-objective problems with constraints. In particular, this paper introduces the theory for multiple-objective problems with expected total discounted rewards and constraints. If a problem is feasible, there exist three types of optimal policies: (i) nonrandomized switching stationary policies, (ii) randomized stationary policies for the CTJMDP, and (iii) randomized stationary policies for the corresponding SMDP with exponentially distributed sojourn times, and these policies can be implemented as randomized strategies in the CTJMDP.