A Dynamic Programming Algorithm for the Optimal Control of Piecewise Deterministic Markov Processes

  • Authors: Anthony Almudevar
  • Affiliations: -
  • Venue: SIAM Journal on Control and Optimization
  • Year: 2001

Abstract

A piecewise deterministic Markov process (PDP) is a continuous-time Markov process consisting of continuous, deterministic trajectories interrupted by random jumps. The trajectories may be controlled with the objective of minimizing the expected costs associated with the process. A method of representing this controlled PDP as a discrete-time decision process is presented, allowing the value function for the problem to be expressed as the fixed point of a dynamic programming operator. Decisions take the form of trajectory segments. The expected costs may then be minimized through a dynamic programming algorithm, rather than through the solution of the Bellman-Hamilton-Jacobi equation, provided the trajectory segments are numerically tractable. The technique is applied to the optimal capacity expansion problem, that is, the problem of planning the construction of new production facilities to meet rising demand.
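
The fixed-point idea described in the abstract can be illustrated with a small value-iteration sketch. This is not the paper's construction: the state space is discretized, each "decision" is an abstract trajectory segment summarized by an expected running cost, an accumulated discount factor, and a distribution over post-jump states, and the arrays seg_cost, seg_discount, and P below are illustrative placeholders. It only shows how a value function can be obtained as the fixed point V = TV of a dynamic programming operator whose minimization runs over trajectory segments.

import numpy as np

# Minimal value-iteration sketch with "trajectory segments" as actions.
# All numerical values are illustrative placeholders, not taken from the paper.
n_states, n_segments = 5, 3
rng = np.random.default_rng(0)

seg_cost = rng.uniform(1.0, 4.0, size=(n_states, n_segments))       # expected cost along each segment
seg_discount = rng.uniform(0.7, 0.95, size=(n_states, n_segments))  # discount accumulated over the segment
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_segments))   # post-jump state distribution

def T(V):
    """One application of the dynamic programming operator:
    (TV)(x) = min over segments a of  cost(x, a) + discount(x, a) * E[V(next state)]."""
    Q = seg_cost + seg_discount * (P @ V)   # Q[x, a]
    return Q.min(axis=1)

# Iterate to the fixed point V = TV (a contraction here, since all discounts are < 1).
V = np.zeros(n_states)
for _ in range(500):
    V_new = T(V)
    if np.max(np.abs(V_new - V)) < 1e-10:
        break
    V = V_new

policy = np.argmin(seg_cost + seg_discount * (P @ V), axis=1)
print("value function:", np.round(V, 3))
print("optimal segment index per state:", policy)

In the paper's setting the minimization at each state ranges over admissible trajectory segments of the controlled PDP rather than a finite index set, which is why the abstract's proviso that the segments be numerically tractable matters.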