Brief On terminating Markov decision processes with a risk-averse objective function

  • Authors:
  • Stephen D. Patek

  • Affiliations:
  • Department of Systems Engineering, School of Engineering and Applied Science, University of Virginia, Olsson Hall, Charlottesville, VA 22903, USA

  • Venue:
  • Automatica (Journal of IFAC)
  • Year:
  • 2001

Quantified Score

Hi-index 22.14

Visualization

Abstract

We consider a class of terminating Markov decision processes with an exponential risk-averse objective function and compact constraint sets. We assume the existence of an absorbing cost-free terminal state @W, positive transition costs and continuity of the transition probability and cost functions. Without discounting future costs in the argument of the exponential utility function, we establish (i) the existence of a real-valued optimal cost function which can be achieved by a stationary policy and (ii) the convergence of value iteration and policy iteration to the unique solution of Bellman's equation. We illustrate the results with two computational examples.