Discounted Markov decision processes with utility constraints

  • Authors:
  • Yoshinobu Kadota;Masami Kurano;Masami Yasuda

  • Affiliations:
  • -;-;-

  • Venue:
  • Computers & Mathematics with Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.09

Visualization

Abstract

We consider utility-constrained Markov decision processes. The expected utility of the total discounted reward is maximized subject to multiple expected utility constraints. By introducing a corresponding Lagrange function, a saddle-point theorem of the utility constrained optimization is derived. The existence of a constrained optimal policy is characterized by optimal action sets specified with a parametric utility.