An exact algorithm for solving MDPs under risk-sensitive planning objectives with one-switch utility functions

Authors:
Yaxin Liu;Sven Koenig
Affiliations:
Fair Isaac Corporation;University of Southern California
Venue:
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Year:
2008

Citing 10
Cited 2

One-switch utility functions and a measure of risk

Management Science
An analysis of stochastic shortest path problems

Mathematics of Operations Research
Broadly Decreasing Risk Aversion

Management Science
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Strong One-Switch Utility

Management Science
Decision-theoretic planning under risk-sensitive planning objectives

Decision-theoretic planning under risk-sensitive planning objectives
Risk-sensitive planning with one-switch utility functions: value iteration

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Thresholded rewards: acting optimally in timed, zero-sum games

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
State space search for risk-averse agents

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Brief On terminating Markov decision processes with a risk-averse objective function

Automatica (Journal of IFAC)

Risk-sensitive planning in partially observable environments

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Computing rank dependent utility in graphical models for sequential decision problems

Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

One-switch utility functions are an important class of nonlinear utility functions that can model human beings whose decisions change with their wealth level. We study how to maximize the expected utility for Markov decision problems with given one-switch utility functions. We first utilize the fact that one-switch utility functions are weighted sums of linear and exponential utility functions to prove that there exists an optimal policy that is both stationary and deterministic as the wealth level approaches negative infinity. We then develop a solution method, the backward-induction method, that starts with this policy and augments it for higher and higher wealth levels. Our backward-induction method determines maximal expected utilities in finite time, different from the previous functional value iteration method, that typically determines only approximately maximal expected utilities.