One-switch utility functions and a measure of risk
Management Science
An analysis of stochastic shortest path problems
Mathematics of Operations Research
Broadly Decreasing Risk Aversion
Management Science
Markov Decision Processes: Discrete Stochastic Dynamic Programming
Markov Decision Processes: Discrete Stochastic Dynamic Programming
Management Science
Decision-theoretic planning under risk-sensitive planning objectives
Decision-theoretic planning under risk-sensitive planning objectives
Risk-sensitive planning with one-switch utility functions: value iteration
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Thresholded rewards: acting optimally in timed, zero-sum games
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
State space search for risk-averse agents
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Brief On terminating Markov decision processes with a risk-averse objective function
Automatica (Journal of IFAC)
Risk-sensitive planning in partially observable environments
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Computing rank dependent utility in graphical models for sequential decision problems
Artificial Intelligence
Hi-index | 0.00 |
One-switch utility functions are an important class of nonlinear utility functions that can model human beings whose decisions change with their wealth level. We study how to maximize the expected utility for Markov decision problems with given one-switch utility functions. We first utilize the fact that one-switch utility functions are weighted sums of linear and exponential utility functions to prove that there exists an optimal policy that is both stationary and deterministic as the wealth level approaches negative infinity. We then develop a solution method, the backward-induction method, that starts with this policy and augments it for higher and higher wealth levels. Our backward-induction method determines maximal expected utilities in finite time, different from the previous functional value iteration method, that typically determines only approximately maximal expected utilities.