Operations Research
Parameter-Free Elicitation of Utility and Probability Weighting Functions
Management Science
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
On Decision Support Under Risk by the WOWA Optimization
ECSQARU '07 Proceedings of the 9th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
WOWA Enhancement of the Preference Modeling in the Reference Point Method
MDAI '08 Sabadell Proceedings of the 5th International Conference on Modeling Decisions for Artificial Intelligence
On efficient WOWA optimization for decision support under risk
International Journal of Approximate Reasoning
Risk-sensitive planning with one-switch utility functions: value iteration
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
State space search for risk-averse agents
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning a decision maker's utility function from (possibly) inconsistent behavior
Artificial Intelligence
A WOWA-based Aggregation Technique on Trust Values Connected to Metadata
Electronic Notes in Theoretical Computer Science (ENTCS)
Hi-index | 0.00 |
This paper is devoted to automated sequential decision in AI. More precisely, we focus here on the Rank Dependent Utility (RDU) model. This model is able to encompass rational decision behaviors that the Expected Utility model cannot accommodate. However, the non-linearity of RDU makes it difficult to compute an RDU-optimal strategy in sequential decision problems. This has considerably slowed the use of RDU in operational contexts. In this paper, we are interested in providing new algorithmic solutions to compute an RDU-optimal strategy in graphical models. Specifically, we present algorithms for solving decision tree models and influence diagram models of sequential decision problems. For decision tree models, we propose a mixed integer programming formulation that is valid for a subclass of RDU models (corresponding to risk seeking behaviors). This formulation reduces to a linear program when mixed strategies are considered. In the general case (i.e., when there is no particular assumption on the parameters of RDU), we propose a branch and bound procedure to compute an RDU-optimal strategy among the pure ones. After highlighting the difficulties induced by the use of RDU in influence diagram models, we show how this latter procedure can be extended to optimize RDU in an influence diagram. Finally, we provide empirical evaluations of all the presented algorithms.