Improved use of partial policies for identifying behavioral equivalence

  • Authors:
  • Yifeng Zeng;Hua Mao;Yinghui Pan;Jian Luo

  • Affiliations:
  • Aalborg University, Aalborg, Denmark and Xiamen University, Xiamen, China;Aalborg University, Aalborg, Denmark;Xiamen University, Xiamen, China and Jiangxi Univ. of Fin.& Eco., Jiangxi, China;Xiamen University, Xiamen, China

  • Venue:
  • Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Interactive multiagent decision making often requires to predict actions of other agents by solving their behavioral models from the perspective of the modeling agent. Unfortunately, the general space of models in the absence of constraining assumptions tends to be very large thereby making multiagent decision making intractable. One approach that can reduce the model space is to cluster behaviorally equivalent models that exhibit identical policies over the whole planning horizon. Currently, the state of the art on identifying equivalence of behavioral models compares partial policy trees instead of entire trees. In this paper, we further improve the use of partial trees for the identification purpose and develop an incremental comparison strategy in order to efficiently ascertain the model equivalence. We investigate the improved approach in a well-defined probabilistic graphical model for sequential multiagent decision making - interactive dynamic influence diagrams, and evaluate its performance over multiple problem domains.