Planning and acting in partially observable stochastic domains
Artificial Intelligence
A language for modeling agents' decision making processes in games
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Graphical models for online solutions to interactive POMDPs
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
A particle filtering based approach to approximating interactive POMDPs
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
A framework for sequential planning in multi-agent settings
Journal of Artificial Intelligence Research
Multi-agent influence diagrams for representing and solving games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Improved use of partial policies for identifying behavioral equivalence
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Exploiting model equivalences for solving interactive dynamic influence diagrams
Journal of Artificial Intelligence Research
Learning Communication in Interactive Dynamic Influence Diagrams
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02
Hi-index | 0.00 |
Interactive dynamic influence diagrams (I-DIDs) offer a transparent and semantically clear representation for the sequential decision-making problem over multiple time steps in the presence of other interacting agents. Solving I-DlDs exactly involves knowing the solutions of possible models of the other agents, which increase exponentially with the number of time steps. We present a method of solving I-DlDs approximately by limiting the number of other agents' candidate models at each time step to a constant. We do this by clustering the models and selecting a representative set from the clusters. We discuss the error bound of the approximation technique and demonstrate its empirical performance.