Sequential decision making in repeated coalition formation under uncertainty

Authors: Georgios Chalkiadakis; Craig Boutilier
Affiliations: University of Southampton, Southampton, United Kingdom; University of Toronto, Toronto, Canada
Venue: Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Year: 2008

References (13)

Coalitions among computationally bounded agents
Artificial Intelligence - Special issue on economic principles of multi-agent systems

Methods for task allocation via agent coalition formation
Artificial Intelligence

Bayesian Q-learning
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence

Multi-Agent Coordination through Coalition Formation
ATAL '97 Proceedings of the 4th International Workshop on Intelligent Agents IV, Agent Theories, Architectures, and Languages

Coalition formation with uncertain heterogeneous information
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems

Coordination in multiagent reinforcement learning: a Bayesian approach
AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems

Modelling Coalition Formation over Time for Iterative Coalition Games
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2

The Advantages of Compromising in Coalition Formation with Incomplete Information
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 2

Bayesian Reinforcement Learning for Coalition Formation under Uncertainty
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3

Coalition formation under uncertainty: bargaining equilibria and the Bayesian core stability concept
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems

Sequential decision making with untrustworthy service providers
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2

Coalitional bargaining with agent type uncertainty
IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence

Model based Bayesian exploration
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence

Cited by (8)

Sequential decision making with untrustworthy service providers
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2

Agents in neural uncertainty
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks

Heuristic search for identical payoff Bayesian games
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems - Volume 1

Optimizing coalition formation for tasks with dynamically evolving rewards and nondeterministic action effects
Autonomous Agents and Multi-Agent Systems

Static and expanding grid coverage with ant robots: Complexity results
Theoretical Computer Science

Sequentially optimal repeated coalition formation under uncertainty
Autonomous Agents and Multi-Agent Systems

Deciding roles for efficient team formation by parameter learning
KES-AMSTA'12 Proceedings of the 6th KES international conference on Agent and Multi-Agent Systems: technologies and applications

Autonomous decision on team roles for efficient team formation by parameter learning and its evaluation
Intelligent Decision Technologies

Abstract

Coalition formation is a critical problem when agents are uncertain about the types or capabilities of their potential partners. In [3], a Bayesian reinforcement learning framework is developed for this problem when coalitions are formed (and tasks undertaken) repeatedly: the model not only allows agents to refine their beliefs about the types of others, but also uses value of information to define optimal exploration policies. However, the computational approximations in that work are purely myopic. We present novel, non-myopic learning algorithms that approximate the optimal Bayesian solution, providing a tractable means of ensuring good sequential performance. We evaluate our algorithms in a variety of settings and show that one, in particular, exhibits consistently good sequential performance. Further, it enables the Bayesian agents to transfer acquired knowledge among different dynamic tasks.
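To make the idea of refining beliefs about partner types concrete, here is a minimal sketch of a single Bayesian belief update after one observed task outcome. It is not the algorithm of the paper or of [3]. The function name `update_type_belief`, the two types "expert" and "novice", and all the probabilities are hypothetical values chosen purely for illustration.

```python
from typing import Dict


def update_type_belief(prior: Dict[str, float],
                       likelihood: Dict[str, float]) -> Dict[str, float]:
    """Apply Bayes' rule: posterior(t) is proportional to prior(t) * P(outcome | t)."""
    unnormalized = {t: prior[t] * likelihood[t] for t in prior}
    total = sum(unnormalized.values())
    if total == 0.0:
        raise ValueError("observed outcome has zero probability under the prior")
    return {t: p / total for t, p in unnormalized.items()}


# Hypothetical setup: a potential partner is either an "expert" or a "novice".
prior = {"expert": 0.5, "novice": 0.5}

# Assumed probability that a joint task succeeds, given the partner's type.
p_success = {"expert": 0.9, "novice": 0.3}

# Belief about the partner after observing one successful joint task.
posterior = update_type_belief(prior, p_success)
```

With these assumed numbers, one observed success shifts the belief toward "expert". Repeating the update over successive coalitions is one simple way beliefs could sharpen over time. The exploration policies and non-myopic approximations described in the abstract are not modeled here.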