To teach or not to teach?: decision making under uncertainty in ad hoc teams

Authors:
Peter Stone;Sarit Kraus
Affiliations:
The University of Texas at Austin, Austin, TX;Bar-Ilan University, Ramat Gan, Israel and University of Maryland, College Park, MD
Venue:
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Year:
2010

Citing 14
Cited 10

Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching

Machine Learning
Collaborative plans for complex group action

Artificial Intelligence
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork

Artificial Intelligence - Special issue on Robocop: the first step
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Multiagent Systems: A Survey from a Machine Learning Perspective

Autonomous Robots
Distributed Intelligent Agents

IEEE Expert: Intelligent Systems and Their Applications
Electric Elves: Applying Agent Technology to Support Human Organizations

Proceedings of the Thirteenth Conference on Innovative Applications of Artificial Intelligence Conference
Agents for Establishing Ad hoc Cross-Organizational Teams

IAT '04 Proceedings of the IEEE/WIC/ACM International Conference on Intelligent Agent Technology
Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems
Toward identifying process models in ad hoc and distributed teams

HuCom '08 Proceedings of the 1st International Working Conference on Human Factors and Computational Models in Negotiation
On partially controlled multi-agent systems

Journal of Artificial Intelligence Research
Adaptive load balancing: a study in multi-agent learning

Journal of Artificial Intelligence Research
A general approach to environment design with one agent

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence

Empirical evaluation of ad hoc teamwork in the pursuit domain

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Incentive design for adaptive agents

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Online planning for ad hoc autonomous agent teams

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One
Leading ad hoc agents in joint action settings with multiple teammates

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
An analysis framework for ad hoc teamwork tasks

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Cooperating with a markovian ad hoc teammate

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
A game-theoretic model and best-response learning method for ad hoc coordination in multiagent systems

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Ad hoc coordination in multiagent systems with applications to human-machine interaction

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Weighted synergy graphs for effective team formation with heterogeneous ad hoc agents

Artificial Intelligence

Quantified Score

Hi-index	0.01

Visualization

Abstract

In typical multiagent teamwork settings, the teammates are either programmed together, or are otherwise provided with standard communication languages and coordination protocols. In contrast, this paper presents an ad hoc team setting in which the teammates are not pre-coordinated, yet still must work together in order to achieve their common goal(s). We represent a specific instance of this scenario, in which a teammate has limited action capabilities and a fixed and known behavior, as a finite-horizon, cooperative k-armed bandit. In addition to motivating and studying this novel ad hoc teamwork scenario, the paper contributes to the k-armed bandits literature by characterizing the conditions under which certain actions are potentially optimal, and by presenting a polynomial dynamic programming algorithm that solves for the optimal action when the arm payoffs come from a discrete distribution.