Learning to Communicate and Act Using Hierarchical Reinforcement Learning

Authors:
Mohammad Ghavamzadeh;Sridhar Mahadevan
Affiliations:
University of Massachusetts at Amherst;University of Massachusetts at Amherst
Venue:
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Year:
2004

Citing 12
Cited 13

Elevator Group Control Using Multiple Reinforcement Learning Agents

Machine Learning
Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

Artificial Intelligence
Hierarchical multi-agent reinforcement learning

Proceedings of the fifth international conference on Autonomous agents
Communication decisions in multi-agent cooperation: model and experiments

Proceedings of the fifth international conference on Autonomous agents
Multiagent learning using a variable learning rate

Artificial Intelligence
Distributed Value Functions

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Sequential Optimality and Coordination in Multiagent Systems

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Learning to Cooperate via Policy Search

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Coordinated Reinforcement Learning

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Hierarchical reinforcement learning with the MAXQ value function decomposition

Journal of Artificial Intelligence Research
The communicative multiagent team decision problem: analyzing teamwork theories and models

Journal of Artificial Intelligence Research
Policy recognition in the abstract hidden Markov model

Journal of Artificial Intelligence Research

Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems
Agent interaction in distributed POMDPs and its implications on complexity

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Hard constrained semi-Markov decision processes

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Communication-based decomposition mechanisms for decentralized MDPs

Journal of Artificial Intelligence Research
An investigation into mathematical programming for finite horizon decentralized POMDPs

Journal of Artificial Intelligence Research
Online planning for multi-agent systems with bounded communication

Artificial Intelligence
Scaling model-based average-reward reinforcement learning for product delivery

ECML'06 Proceedings of the 17th European conference on Machine Learning
An overview of cooperative and competitive multiagent learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
Dealing with errors in a cooperative multi-agent learning system

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
An extension of a hierarchical reinforcement learning algorithm for multiagent settings

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Using conflict resolution to inform decentralized learning

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Interactive relational reinforcement learning of concept semantics

Machine Learning
Multiagent meta-level control for radar coordination

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we address the issue of rational communication behavior among autonomous agents. The goal is for agents to learn a policy to optimize the communication needed for proper coordination, given the communication cost. We extend our previously reported cooperative hierarchical reinforcement learning (HRL) algorithm to include communication decisions and propose a new multiagent HRL algorithm, called COM-Cooperative HRL. In this algorithm, we define cooperative subtasks to be those subtasks in which coordination among agents significantly improves the performance of the overall task. Those levels of the hierarchy which include cooperative subtasks are called cooperation levels. Coordination skills among agents are learned faster by sharing information at the cooperation levels, rather than the level of primitive actions. We add a communication level to the hierarchical decomposition of the problem below each cooperation level. Before making a decision at a cooperative subtask, agents decide if it is worthwhile to perform a communication action. A communication action has a certain cost and provides each agent at a certain cooperation level with the actions selected by the other agents at the same level. We demonstrate the efficacy of the COM-Cooperative HRL algorithm as well as the relation between the communication cost and the learned communication policy using a multiagent taxi domain.