Reinforcement learning of coordination in cooperative multi-agent systems

Authors:
Spiros Kapetanakis; Daniel Kudenko
Affiliations:
Department of Computer Science, University of York, Heslington, York YO10 5DD, UK; Department of Computer Science, University of York, Heslington, York YO10 5DD, UK
Venue:
Eighteenth national conference on Artificial intelligence
Year:
2002

Citing 6
Cited 28

Learning to coordinate without sharing information

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms

Machine Learning
An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Sequential Optimality and Coordination in Multiagent Systems

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Reinforcement learning: a survey

Journal of Artificial Intelligence Research

Resource allocation games with changing resource capacities

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Coordination in multiagent reinforcement learning: a Bayesian approach

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
Reinforcement Learning of Coordination in Heterogeneous Cooperative Multi-Agent Systems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
Coordinating Multiple Agents via Reinforcement Learning

Autonomous Agents and Multi-Agent Systems
Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems
Learning against multiple opponents

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Selecting informative actions improves cooperative multiagent learning

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Lenient learners in cooperative multiagent systems

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Can good learners always compensate for poor learners?

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Gradient descent for symmetric and asymmetric multiagent reinforcement learning

Web Intelligence and Agent Systems
Exploring selfish reinforcement learning in repeated games with stochastic rewards

Autonomous Agents and Multi-Agent Systems
Reinforcement learning in multi-agent environment and ant colony for packet scheduling in routers

Proceedings of the 5th ACM international workshop on Mobility management and wireless access
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Knowledge propagation in a distributed omnidirectional vision system

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Marco Somalvico Memorial Issue
Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective

The Journal of Machine Learning Research
Convergence of independent adaptive learners

EPIA'07 Proceedings of the 13th Portuguese conference on Progress in artificial intelligence
Cooperative learning using advice exchange

Adaptive agents and multi-agent systems
Theoretical convergence guarantees for cooperative coevolutionary algorithms

Evolutionary Computation
Theoretical considerations of potential-based reward shaping for multi-agent systems

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
An overview of cooperative and competitive multiagent learning

LAMAS'05 Proceedings of the First international conference on Learning and Adaption in Multi-Agent Systems
A convergent multiagent reinforcement learning approach for a subclass of cooperative stochastic games

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Multi-agent learning and control system using ants colony for packet scheduling in routers

APNOMS'07 Proceedings of the 10th Asia-Pacific conference on Network Operations and Management Symposium: managing next generation networks and services
Continuous strategy replicator dynamics for multi-agent Q-learning

Autonomous Agents and Multi-Agent Systems
Orchestrating multiagent learning of penalty games

SBIA'12 Proceedings of the 21st Brazilian conference on Advances in Artificial Intelligence
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
The dynamics of reinforcement social learning in cooperative multiagent systems

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Exploration strategies in n-Person general-sum multiagent reinforcement learning with sequential action selection

Intelligent Data Analysis

Abstract

We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multi-agent systems. Specifically, we focus on a novel action selection strategy for Q-learning (Watkins 1989). The new technique is applicable to scenarios where mutual observation of actions is not possible. To date, reinforcement learning approaches for such independent agents have not guaranteed convergence to the optimal joint action in scenarios with high miscoordination costs. We improve on previous results (Claus & Boutilier 1998) by demonstrating empirically that our extension causes the agents to converge almost always to the optimal joint action even in these difficult cases.
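
The setting the abstract describes can be illustrated with a minimal sketch of two independent Q-learners in a single-stage cooperative game. This is only the baseline setup, not the new action selection strategy, which the abstract does not specify. The payoff matrix, the Boltzmann exploration schedule, the learning rate, and all function names below are assumptions chosen for illustration. Each agent sees only its own action and the shared reward, never the other agent's action.

```python
import math
import random

# Hypothetical 3x3 common-payoff matrix (illustrative values, not from the paper).
# The best joint action (0, 0) sits next to heavy miscoordination penalties.
PAYOFF = [
    [10, -30, 0],
    [-30, 7, 6],
    [0, 0, 5],
]


def boltzmann_choice(q_values, temperature, rng):
    """Sample an action index with probability proportional to exp(Q / T)."""
    top = max(q_values)  # subtract the maximum for numerical stability
    weights = [math.exp((q - top) / temperature) for q in q_values]
    return rng.choices(range(len(q_values)), weights=weights)[0]


def run_independent_learners(episodes=2000, alpha=0.1,
                             t_start=50.0, t_end=0.5, seed=0):
    """Train two independent learners; each keeps Q-values over its own actions only."""
    rng = random.Random(seed)
    n = len(PAYOFF)
    q1 = [0.0] * n
    q2 = [0.0] * n
    for episode in range(episodes):
        # Assumed schedule: temperature decays geometrically from t_start to t_end.
        frac = episode / max(1, episodes - 1)
        temperature = t_start * (t_end / t_start) ** frac
        a1 = boltzmann_choice(q1, temperature, rng)
        a2 = boltzmann_choice(q2, temperature, rng)
        reward = PAYOFF[a1][a2]  # both agents receive the same reward
        # Single-stage Q-learning update: no next state, so no bootstrapped term.
        q1[a1] += alpha * (reward - q1[a1])
        q2[a2] += alpha * (reward - q2[a2])
    greedy_joint = (q1.index(max(q1)), q2.index(max(q2)))
    return q1, q2, greedy_joint


if __name__ == "__main__":
    q1, q2, joint = run_independent_learners()
    print("Agent 1 Q-values:", q1)
    print("Agent 2 Q-values:", q2)
    print("Greedy joint action:", joint)
```

Because each agent averages rewards over whatever its partner happens to play, the penalties around the optimal joint action can pull its estimated value down. Whether a given run settles on the optimal joint action depends on the seed and the schedule.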