Advice taking in multiagent reinforcement learning

Authors:
Michael Rovatsos;Alexandros Belesiotis
Affiliations:
The University of Edinburgh, Edinburgh, United Kingdom;The University of Edinburgh, Edinburgh, United Kingdom
Venue:
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Year:
2007

Citing 5
Cited 2

Technical Note: \cal Q-Learning

Machine Learning
Multiagent learning using a variable learning rate

Artificial Intelligence
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Collective Intelligence and Braess' Paradox

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence

Utility based Q-learning to facilitate cooperation in Prisoner's Dilemma games

Web Intelligence and Agent Systems
A personalized QoE-aware handover decision based on distributed reinforcement learning

Wireless Networks

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) that uses an additional "advice" signal to inform agents about mutually beneficial forms of behaviour. β-WoLF is an extension of the WoLF-PHC algorithm that allows agents to assess whether the advice obtained through this additional reward signal is (i) useful for the learning agent itself and (ii) currently being followed by other agents in the system. We report on experimental results obtained with this novel algorithm which indicate that it enables cooperation in scenarios in which the need to defend oneself against exploitation results in poor coordination using existing MARL algorithms.