Convergence of independent adaptive learners

Authors:
Francisco S. Melo;Manuel C. Lopes
Affiliations:
Institute for Systems and Robotics, Instituto Superior Técnico, Lisboa, Portugal;Institute for Systems and Robotics, Instituto Superior Técnico, Lisboa, Portugal
Venue:
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Year:
2007

Citing 10
Cited 1

Multi-agent reinforcement learning: independent vs. cooperative agents

Readings in agents
The dynamics of reinforcement learning in cooperative multiagent systems

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Machine Learning
An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Reinforcement learning of coordination in cooperative multi-agent systems

Eighteenth national conference on Artificial intelligence
A Fuzzy-Logic Based Bidding Strategy for Autonomous Agents in Continuous Double Auctions

IEEE Transactions on Knowledge and Data Engineering
Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Exploring selfish reinforcement learning in repeated games with stochastic rewards

Autonomous Agents and Multi-Agent Systems
Rational and convergent learning in stochastic games

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Value-function reinforcement learning in Markov games

Cognitive Systems Research

The world of independent learners is not markovian

International Journal of Knowledge-based and Intelligent Engineering Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we analyze the convergence of independent adaptive learners in repeated games. We show that, in this class of games, independent adaptive learners converge to pure Nash equilibria in self play, if they exist, and to a best response strategy against stationary opponents. We discuss the relation between our result and convergence results of adaptive play [1]. The importance of our result stems from the fact that, unlike adaptive play, no communication/action observability is assumed. We also relate this result to recent results on the convergence of weakened ficticious play processes for independent learners [2,3]. Finally we present experimental results illustrating the main ideas of the paper.