Multi-agent reinforcement learning: independent vs. cooperative agents
Readings in agents
The dynamics of reinforcement learning in cooperative multiagent systems
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Reinforcement learning of coordination in cooperative multi-agent systems
Eighteenth national conference on Artificial intelligence
A Fuzzy-Logic Based Bidding Strategy for Autonomous Agents in Continuous Double Auctions
IEEE Transactions on Knowledge and Data Engineering
Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems
AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Exploring selfish reinforcement learning in repeated games with stochastic rewards
Autonomous Agents and Multi-Agent Systems
Rational and convergent learning in stochastic games
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Value-function reinforcement learning in Markov games
Cognitive Systems Research
The world of independent learners is not markovian
International Journal of Knowledge-based and Intelligent Engineering Systems
Hi-index | 0.00 |
In this paper we analyze the convergence of independent adaptive learners in repeated games. We show that, in this class of games, independent adaptive learners converge to pure Nash equilibria in self play, if they exist, and to a best response strategy against stationary opponents. We discuss the relation between our result and convergence results of adaptive play [1]. The importance of our result stems from the fact that, unlike adaptive play, no communication/action observability is assumed. We also relate this result to recent results on the convergence of weakened ficticious play processes for independent learners [2,3]. Finally we present experimental results illustrating the main ideas of the paper.