Multi-agent Learning Dynamics: A Survey

  • Authors:
  • H. Jaap Herik;D. Hennes;M. Kaisers;K. Tuyls;K. Verbeeck

  • Affiliations:
  • Adaptive Agents Group, MICC, Maastricht University, The Netherlands;Adaptive Agents Group, MICC, Maastricht University, The Netherlands;Adaptive Agents Group, MICC, Maastricht University, The Netherlands;Adaptive Agents Group, MICC, Maastricht University, The Netherlands;Adaptive Agents Group, MICC, Maastricht University, The Netherlands

  • Venue:
  • CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration and policy iteration. Four characteristics are studied: initial conditions, parameter settings, convergence speed, and local versus global convergence. Global convergence is still difficult to achieve in practice, despite existing theoretical guarantees. Multiple visualizations are included to provide a comprehensive insight into the learning dynamics.