Anytime Self-play Learning to Satisfy Functional Optimality Criteria

  • Authors:
  • Andriy Burkov;Brahim Chaib-Draa

  • Affiliations:
  • Laval University, Quebec, Canada;Laval University, Quebec, Canada

  • Venue:
  • ADT '09 Proceedings of the 1st International Conference on Algorithmic Decision Theory
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present an anytime multiagent learning approach to satisfy any given optimality criterion in repeated game self-play. Our approach is opposed to classical learning approaches for repeated games: namely, learning of equilibrium, Pareto-efficient learning, and their variants. The comparison is given from a practical (or engineering) standpoint, i.e., from a point of view of a multiagent system designer whose goal is to maximize the system's overall performance according to a given optimality criterion. Extensive experiments in a wide variety of repeated games demonstrate the efficacy of our approach.