Value-function reinforcement learning in Markov games

  • Authors:
  • Michael L. Littman

  • Affiliations:
  • AT&T Labs Research, 180 Park Avenue, Florham Park, NJ 07932-0971, USA

  • Venue:
  • Cognitive Systems Research
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Markov games are a model of multiagent environments that are convenient for studying multiagent reinforcement learning. This paper describes a set of reinforcement-learning algorithms based on estimating value functions and presents convergence theorems for these algorithms. The main contribution of this paper is that it presents the convergence theorems in a way that makes it easy to reason about the behavior of simultaneous learners in a shared environment.