Reinforcement Learning Soccer Teams with Incomplete World Models

  • Authors:
  • Marco Wiering;Rafał Sałustowicz;Jürgen Schmidhuber

  • Affiliations:
  • IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland. marco@idsia.ch;IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland. rafal@idsia.ch;IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland. juergen@idsia.ch

  • Venue:
  • Autonomous Robots
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

We use reinforcement learning (RL) to compute strategies formultiagent soccer teams. RL may profit significantly from worldmodels (WMs) estimating state transition probabilities and rewards.In high-dimensional, continuous input spaces, however, learningaccurate WMs is intractable. Here we show that incomplete WMs canhelp to quickly find good action selection policies. Our approach isbased on a novel combination of CMACs and prioritized sweeping-likealgorithms. Variants thereof outperform both Q(λ)-learningwith CMACs and the evolutionary method Probabilistic IncrementalProgram Evolution (PIPE) which performed best in previouscomparisons.