Reinforcement Learning for 3 vs. 2 Keepaway

  • Authors:
  • Peter Stone;Richard S. Sutton;Satinder P. Singh

  • Affiliations:
  • -;-;-

  • Venue:
  • RoboCup 2000: Robot Soccer World Cup IV
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

As a sequential decision problem, robotic soccer can benefit from research in reinforcement learning. We introduce the 3 vs. 2 keepaway domain, a subproblem of robotic soccer implemented in the RoboCup soccer server. We then explore reinforcement learning methods for policy evaluation and action selection in this distributed, real-time, partially observable, noisy domain. We present empirical results demonstrating that a learned policy can dramatically outperform hand-coded policies.