Temporal difference learning and TD-Gammon
Communications of the ACM
Neural Networks: A Comprehensive Foundation
Neural Networks: A Comprehensive Foundation
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Andhill-98: A RoboCup Team which Reinforces Positioning with Observation
RoboCup-98: Robot Soccer World Cup II
RoboCup-98: Robot Soccer World Cup II
RoboCup-98: Robot Soccer World Cup II
Hi-index | 0.00 |
This paper describes a softbot agent capable of learning to choose its actions, in order to achieve its goal when facing an opponent in a dynamic environment. The agent uses rewards gathered from the environment to assess and improve the quality of its own behavior. A multilayer perceptron neural network is assessed regarding its adequacy as a value function approximator for state-action pairs in the robotic soccer domain.