Reinforcement Learning Soccer Teams with Incomplete World Models

Authors:
Marco Wiering;Rafał Sałustowicz;Jürgen Schmidhuber
Affiliations:
IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland. marco@idsia.ch;IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland. rafal@idsia.ch;IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland. juergen@idsia.ch
Venue:
Autonomous Robots
Year:
1999

Citing 22
Cited 2

Adaptation in natural and artificial systems

Adaptation in natural and artificial systems
Technical Note: \cal Q-Learning

Machine Learning
Simplifying neural networks by soft weight-sharing

Neural Computation
Learning in embedded systems

Learning in embedded systems
Reinforcement learning for robots using neural networks

Reinforcement learning for robots using neural networks
Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Reinforcement learning with replacing eligibility traces

Machine Learning - Special issue on reinforcement learning
Incremental multi-step Q-learning

Machine Learning - Special issue on reinforcement learning
Shifting Inductive Bias with Success-Story Algorithm, AdaptiveLevin Search, and Incremental Self-Improvement

Machine Learning - Special issue on inductive transfer
Reinforcement learning with self-modifying policies

Learning to learn
Efficient model-based exploration

Proceedings of the fifth international conference on simulation of adaptive behavior on From animals to animats 5
Fast Online Q(λ)

Machine Learning
Learning Team Strategies: Soccer Case Studies

Machine Learning
Finite-sample convergence rates for Q-learning and indirect algorithms

Proceedings of the 1998 conference on Advances in neural information processing systems II
Reinforcement Learning

Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
A Probabilistic Approach to Concurrent Mapping and Localization for Mobile Robots

Autonomous Robots
Learning to Predict by the Methods of Temporal Differences

Machine Learning
A Representation for the Adaptive Generation of Simple Sequential Programs

Proceedings of the 1st International Conference on Genetic Algorithms
On Learning Soccer Strategies

ICANN '97 Proceedings of the 7th International Conference on Artificial Neural Networks
Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces

Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces
Probabilistic incremental program evolution

Evolutionary Computation

Evolution in the Orange Box - A New Approach to the Sphere-Packing Problem in CMAC-Based Neural Networks

AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Cooperative Multi-Agent Learning: The State of the Art

Autonomous Agents and Multi-Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We use reinforcement learning (RL) to compute strategies formultiagent soccer teams. RL may profit significantly from worldmodels (WMs) estimating state transition probabilities and rewards.In high-dimensional, continuous input spaces, however, learningaccurate WMs is intractable. Here we show that incomplete WMs canhelp to quickly find good action selection policies. Our approach isbased on a novel combination of CMACs and prioritized sweeping-likealgorithms. Variants thereof outperform both Q(λ)-learningwith CMACs and the evolutionary method Probabilistic IncrementalProgram Evolution (PIPE) which performed best in previouscomparisons.