Adaption of XCS to multi-learner predator/prey scenarios

Authors:
Clemens Lode;Urban Richter;Hartmut Schmeck
Affiliations:
Karlsruhe Institute of Technology, Karlsruhe, Germany;Karlsruhe Institute of Technology, Karlsruhe, Germany;Karlsruhe Institute of Technology, Karlsruhe, Germany
Venue:
Proceedings of the 12th annual conference on Genetic and evolutionary computation
Year:
2010

Citing 11
Cited 0

Adaptation in natural and artificial systems

Adaptation in natural and artificial systems
Multiagent Systems: A Survey from a Machine Learning Perspective

Autonomous Robots
Learning Classifier Systems Meet Multiagent Environments

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
A Bigger Learning Classifier Systems Bibliography

IWLCS '00 Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems
On the Rationality of Profit Sharing in Multi-Agent Reinforcement Learning

ICCIMA '01 Proceedings of the Fourth International Conference on Computational Intelligence and Multimedia Applications
Exploring XCS in multiagent environments

GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
Toward Optimal Classifier System Performance in Non-Markov Environments

Evolutionary Computation
Zcs: A zeroth level classifier system

Evolutionary Computation
Classifier fitness based on accuracy

Evolutionary Computation
Tournament selection: stable fitness pressure in XCS

GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Toward a theory of generalization and learning in XCS

IEEE Transactions on Evolutionary Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Learning classifier systems (LCSs) are rule-based evolutionary reinforcement learning systems. Today, especially variants of Wilson's extended classifier system (XCS) are widely applied for machine learning. Despite their widespread application, LCSs have drawbacks, e. g., in multi-learner scennarios, since the Markov property is not fulfilled. In this paper, LCSs are investigated in an instance of the generic homogeneous and non-communicating predator/prey scenario. A group of predators collaboratively observe a (randomly) moving prey as long as possible, where each predator is equipped with a single, independent XCS. Results show that improvements in learning are achieved by cleverly adapting a multi-step approach to the characteristics of the investigated scenario. Firstly, the environmental reward function is expanded to include sensory information. Secondly, the learners are equipped with a memory to store and analyze the history of local actions and given payoffs.