XCS with eligibility traces

Authors:
Jan Drugowitsch;Alwyn M. Barry
Affiliations:
University of Bath, Bath, UK;University of Bath, Bath, UK
Venue:
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Year:
2005

Citing 12
Cited 2

Induction: processes of inference, learning, and discovery

Induction: processes of inference, learning, and discovery
Bucket brigade performance: I. Long sequences of classifiers

Proceedings of the Second International Conference on Genetic Algorithms on Genetic algorithms and their application
Learning and bucket brigade dynamics in classifier systems

CNLS '89 Proceedings of the ninth annual international conference of the Center for Nonlinear Studies on Self-organizing, Collective, and Cooperative Phenomena in Natural and Artificial Computing Networks on Emergent computation
The animat and the physician

Proceedings of the first international conference on simulation of adaptive behavior on From animals to animats
Technical Note: \cal Q-Learning

Machine Learning
Reinforcement learning with replacing eligibility traces

Machine Learning - Special issue on reinforcement learning
Robot Shaping: An Experiment in Behavior Engineering

Robot Shaping: An Experiment in Behavior Engineering
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Properties of the Bucket Brigade

Proceedings of the 1st International Conference on Genetic Algorithms
An Incremental Multiplexer Problem and Its Uses in Classifier System Research

IWLCS '01 Revised Papers from the 4th International Workshop on Advances in Learning Classifier Systems
Classifier fitness based on accuracy

Evolutionary Computation
Limits in long path learning with XCS

GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII

Learning classifier systems: a complete introduction, review, and roadmap

Journal of Artificial Evolution and Applications
Learning classifier systems: a complete introduction, review, and roadmap

Journal of Artificial Evolution and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investigations in delayed-reward (i.e. multi-step) environments have shown promise, XCS still struggles to efficiently find optimal solutions in environments with long action-chains. This paper highlights the strong relation of XCS to reinforcement learning and identifies some of the major differences. This makes it possible to add Eligibility Traces to XCS, a method taken from reinforcement learning to update the prediction of the whole action-chain on each step, which should cause prediction update to be faster and more accurate. However, it is shown that the discrete nature of the condition representation of a classifier and the operation of the genetic algorithm cause traces to propagate back incorrect prediction values and in some cases results in a decrease of system performance. As a result further investigation of the existing approach to generalisation is proposed.