On-Line reinforcement learning using cascade constructive neural networks

Authors:
Peter Vamplew;Robert Ollington
Affiliations:
School of Computing, University of Tasmania, Hobart, Tasmania, Australia;School of Computing, University of Tasmania, Hobart, Tasmania, Australia
Venue:
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
Year:
2005

Citing 5
Cited 0

The cascade-correlation learning architecture

Advances in neural information processing systems 2
Reinforcement learning for robots using neural networks

Reinforcement learning for robots using neural networks
Temporal difference learning and TD-Gammon

Communications of the ACM
Investigation of the CasCor family of learning algorithms

Neural Networks
Learning to Predict by the Methods of Temporal Differences

Machine Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to use function approximation. Neural networks are one commonly used approach, with most work so far using fixed-architecture networks. Previous supervised learning research has shown that constructive networks which grow their architecture during training outperform fixed-architecture networks. This paper extends the sarsa algorithm to use a cascade constructive network, and shows it outperforms a fixed-architecture network on two benchmark tasks.