On-Line reinforcement learning using cascade constructive neural networks

  • Authors:
  • Peter Vamplew;Robert Ollington

  • Affiliations:
  • School of Computing, University of Tasmania, Hobart, Tasmania, Australia;School of Computing, University of Tasmania, Hobart, Tasmania, Australia

  • Venue:
  • KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to use function approximation. Neural networks are one commonly used approach, with most work so far using fixed-architecture networks. Previous supervised learning research has shown that constructive networks which grow their architecture during training outperform fixed-architecture networks. This paper extends the sarsa algorithm to use a cascade constructive network, and shows it outperforms a fixed-architecture network on two benchmark tasks.