RL-CD: dealing with non-stationarity in reinforcement learning

  • Authors:
  • Bruno C. da Silva;Eduardo W. Basso;Ana L. C. Bazzan;Paulo M. Engel

  • Affiliations:
  • Instituto de Informática, UFRGS, Porto Alegre, Brazil;Instituto de Informática, UFRGS, Porto Alegre, Brazil;Instituto de Informática, UFRGS, Porto Alegre, Brazil;Instituto de Informática, UFRGS, Porto Alegre, Brazil

  • Venue:
  • AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This student abstract describes ongoing investigations regarding an approach for dealing with non-stationarity in reinforcement learning (RL) problems. We briefly propose and describe a method for managing multiple partial models of the environment and comment previous results which show that the proposed mechanism has better convergence times comparing to standard RL algorithms. Current efforts include the development of a more robust approach, capable of dealing with noisy environments, and also investigations regarding the possibility of using partial models in order to aliviate learning problems in systems with an explosive number of states.