RL-CD: dealing with non-stationarity in reinforcement learning

Authors:
Bruno C. da Silva;Eduardo W. Basso;Ana L. C. Bazzan;Paulo M. Engel
Affiliations:
Instituto de Informática, UFRGS, Porto Alegre, Brazil;Instituto de Informática, UFRGS, Porto Alegre, Brazil;Instituto de Informática, UFRGS, Porto Alegre, Brazil;Instituto de Informática, UFRGS, Porto Alegre, Brazil
Venue:
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Year:
2006

Citing 2
Cited 0

Multiple model-based reinforcement learning

Neural Computation
Reinforcement learning: a survey

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

This student abstract describes ongoing investigations regarding an approach for dealing with non-stationarity in reinforcement learning (RL) problems. We briefly propose and describe a method for managing multiple partial models of the environment and comment previous results which show that the proposed mechanism has better convergence times comparing to standard RL algorithms. Current efforts include the development of a more robust approach, capable of dealing with noisy environments, and also investigations regarding the possibility of using partial models in order to aliviate learning problems in systems with an explosive number of states.