Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

  • Authors:
  • Itsuki Noda

  • Affiliations:
  • Information Technology Research Institute, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan 305-8568 and School of Information Science, Japan Advanced Institute of ...

  • Venue:
  • PRIMA '09 Proceedings of the 12th International Conference on Principles of Practice in Multi-Agent Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for non-stationary environments. When the environment is non-stationary, the learning agent must adapt learning parameters like stepsize to the changes of environment through continuous learning. We show several theorems on higher-order derivatives of exponential moving average, which is a base schema of major reinforcement learning methods, using stepsize parameters. We also derive a systematic mechanism to calculate these derivatives in a recursive manner. Based on it, we construct a precise and flexible adaptation method for the stepsize parameter in order to maximize a certain criterion. The proposed method is also validated by several experimental results.