Reinforcement learning with via-point representation

  • Authors:
  • Hiroyuki Miyamoto;Jun Morimoto;Kenji Doya;Mitsuo Kawato

  • Affiliations:
  • Kawato Dynamic Brain Project, Japan Science and Technology Corporation, Kyoto, Japan and Graduate School of Life Science and Systems Engineering, Kyushu Institute of Technology, Hibikino 2-4, Waka ...;ATR Computational Neuroscience Laboratories, Kyoto, Japan;ATR Computational Neuroscience Laboratories, Kyoto, Japan;ATR Computational Neuroscience Laboratories, Kyoto, Japan

  • Venue:
  • Neural Networks
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a new learning framework for motor control. This framework consists of two components: reinforcement learning and via-point representation. In the field of motor control, conventional reinforcement learning has been used to acquire control sequences such as cart-pole or stand-up robot control. Recently, researchers have become interested in hierarchical architecture, such as multiple levels, and multiple temporal and spatial scales. Our new framework contains two levels of hierarchical architecture. The higher level is implemented using via-point representation, which corresponds to macro-actions or multiple time scales. The lower level is implemented using a trajectory generator that produces primitive actions. Our framework can modify the ongoing movement by means of temporally localized via-points and trajectory generation. Successful results are obtained in computer simulation of the cart-pole swing up task.