Feature-based methods for large scale dynamic programming. Machine Learning, special issue on reinforcement learning.
Multiple paired forward and inverse models for motor control. Neural Networks, special issue on neural control and robotics: biology and technology.
Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artificial Intelligence.
Multiple model-based reinforcement learning. Neural Computation.
Q-Cut — dynamic discovery of sub-goals in reinforcement learning. ECML '02: Proceedings of the 13th European Conference on Machine Learning.
Automatic discovery of subgoals in reinforcement learning using diverse density. ICML '01: Proceedings of the Eighteenth International Conference on Machine Learning.
Reinforcement learning in POMDPs with function approximation. ICML '97: Proceedings of the Fourteenth International Conference on Machine Learning.
Reinforcement learning with via-point representation. Neural Networks.
On-line EM algorithm for the normalized Gaussian network. Neural Computation.
Continuous action sets are used in many reinforcement learning (RL) applications to robot control, since the control input is continuous. Discrete action sets, however, offer ease of implementation and compatibility with sophisticated RL methods such as Dyna [1]. One outstanding problem is the absence of general principles for designing a discrete action set for robot control in a high-dimensional input space. In this paper, we propose constructing a discrete action set from a given set of basis functions (BFs), designed so that the size of the set is proportional to the number of BFs. This approach exploits a property of function approximators in practice: in practical RL applications, the number of BFs does not grow exponentially with the dimension of the state space (e.g., [2]). Consequently, the size of the proposed action set does not grow exponentially with the dimension of the input space either. We apply RL with the proposed action set to a robot navigation task and to crawling and jumping tasks. Simulation results demonstrate that, compared with a conventional discrete action set, the proposed action set improves learning speed and achieves better final performance.
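The key scaling idea in the abstract — an action set whose size tracks the number of basis functions rather than the dimension of the input space — can be sketched as follows. This is an illustrative assumption, not the paper's actual construction: here each normalized Gaussian BF contributes one discrete action, a fixed-gain step toward that BF's center, so the action count equals the BF count regardless of state dimension.

```python
import numpy as np

def gaussian_bf(state, centers, width=0.5):
    """Normalized Gaussian basis function activations at `state`.

    `centers` has shape (num_bfs, state_dim); the number of BFs is
    chosen by the designer and need not grow with state_dim.
    """
    d2 = np.sum((centers - state) ** 2, axis=1)
    phi = np.exp(-d2 / (2.0 * width ** 2))
    return phi / phi.sum()

def make_action_set(centers, gain=0.5):
    """One discrete action per BF: a step of size `gain` toward
    that BF's center (a hypothetical construction for illustration)."""
    return [lambda s, c=c: gain * (c - s) for c in centers]

# Four BF centers in a 2-D state space -> four discrete actions.
centers = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
actions = make_action_set(centers)

state = np.array([0.2, 0.2])
print(len(actions))        # one action per BF
print(actions[3](state))   # step from `state` toward center (1, 1)
```

Under this sketch, doubling the state dimension only changes the length of each center vector, not the number of actions, which is the scaling property the abstract claims for the proposed action set.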