A novel feature sparsification method for kernel-based approximate policy iteration

Authors:
Zhenhua Huang;Chunming Liu;Xin Xu;Chuanqiang Lian;Jun Wu
Affiliations:
Institute of Automation, National University of Defense Technology, Changsha, P.R. China;Institute of Automation, National University of Defense Technology, Changsha, P.R. China;Institute of Automation, National University of Defense Technology, Changsha, P.R. China;Institute of Automation, National University of Defense Technology, Changsha, P.R. China;Institute of Automation, National University of Defense Technology, Changsha, P.R. China
Venue:
ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Year:
2012

Citing 8
Cited 0

An introduction to support Vector Machines: and other kernel-based learning methods

An introduction to support Vector Machines: and other kernel-based learning methods
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Kernel-Based Reinforcement Learning

Machine Learning
Technical Update: Least-Squares Temporal Difference Learning

Machine Learning
Least-squares policy iteration

The Journal of Machine Learning Research
Efficient reinforcement learning using recursive least-squares methods

Journal of Artificial Intelligence Research
Reinforcement learning: a survey

Journal of Artificial Intelligence Research
Kernel-Based Least Squares Policy Iteration for Reinforcement Learning

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a novel feature sparsification approach for a class of kernel-based approximate policy iteration algorithms called KLSPI. We firstly introduce the relative approximation error in the sparsification process based on the approximate linear dependence (ALD) analysis. The relative approximation error is used as the criterion for selecting the kernel-based features. An improved KLSPI algorithm is also proposed by integrating the new sparsification method with KLSPI. Experimental results on the Inverted Pendulum problem demonstrate that the proposed sparsification method can obtain a smaller size of kernel dictionary than the previous ALD method. Furthermore, by using the more representative samples as the kernel dictionary, the precision of value function approximation has been increased. The improved KLSPI algorithm can also achieve better learning efficiency and policy quality than the original one. The feasibility and validity of the new method are proven.