Reinforcement learning algorithms with function approximation: Recent advances and applications

Authors:
Xin Xu;Lei Zuo;Zhenhua Huang
Venue:
Information Sciences: an International Journal
Year:
2014

Quantified Score

Hi-index	0.07

Abstract

In recent years, research on reinforcement learning (RL) has focused on function approximation for learning prediction and learning control in Markov decision processes (MDPs). Function approximation techniques are essential for dealing with MDPs that have large or continuous state and action spaces. This paper gives a comprehensive survey of recent developments in RL algorithms with function approximation. From a theoretical point of view, the convergence and feature representation of RL algorithms are analyzed. From an empirical point of view, the performance of different RL algorithms is evaluated and compared on several benchmark learning prediction and learning control tasks. Applications of RL with function approximation are also discussed. Finally, directions for future work on RL with function approximation are suggested.