In recent years, research on reinforcement learning (RL) has focused on function approximation for learning prediction and learning control in Markov decision processes (MDPs). Function approximation is essential for handling MDPs with large or continuous state and action spaces. This paper presents a comprehensive survey of recent developments in RL algorithms with function approximation. On the theoretical side, the convergence and feature representation of RL algorithms are analyzed. On the empirical side, the performance of different RL algorithms is evaluated and compared on several benchmark learning prediction and learning control tasks. Applications of RL with function approximation are also discussed. Finally, directions for future work on RL with function approximation are suggested.