Temporal difference learning and TD-Gammon
Communications of the ACM
On the worst-case analysis of temporal-difference learning algorithms
Machine Learning - Special issue on reinforcement learning
Reinforcement learning with replacing eligibility traces
Machine Learning - Special issue on reinforcement learning
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Dynamical membership functions: an approach for adaptive fuzzy modelling
Fuzzy Sets and Systems
Adaptive fuzzy control of satellite attitude by reinforcement learning
IEEE Transactions on Fuzzy Systems
Online learning control by association and reinforcement
IEEE Transactions on Neural Networks
Hi-index | 0.00 |
In on-line applications, reinforcement learning based algorithms allow to take into account the environment information in order to propose an action policy for the overall optimization objectives. In this work, it is presented a learning algorithm based on reinforcement learning and temporal differences allowing the on-line parameters adjustment for identification tasks. As a consequence, the reinforcement signal is generically defined in order to minimize the temporal difference.