Reinforcement Learning: Past, Present and Future

Authors:
Richard S. Sutton
Affiliations:
-
Venue:
SEAL'98 Selected papers from the Second Asia-Pacific Conference on Simulated Evolution and Learning on Simulated Evolution and Learning
Year:
1998

Temporal difference learning and TD-Gammon

Communications of the ACM
Multi-time models for temporally abstract planning

NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Learning to Predict by the Methods of Temporal Differences

Machine Learning
KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Between MOPs and Semi-MOP: Learning, Planning & Representing Knowledge at Multiple Temporal Scales

Between MOPs and Semi-MOP: Learning, Planning & Representing Knowledge at Multiple Temporal Scales
Reinforcement learning with selective perception and hidden state

Reinforcement learning with selective perception and hidden state

Taming the beast: guided self-organization of behavior in autonomous robots

SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Cognitive agents for sense and respond logistics

DAMAS'05 Proceedings of the 2005 international conference on Defence Applications of Multi-Agent Systems

Hi-index	0.00