Temporal difference learning and TD-Gammon
Communications of the ACM
Multi-time models for temporally abstract planning
NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Learning to Predict by the Methods of Temporal Differences
Machine Learning
KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Between MOPs and Semi-MOP: Learning, Planning & Representing Knowledge at Multiple Temporal Scales
Between MOPs and Semi-MOP: Learning, Planning & Representing Knowledge at Multiple Temporal Scales
Reinforcement learning with selective perception and hidden state
Reinforcement learning with selective perception and hidden state
Taming the beast: guided self-organization of behavior in autonomous robots
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Cognitive agents for sense and respond logistics
DAMAS'05 Proceedings of the 2005 international conference on Defence Applications of Multi-Agent Systems
Hi-index | 0.00 |