TD Model: Modeling the World at a Mixture of Time Scales

Authors:
R. Sutton
Affiliations:
-
Venue:
TD Model: Modeling the World at a Mixture of Time Scales
Year:
1995

Citing 0
Cited 2

Reinforcement Learning in Continuous Time and Space
Neural Computation

Neural-based downlink scheduling algorithm for broadband wireless networks
Computer Communications

Abstract

... hierarchical or multi-level planning and reinforcement learning. In this paper we treat only the prediction problem: that of learning a model and value function for the case of fixed agent behavior. Within this context, we establish the theoretical foundations of multi-scale models and derive TD algorithms for learning them. Two small computational experiments are presented to test and illustrate the theory. This work is an extension and generalization of the work of Singh (1992), Dayan (1993), and Sutton and Pinette (1985).