TD Model: Modeling the World at a Mixture of Time Scales

  • Authors:
  • R. Sutton

  • Venue:
  • TD Model: Modeling the World at a Mixture of Time Scales
  • Year:
  • 1995

Abstract

[...] hierarchical or multi-level planning and reinforcement learning. In this paper we treat only the prediction problem, that of learning a model and value function for the case of fixed agent behavior. Within this context, we establish the theoretical foundations of multi-scale models and derive TD algorithms for learning them. Two small computational experiments are presented to test and illustrate the theory. This work is an extension and generalization of the work of Singh (1992), Dayan (1993), and Sutton and Pinette (1985).