Relative Value Function Approximation TITLE2:

  • Authors:
  • Paul E. Utgoff;Doina Precup

  • Affiliations:
  • -;-

  • Venue:
  • Relative Value Function Approximation TITLE2:
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

A form of temporal difference learning is presented that learns the relative utility of states, instead of the absolute utility. This formulation backs up decisions instead of values, making it possible to learn a simpler function for defining a decision-making policy. A nonlinear relative value function can be learned without increasing the dimensionality of the inputs.