Dopamine ramps are a consequence of reward prediction errors

  • Authors:
  • Samuel J. Gershman

  • Affiliations:
  • -

  • Venue:
  • Neural Computation
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

Temporal difference learning models of dopamine assert that phasic levels of dopamine encode a reward prediction error. However, this hypothesis has been challenged by recent observations of gradually ramping stratal dopamine levels as a goal is approached. This note describes conditions under which temporal difference learning models predict dopamine ramping. The key idea is representational: a quadratic transformation of proximity to the goal implies approximately linear ramping, as observed experimentally.