Dopamine ramps are a consequence of reward prediction errors

Authors:
Samuel J. Gershman
Affiliations:
-
Venue:
Neural Computation
Year:
2014

Citing 4
Cited 0

Linear least-squares algorithms for temporal difference learning

Machine Learning - Special issue on reinforcement learning
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Representation and timing in theories of the dopamine system

Neural Computation
Stimulus representation and the timing of reward-prediction errors in models of the dopamine system

Neural Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Temporal difference learning models of dopamine assert that phasic levels of dopamine encode a reward prediction error. However, this hypothesis has been challenged by recent observations of gradually ramping stratal dopamine levels as a goal is approached. This note describes conditions under which temporal difference learning models predict dopamine ramping. The key idea is representational: a quadratic transformation of proximity to the goal implies approximately linear ramping, as observed experimentally.