Dopamine Ramps Are a Consequence of Reward Prediction Errors
Name
Gershman-2013-Dopamine Ramps Are a.pdf
Size
145.08 KB
Format
Adobe PDF
Checksum (MD5)
8b21d9b3db2cda2edd0052c5e7a5d6b4
Author(s)
Gershman, Samuel J.
Date Issued
February 2014
Journal
Neural Computation
Publisher
MIT Press
Citation
Gershman, Samuel J. “Dopamine Ramps Are a Consequence of Reward Prediction Errors.” Neural Computation 26, no. 3 (March 2014): 467–471. © 2014 Massachusetts Institute of Technology
Version
Final published version
Abstract
Temporal difference learning models of dopamine assert that phasic levels of dopamine encode a reward prediction error. However, this hypothesis has been challenged by recent observations of gradually ramping stratal dopamine levels as a goal is approached. This note describes conditions under which temporal difference learning models predict dopamine ramping. The key idea is representational: a quadratic transformation of proximity to the goal implies approximately linear ramping, as observed experimentally.
MIT Department
Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
Terms of Use
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Persistent DSpace Link
DOI of Published Version
http://dx.doi.org/10.1162/NECO_a_00559