Optogenetic Blockade of Dopamine Transients Prevents Learning Induced by Changes in Reward Features
MetadataShow full item record
AbstractPrediction errors are critical for associative learning [1, 2]. Transient changes in dopamine neuron activity correlate with positive and negative reward prediction errors and can mimic their effects [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]. However, although causal studies show that dopamine transients of 1–2 s are sufficient to drive learning about reward, these studies do not address whether they are necessary (but see ). Further, the precise nature of this signal is not yet fully established. Although it has been equated with the cached-value error signal proposed to support model-free reinforcement learning, cached-value errors are typically confounded with errors in the prediction of reward features . Here, we used optogenetic and transgenic approaches to prevent transient changes in midbrain dopamine neuron activity during the critical error-signaling period of two unblocking tasks. In one, learning was unblocked by increasing the number of rewards, a manipulation that induces errors in predicting both value and reward features. In another, learning was unblocked by switching from one to another equally valued reward, a manipulation that induces errors only in reward feature prediction. Preventing dopamine neurons in the ventral tegmental area from firing for 5 s beginning before and continuing until after the changes in reward prevented unblocking of learning in both tasks. A similar duration suppression did not induce extinction when delivered during an expected reward, indicating that it did not act independently as a negative prediction error. This result suggests that dopamine transients play a general role in error signaling rather than being restricted to only signaling errors in value. Copyright 2017
SponsorsThis work was supported by the Intramural Research Program at the National Institute on Drug Abuse ( ZIA-DA000587 ).
Identifier to cite or link to this itemhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85035014998&doi=10.1016%2fj.cub.2017.09.049&partnerID=40&md5=fff763fdff83213af983cffc8a006ea5; http://hdl.handle.net/10713/10041
- Brief optogenetic inhibition of dopamine neurons mimics endogenous negative reward prediction errors.
- Authors: Chang CY, Esber GR, Marrero-Garcia Y, Yau HJ, Bonci A, Schoenbaum G
- Issue date: 2016 Jan
- A causal link between prediction errors, dopamine neurons and learning.
- Authors: Steinberg EE, Keiflin R, Boivin JR, Witten IB, Deisseroth K, Janak PH
- Issue date: 2013 Jul
- Dopamine Neurons Respond to Errors in the Prediction of Sensory Features of Expected Rewards.
- Authors: Takahashi YK, Batchelor HM, Liu B, Khanna A, Morales M, Schoenbaum G
- Issue date: 2017 Sep 13
- Brief, But Not Prolonged, Pauses in the Firing of Midbrain Dopamine Neurons Are Sufficient to Produce a Conditioned Inhibitor.
- Authors: Chang CY, Gardner MPH, Conroy JC, Whitaker LR, Schoenbaum G
- Issue date: 2018 Oct 10
- Dopamine transients do not act as model-free prediction errors during associative learning.
- Authors: Sharpe MJ, Batchelor HM, Mueller LE, Yun Chang C, Maes EJP, Niv Y, Schoenbaum G
- Issue date: 2020 Jan 8