Dopamine says do that again

IF 20 1区 医学 Q1 NEUROSCIENCES
Luis A. Mejia
{"title":"Dopamine says do that again","authors":"Luis A. Mejia","doi":"10.1038/s41593-025-01995-2","DOIUrl":null,"url":null,"abstract":"<p>Repeated actions may be reinforced through action prediction error — the difference between an executed action and its prediction given a particular state — but experimental evidence for such a movement-based, value-free teaching signal has been lacking. Greenstreet, Martinez Vergara, Johansson et al. trained mice on an auditory discrimination task and found that dopamine in the tail of the striatum (TS) is needed for learning, and that TS dopamine levels correlate with movement, but not reward, on the choice task. The movement-related dopamine signal in TS decreased over the course of learning, increased when an unfamiliar stimulus was used, and reinforced stimulus–action associations on the choice task; these findings are consistent with dopamine encoding a value-free action prediction error. A dual-controller model incorporating both value-based (reward prediction error via ventral striatum) and value-free (action prediction error via TS) systems learned the task faster than a value-based controller alone and implicated a role for TS during the later stages of learning. These findings suggest that movement-related dopamine signals in TS, and perhaps in other parts of the dorsal striatum, encode a value-free teaching signal that reinforces state–action associations, promoting the repetition of actions. Action prediction error may thus serve as the basis for the implementation of habitual behavior.</p><p><b>Original reference:</b> <i>Nature</i> https://doi.org/10.1038/s41586-025-09008-9 (2025)</p>","PeriodicalId":19076,"journal":{"name":"Nature neuroscience","volume":"523 1","pages":""},"PeriodicalIF":20.0000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41593-025-01995-2","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Repeated actions may be reinforced through action prediction error — the difference between an executed action and its prediction given a particular state — but experimental evidence for such a movement-based, value-free teaching signal has been lacking. Greenstreet, Martinez Vergara, Johansson et al. trained mice on an auditory discrimination task and found that dopamine in the tail of the striatum (TS) is needed for learning, and that TS dopamine levels correlate with movement, but not reward, on the choice task. The movement-related dopamine signal in TS decreased over the course of learning, increased when an unfamiliar stimulus was used, and reinforced stimulus–action associations on the choice task; these findings are consistent with dopamine encoding a value-free action prediction error. A dual-controller model incorporating both value-based (reward prediction error via ventral striatum) and value-free (action prediction error via TS) systems learned the task faster than a value-based controller alone and implicated a role for TS during the later stages of learning. These findings suggest that movement-related dopamine signals in TS, and perhaps in other parts of the dorsal striatum, encode a value-free teaching signal that reinforces state–action associations, promoting the repetition of actions. Action prediction error may thus serve as the basis for the implementation of habitual behavior.

Original reference: Nature https://doi.org/10.1038/s41586-025-09008-9 (2025)

多巴胺说再来一次
重复的动作可能会通过动作预测误差(一个已执行的动作与其在特定状态下的预测之间的差异)而得到加强,但这种基于动作的、与价值无关的教学信号的实验证据一直缺乏。Greenstreet, Martinez Vergara, Johansson等人训练小鼠进行听觉辨别任务,发现纹状体尾部的多巴胺是学习所必需的,并且在选择任务中,纹状体尾部的多巴胺水平与运动有关,而与奖励无关。运动相关的多巴胺信号在学习过程中减少,在使用不熟悉的刺激时增加,并在选择任务中加强刺激-行动关联;这些发现与多巴胺编码价值无关的行为预测误差一致。结合基于值(通过腹侧纹状体的奖励预测误差)和无值(通过TS的动作预测误差)系统的双控制器模型比单独基于值的控制器更快地学习任务,并且在学习的后期阶段暗示了TS的作用。这些发现表明,TS中与运动相关的多巴胺信号,可能在背纹状体的其他部分,编码了一种与价值无关的教学信号,这种信号加强了状态-动作的联系,促进了动作的重复。因此,行为预测误差可以作为习惯行为实施的基础。原始参考文献:Nature https://doi.org/10.1038/s41586-025-09008-9 (2025)
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Nature neuroscience
Nature neuroscience 医学-神经科学
CiteScore
38.60
自引率
1.20%
发文量
212
审稿时长
1 months
期刊介绍: Nature Neuroscience, a multidisciplinary journal, publishes papers of the utmost quality and significance across all realms of neuroscience. The editors welcome contributions spanning molecular, cellular, systems, and cognitive neuroscience, along with psychophysics, computational modeling, and nervous system disorders. While no area is off-limits, studies offering fundamental insights into nervous system function receive priority. The journal offers high visibility to both readers and authors, fostering interdisciplinary communication and accessibility to a broad audience. It maintains high standards of copy editing and production, rigorous peer review, rapid publication, and operates independently from academic societies and other vested interests. In addition to primary research, Nature Neuroscience features news and views, reviews, editorials, commentaries, perspectives, book reviews, and correspondence, aiming to serve as the voice of the global neuroscience community.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信