Building and Breaking the Chain: A Model of Reward Prediction Error Integration and Segmentation of Memory

IF 3.1 3区 医学 Q2 NEUROSCIENCES
Nina Rouhani;David Clewett;James W. Antony
{"title":"Building and Breaking the Chain: A Model of Reward Prediction Error Integration and Segmentation of Memory","authors":"Nina Rouhani;David Clewett;James W. Antony","doi":"10.1162/jocn_a_02215","DOIUrl":null,"url":null,"abstract":"Prediction errors drive reinforcement learning and organize episodic memory into distinct contexts, but do these effects interact? Here, we review the roles of midbrain dopamine, the locus coeruleus, and the hippocampus in event cognition to propose and simulate the theoretical influence of two prediction error signals in integrating versus segmenting events in memory. We suggest that signed reward prediction errors can build mental models of reward environments, increasing the contextual similarity (integration) of experiences with stronger, more stable reward expectations. On the other hand, unsigned reward prediction errors can signal a new model of the environment, generating a contextual shift (segmentation) between experiences that crossed them. We moreover predicted that these differences in contextual similarity give rise to distinct patterns of temporal-order memory. We combined these ideas in a computational model to account for a seemingly paradoxical pattern of temporal-order memory where greater representational distance helps order memory within context but impairs it across contexts. We found that simulating signed reward prediction error integration and unsigned reward prediction error segmentation differentially enabled the model to perform associative chaining, which involved reactivating items between two tested probes to assist with sequential retrieval. In summary, our simulations provide a unifying explanation for the varied ways that neuromodulatory systems may alter event cognition and memory.","PeriodicalId":51081,"journal":{"name":"Journal of Cognitive Neuroscience","volume":"36 11","pages":"2401-2414"},"PeriodicalIF":3.1000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cognitive Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://ieeexplore.ieee.org/document/10738328/","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Prediction errors drive reinforcement learning and organize episodic memory into distinct contexts, but do these effects interact? Here, we review the roles of midbrain dopamine, the locus coeruleus, and the hippocampus in event cognition to propose and simulate the theoretical influence of two prediction error signals in integrating versus segmenting events in memory. We suggest that signed reward prediction errors can build mental models of reward environments, increasing the contextual similarity (integration) of experiences with stronger, more stable reward expectations. On the other hand, unsigned reward prediction errors can signal a new model of the environment, generating a contextual shift (segmentation) between experiences that crossed them. We moreover predicted that these differences in contextual similarity give rise to distinct patterns of temporal-order memory. We combined these ideas in a computational model to account for a seemingly paradoxical pattern of temporal-order memory where greater representational distance helps order memory within context but impairs it across contexts. We found that simulating signed reward prediction error integration and unsigned reward prediction error segmentation differentially enabled the model to perform associative chaining, which involved reactivating items between two tested probes to assist with sequential retrieval. In summary, our simulations provide a unifying explanation for the varied ways that neuromodulatory systems may alter event cognition and memory.
建立和打破记忆链:奖赏预测错误整合与记忆分割模型。
预测错误会推动强化学习并将外显记忆组织到不同的情境中,但这些影响是否会相互影响呢?在这里,我们回顾了中脑多巴胺、神经节和海马在事件认知中的作用,提出并模拟了两种预测错误信号在整合与分割记忆事件中的理论影响。我们认为,有符号的奖赏预测错误可以建立奖赏环境的心理模型,增加具有更强、更稳定的奖赏预期的经历的情境相似性(整合)。另一方面,无符号的奖励预测错误则可能预示着一种新的环境模型,在与之交叉的经历之间产生情境转换(分割)。此外,我们还预测这些情境相似性的差异会产生不同的时序记忆模式。我们将这些想法结合到一个计算模型中,以解释一种看似矛盾的时序记忆模式,即更大的表征距离有助于情境内的时序记忆,但却会损害跨情境记忆。我们发现,模拟有符号奖励预测误差整合和无符号奖励预测误差分割可使模型执行联想连锁,这涉及重新激活两个测试探针之间的项目,以协助顺序检索。总之,我们的模拟为神经调节系统改变事件认知和记忆的各种方式提供了统一的解释。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Cognitive Neuroscience
Journal of Cognitive Neuroscience 医学-神经科学
CiteScore
5.30
自引率
3.10%
发文量
151
审稿时长
3-8 weeks
期刊介绍: Journal of Cognitive Neuroscience investigates brain–behavior interaction and promotes lively interchange among the mind sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信