通过生物学启发的强化学习模型，补益多巴胺和价值学习中的偏见相关联

bioRxiv (Cold Spring Harbor Laboratory) Pub Date : 2023-11-14 DOI:10.1101/2023.11.10.566580

Naoshige Uchida, Sandra Romero Pinto

{"title":"通过生物学启发的强化学习模型，补益多巴胺和价值学习中的偏见相关联","authors":"Naoshige Uchida, Sandra Romero Pinto","doi":"10.1101/2023.11.10.566580","DOIUrl":null,"url":null,"abstract":"A hallmark of various psychiatric disorders is biased future predictions. Here we examined the mechanisms for biased value learning using reinforcement learning models incorporating recent findings on synaptic plasticity and opponent circuit mechanisms in the basal ganglia. We show that variations in tonic dopamine can alter the balance between learning from positive and negative reward prediction errors, leading to biased value predictions. This bias arises from the sigmoidal shapes of the dose-occupancy curves and distinct affinities of D1- and D2-type dopamine receptors: changes in tonic dopamine differentially alters the slope of the dose-occupancy curves of these receptors, thus sensitivities, at baseline dopamine concentrations. We show that this mechanism can explain biased value learning in both mice and humans and may also contribute to symptoms observed in psychiatric disorders. Our model provides a foundation for understanding the basal ganglia circuit and underscores the significance of tonic dopamine in modulating learning processes.","PeriodicalId":486943,"journal":{"name":"bioRxiv (Cold Spring Harbor Laboratory)","volume":"22 9","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tonic dopamine and biases in value learning linked through a biologically inspired reinforcement learning model\",\"authors\":\"Naoshige Uchida, Sandra Romero Pinto\",\"doi\":\"10.1101/2023.11.10.566580\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A hallmark of various psychiatric disorders is biased future predictions. Here we examined the mechanisms for biased value learning using reinforcement learning models incorporating recent findings on synaptic plasticity and opponent circuit mechanisms in the basal ganglia. We show that variations in tonic dopamine can alter the balance between learning from positive and negative reward prediction errors, leading to biased value predictions. This bias arises from the sigmoidal shapes of the dose-occupancy curves and distinct affinities of D1- and D2-type dopamine receptors: changes in tonic dopamine differentially alters the slope of the dose-occupancy curves of these receptors, thus sensitivities, at baseline dopamine concentrations. We show that this mechanism can explain biased value learning in both mice and humans and may also contribute to symptoms observed in psychiatric disorders. Our model provides a foundation for understanding the basal ganglia circuit and underscores the significance of tonic dopamine in modulating learning processes.\",\"PeriodicalId\":486943,\"journal\":{\"name\":\"bioRxiv (Cold Spring Harbor Laboratory)\",\"volume\":\"22 9\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"bioRxiv (Cold Spring Harbor Laboratory)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2023.11.10.566580\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv (Cold Spring Harbor Laboratory)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.11.10.566580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

各种精神疾病的一个特点是对未来的预测有偏见。本研究采用强化学习模型，结合最近在基底节区突触可塑性和对手回路机制方面的发现，研究了偏值学习的机制。我们发现，强直性多巴胺的变化可以改变从积极和消极奖励预测错误中学习的平衡，从而导致有偏差的价值预测。这种偏差源于D1型和d2型多巴胺受体的剂量-占用曲线的s型形状和不同的亲和力:在基线多巴胺浓度下，强直性多巴胺的变化不同地改变了这些受体的剂量-占用曲线的斜率，从而改变了灵敏度。我们发现这种机制可以解释小鼠和人类的偏值学习，也可能有助于观察到精神疾病的症状。我们的模型为理解基底神经节回路提供了基础，并强调了强直性多巴胺在调节学习过程中的重要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Tonic dopamine and biases in value learning linked through a biologically inspired reinforcement learning model

A hallmark of various psychiatric disorders is biased future predictions. Here we examined the mechanisms for biased value learning using reinforcement learning models incorporating recent findings on synaptic plasticity and opponent circuit mechanisms in the basal ganglia. We show that variations in tonic dopamine can alter the balance between learning from positive and negative reward prediction errors, leading to biased value predictions. This bias arises from the sigmoidal shapes of the dose-occupancy curves and distinct affinities of D1- and D2-type dopamine receptors: changes in tonic dopamine differentially alters the slope of the dose-occupancy curves of these receptors, thus sensitivities, at baseline dopamine concentrations. We show that this mechanism can explain biased value learning in both mice and humans and may also contribute to symptoms observed in psychiatric disorders. Our model provides a foundation for understanding the basal ganglia circuit and underscores the significance of tonic dopamine in modulating learning processes.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

bioRxiv (Cold Spring Harbor Laboratory)

自引率

0.00%

发文量