The role of reinforcement learning in shaping the decision policy in methamphetamine use disorders

IF 2.4 3区经济学 Q1 ECONOMICS

Journal of Choice Modelling Pub Date : 2024-01-23 DOI:10.1016/j.jocm.2024.100469

Sadegh Ghaderi, Mohammad Hemami, Reza Khosrowabadi, Jamal Amani Rad

{"title":"The role of reinforcement learning in shaping the decision policy in methamphetamine use disorders","authors":"Sadegh Ghaderi, Mohammad Hemami, Reza Khosrowabadi, Jamal Amani Rad","doi":"10.1016/j.jocm.2024.100469","DOIUrl":null,"url":null,"abstract":"<div><p>The prevalence of methamphetamine use disorder (MUD) as a major public health problem has increased dramatically over the last two decades, reaching epidemic levels, which pose high costs to the health care systems worldwide and is commonly associated with experience-based decision-making (EDM) aberrant. However, precise mechanisms underlying such non-optimally in choice patterns still remain poorly understood. In this study, to uncover the latent neurobiological and psychological meaningful processes of such impairment, we apply a reinforcement learning diffusion decision model (RL-DDM) while methamphetamine abuser participants (<span><math><mrow><mi>n</mi><mo>=</mo><mn>18</mn></mrow></math></span>, all men; mean (±SD) age: 27.3±5) and age/sex-matched healthy controls (<span><math><mrow><mi>n</mi><mo>=</mo><mn>25</mn></mrow></math></span>, all men; mean (±SD) age: 26.8.0±3.63) perform choices to resolve uncertainty within a simple probabilistic learning task with rewards and punishments. Preliminary behavior results indicated that addicts made maladaptive patterns of learning that mirrored in both choices and response times (RTs). Furthermore, modeling results revealed that such EDM impairment (maladaptive pattern in optimal selection) in addicts was more imputable to both increased learning rates (more sensitive to outcome fluctuations) and decreased drift rate (less reward sensitivity) compared to healthy. In addition, addicts also showed substantially longer non-decision times (attributed to slower RTs), as well as lower decision boundary criteria (reflection of impulsive choice). Taken together, our findings reveal precise mechanisms associated with EDM impairments in methamphetamine use disorder and confirm the debility of the options values assignment system as the main hub in learning-based decision making.</p></div>","PeriodicalId":46863,"journal":{"name":"Journal of Choice Modelling","volume":"50 ","pages":"Article 100469"},"PeriodicalIF":2.4000,"publicationDate":"2024-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1755534524000022/pdfft?md5=51b91dba15f58c371ab69e2479d02428&pid=1-s2.0-S1755534524000022-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Choice Modelling","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1755534524000022","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

Abstract

The prevalence of methamphetamine use disorder (MUD) as a major public health problem has increased dramatically over the last two decades, reaching epidemic levels, which pose high costs to the health care systems worldwide and is commonly associated with experience-based decision-making (EDM) aberrant. However, precise mechanisms underlying such non-optimally in choice patterns still remain poorly understood. In this study, to uncover the latent neurobiological and psychological meaningful processes of such impairment, we apply a reinforcement learning diffusion decision model (RL-DDM) while methamphetamine abuser participants ( $n = 18$ , all men; mean (±SD) age: 27.3±5) and age/sex-matched healthy controls ( $n = 25$ , all men; mean (±SD) age: 26.8.0±3.63) perform choices to resolve uncertainty within a simple probabilistic learning task with rewards and punishments. Preliminary behavior results indicated that addicts made maladaptive patterns of learning that mirrored in both choices and response times (RTs). Furthermore, modeling results revealed that such EDM impairment (maladaptive pattern in optimal selection) in addicts was more imputable to both increased learning rates (more sensitive to outcome fluctuations) and decreased drift rate (less reward sensitivity) compared to healthy. In addition, addicts also showed substantially longer non-decision times (attributed to slower RTs), as well as lower decision boundary criteria (reflection of impulsive choice). Taken together, our findings reveal precise mechanisms associated with EDM impairments in methamphetamine use disorder and confirm the debility of the options values assignment system as the main hub in learning-based decision making.

查看原文本刊更多论文

强化学习在形成甲基苯丙胺使用障碍决策政策中的作用

甲基苯丙胺使用障碍（MUD）作为一个主要的公共卫生问题，其流行率在过去二十年里急剧上升，达到了流行病的水平，给全世界的医疗保健系统带来了高昂的成本，并且通常与基于经验的决策（EDM）失常有关。然而，人们对这种非最佳选择模式的确切机制仍然知之甚少。在本研究中，为了揭示这种障碍的潜在神经生物学和心理学意义过程，我们应用了强化学习扩散决策模型（RL-DDM），让甲基苯丙胺滥用者（18 人，均为男性；平均（±SD）年龄：27.3±5）和年龄/性别匹配的健康对照者（25 人，均为男性；平均（±SD）年龄：26.8.0±3.63）在一个简单的奖惩概率学习任务中进行选择，以解决不确定性。初步的行为结果表明，成瘾者的学习模式是不适应的，这反映在选择和反应时间（RT）上。此外，建模结果显示，与健康人相比，成瘾者的这种 EDM 损伤（最优选择中的不适应模式）更多地归因于学习率的增加（对结果波动更敏感）和漂移率的降低（对奖赏的敏感性降低）。此外，成瘾者还表现出更长的非决策时间（归因于更慢的RT），以及更低的决策边界标准（反映了冲动性选择）。综上所述，我们的研究结果揭示了与甲基苯丙胺使用障碍中的EDM损伤相关的精确机制，并证实了选项价值分配系统作为基于学习的决策制定的主要枢纽的脆弱性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Choice Modelling ECONOMICS-

CiteScore

4.10

自引率

12.50%

发文量