The role of reinforcement learning in shaping the decision policy in methamphetamine use disorders
Sadegh Ghaderi, Mohammad Hemami, Reza Khosrowabadi, Jamal Amani Rad
Journal of Choice Modelling, Volume 50, Article 100469. Published 2024-01-23. DOI: 10.1016/j.jocm.2024.100469
https://www.sciencedirect.com/science/article/pii/S1755534524000022
Abstract
The prevalence of methamphetamine use disorder (MUD), a major public health problem, has increased dramatically over the last two decades, reaching epidemic levels and imposing high costs on health care systems worldwide. MUD is commonly associated with aberrant experience-based decision-making (EDM). However, the precise mechanisms underlying such non-optimality in choice patterns remain poorly understood. In this study, to uncover the latent, neurobiologically and psychologically meaningful processes behind this impairment, we apply a reinforcement learning diffusion decision model (RL-DDM) to data from methamphetamine-abusing participants (n = 18, all men; mean (±SD) age: 27.3 ± 5) and age- and sex-matched healthy controls (n = 25, all men; mean (±SD) age: 26.8 ± 3.63) who made choices to resolve uncertainty in a simple probabilistic learning task with rewards and punishments. Preliminary behavioral results indicated that addicts showed maladaptive patterns of learning that were mirrored in both choices and response times (RTs). Furthermore, modeling results revealed that this EDM impairment (a maladaptive pattern in optimal selection) in addicts was attributable to both increased learning rates (greater sensitivity to outcome fluctuations) and decreased drift rates (lower reward sensitivity) compared with healthy controls. In addition, addicts showed substantially longer non-decision times (attributed to slower RTs) and lower decision boundary criteria (a reflection of impulsive choice). Taken together, our findings reveal precise mechanisms associated with EDM impairments in methamphetamine use disorder and confirm impairment of the option-value assignment system as the main hub of learning-based decision making.
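To make the four model parameters named in the abstract concrete (learning rate, drift rate, non-decision time, decision boundary), the following is a minimal simulation sketch of a generic RL-DDM. It is not the authors' implementation. The function names, the two-option task, the ±1 outcome coding, the reward probabilities, the linear mapping from value difference to drift (`drift_scaling`), and the Euler step size are all assumptions chosen for illustration.

```python
import random


def simulate_ddm_trial(drift, boundary, non_decision_time, rng,
                       dt=0.001, noise_sd=1.0, max_time=5.0):
    """Simulate one diffusion trial with an unbiased starting point.

    Evidence starts midway between 0 and `boundary` and is integrated with
    Euler steps until it crosses a bound (or `max_time` elapses).
    Returns (choice, rt); choice 0 corresponds to the upper bound.
    """
    evidence = boundary / 2.0
    elapsed = 0.0
    sqrt_dt = dt ** 0.5
    while 0.0 < evidence < boundary and elapsed < max_time:
        evidence += drift * dt + noise_sd * sqrt_dt * rng.gauss(0.0, 1.0)
        elapsed += dt
    # On timeout, pick the bound the evidence is closer to (an assumption).
    choice = 0 if evidence >= boundary / 2.0 else 1
    return choice, elapsed + non_decision_time


def simulate_rl_ddm(n_trials, learning_rate, drift_scaling, boundary,
                    non_decision_time, reward_probs=(0.8, 0.2), seed=0):
    """Simulate a two-option probabilistic learning task with an RL-DDM.

    Option values are learned with a delta rule, and the trial-wise drift
    is assumed to be proportional to the current value difference.
    Outcomes are coded +1 (reward) and -1 (punishment); both the coding
    and `reward_probs` are hypothetical.
    """
    rng = random.Random(seed)
    q_values = [0.0, 0.0]
    trials = []
    for _ in range(n_trials):
        drift = drift_scaling * (q_values[0] - q_values[1])
        choice, rt = simulate_ddm_trial(drift, boundary,
                                        non_decision_time, rng)
        outcome = 1.0 if rng.random() < reward_probs[choice] else -1.0
        # Delta-rule update of the chosen option only.
        q_values[choice] += learning_rate * (outcome - q_values[choice])
        trials.append((choice, rt, outcome))
    return trials, q_values


if __name__ == "__main__":
    trials, q_values = simulate_rl_ddm(
        n_trials=100, learning_rate=0.2, drift_scaling=2.0,
        boundary=1.5, non_decision_time=0.3, seed=1)
    mean_rt = sum(rt for _, rt, _ in trials) / len(trials)
    print("final option values:", q_values)
    print("mean RT (s):", round(mean_rt, 3))
```

In a sketch like this, the group differences reported in the abstract would correspond to a larger `learning_rate`, a smaller `drift_scaling`, a longer `non_decision_time`, and a lower `boundary` for the MUD group. The specific values above are placeholders, not estimates from the study.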