The preference for surprise in reinforcement learning underlies the differences in developmental changes in risk preference between autistic and neurotypical youth.
Authors: Motofumi Sumiya, Kentaro Katahira, Hironori Akechi, Atsushi Senju
Journal: Molecular Autism, 16(1):3. Published 2025-01-16.
DOI: https://doi.org/10.1186/s13229-025-00637-5
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11740557/pdf/
Background: Risk preference changes nonlinearly across development. Although extensive developmental research on the neurotypical (NTP) population has shown that risk preference is highest during adolescence, developmental changes in risk preference in autistic (AUT) people, who tend to prefer predictable behaviors, have not been investigated. Here, we aimed to investigate these changes and underlying computational mechanisms.
Method: We ran a game-like risk-sensitive reinforcement learning task on 75 participants aged 6-30 years (AUT group, n = 31; NTP group, n = 44). Focusing on choices between alternatives with the same objective value but different risks, we calculated the risk preference and the stay probability of a risky choice after a rewarding or non-rewarding outcome. These measures were analyzed using t-tests and multiple regression. Using the choice data of each participant, we fit four reinforcement learning models and compared how well each model fit the data. Furthermore, we validated the model-fitting results with multiple methods: model recovery, parameter recovery, and posterior predictive checks.
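The two behavioral measures named in the Method can be illustrated with a minimal sketch. The abstract does not give the exact definitions, so the function names, the trial encoding, and the use of consecutive trials for "stay" are assumptions made for illustration only.

```python
from typing import Optional, Sequence


def risk_preference(chose_risky: Sequence[bool]) -> float:
    """Proportion of equal-value trials on which the risky option was chosen."""
    return sum(chose_risky) / len(chose_risky)


def stay_probability(
    chose_risky: Sequence[bool],
    rewarded: Sequence[bool],
    after_reward: bool,
) -> Optional[float]:
    """P(risky on the next trial | risky chosen now, outcome == after_reward).

    Assumption: "stay" is scored on the immediately following trial.
    Returns None when no trial meets the condition.
    """
    stays = 0
    total = 0
    for t in range(len(chose_risky) - 1):
        if chose_risky[t] and rewarded[t] == after_reward:
            total += 1
            stays += int(chose_risky[t + 1])
    return stays / total if total else None


# Hypothetical choice data for one participant (not from the study).
choices = [True, True, False, True, True, False]
outcomes = [True, False, False, False, True, False]

print(risk_preference(choices))
print(stay_probability(choices, outcomes, after_reward=True))
print(stay_probability(choices, outcomes, after_reward=False))
```

In the actual task the measures would be restricted to the choice pairs with equal objective value; the sketch assumes that filtering has already been done.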
Results: We found a significant difference in nonlinear developmental changes in risk preference between the AUT and NTP groups. Computational modeling with reinforcement learning models revealed that individual preferences for surprise modulated risk preference.
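One way a preference for surprise could shape risk preference in a reinforcement learning model is sketched below. The abstract does not give the model equations, so everything here is an assumption for illustration: the update rule (prediction error plus a term proportional to its absolute value), the hypothetical parameters `alpha` (learning rate), `beta` (inverse temperature) and `d` (surprise preference), and the task values (a sure 0.5 versus a 50/50 gamble between 0 and 1).

```python
import math
import random


def update(q: float, reward: float, alpha: float, d: float) -> float:
    """Value update with a surprise term (assumed form, not the paper's).

    delta is the reward prediction error; abs(delta) stands in for surprise.
    d > 0 treats surprise as attractive, d < 0 as aversive.
    """
    delta = reward - q
    return q + alpha * (delta + d * abs(delta))


def simulate_risk_preference(
    d: float,
    alpha: float = 0.1,
    beta: float = 5.0,
    n_trials: int = 2000,
    seed: int = 0,
) -> float:
    """Fraction of risky choices between a sure 0.5 and a 50/50 gamble on 0 or 1."""
    rng = random.Random(seed)
    q_safe, q_risky = 0.5, 0.5  # both options have the same expected value
    risky_count = 0
    for _ in range(n_trials):
        # Softmax over two options reduces to a logistic function.
        p_risky = 1.0 / (1.0 + math.exp(-beta * (q_risky - q_safe)))
        if rng.random() < p_risky:
            reward = 1.0 if rng.random() < 0.5 else 0.0
            q_risky = update(q_risky, reward, alpha, d)
            risky_count += 1
        else:
            q_safe = update(q_safe, 0.5, alpha, d)
    return risky_count / n_trials


print(simulate_risk_preference(d=0.5))   # surprise-preferring agent
print(simulate_risk_preference(d=-0.5))  # surprise-averse agent
```

Under this assumed rule the safe option produces no prediction error once learned, so only the risky option's value is shifted by `d`: its expected fixed point is (1 + d) / 2, above the true mean for positive `d` and below it for negative `d`.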
Conclusions: These findings indicate that for NTP people, adolescence is a developmental period of heightened risk preference, possibly due to lower surprise aversion. Conversely, for AUT people, who show the opposite developmental change in risk preference, adolescence could be a developmental period of risk avoidance because of low surprise preference.
About the journal:
Molecular Autism is a peer-reviewed, open access journal that publishes high-quality basic, translational and clinical research that has relevance to the etiology, pathobiology, or treatment of autism and related neurodevelopmental conditions. Research that includes integration across levels is encouraged. Molecular Autism publishes empirical studies, reviews, and brief communications.