Konstantinos Mitsopoulos, Lawrence Baker, Christian Lebiere, Peter Pirolli, Mark Orr, Raffaele Vardavas
{"title":"基于流行病学代理的模拟中认知似是而非的强化学习。","authors":"Konstantinos Mitsopoulos, Lawrence Baker, Christian Lebiere, Peter Pirolli, Mark Orr, Raffaele Vardavas","doi":"10.3389/fepid.2025.1563731","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Human behavior shapes the transmission of infectious diseases and determines the effectiveness of public health measures designed to mitigate transmission. To accurately reflect these dynamics, epidemiological simulation models should endogenously account for both disease transmission and behavioral dynamics. Traditional agent-based models (ABMs) often rely on simplified rules to represent behavior, limiting their ability to capture complex decision-making processes and cognitive dynamics.</p><p><strong>Methods: </strong>Reinforcement Learning (RL) provides a framework for modeling how agents adapt their behavior based on experience and feedback. However, implementing cognitively plausible RL in ABMs is challenging due to high-dimensional state spaces. We propose a novel framework based on Adaptive Control of Thought-Rational (ACT-R) principles and Instance-Based Learning (IBL), which enables agents to dynamically adapt their behavior using nonparametric RL without requiring extensive training on large datasets.</p><p><strong>Results: </strong>To demonstrate this framework, we model mask-wearing behavior during the COVID-19 pandemic, highlighting how individual decisions and social network structures influence disease transmission. Simulations reveal that local social cues drive tightly clustered masking behavior (slope = 0.54, Pearson <i>r</i> = 0.76), while reliance on global cues alone produces weakly disassortative patterns (slope = 0.05, Pearson <i>r</i> = 0.09), underscoring the role of local information in coordinating public health compliance.</p><p><strong>Discussion: </strong>Our results show that this framework provides a scalable and cognitively interpretable approach to integrating adaptive decision-making into epidemiological simulations, offering actionable insights for public health policy.</p>","PeriodicalId":73083,"journal":{"name":"Frontiers in epidemiology","volume":"5 ","pages":"1563731"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12336203/pdf/","citationCount":"0","resultStr":"{\"title\":\"Cognitively-plausible reinforcement learning in epidemiological agent-based simulations.\",\"authors\":\"Konstantinos Mitsopoulos, Lawrence Baker, Christian Lebiere, Peter Pirolli, Mark Orr, Raffaele Vardavas\",\"doi\":\"10.3389/fepid.2025.1563731\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Human behavior shapes the transmission of infectious diseases and determines the effectiveness of public health measures designed to mitigate transmission. To accurately reflect these dynamics, epidemiological simulation models should endogenously account for both disease transmission and behavioral dynamics. Traditional agent-based models (ABMs) often rely on simplified rules to represent behavior, limiting their ability to capture complex decision-making processes and cognitive dynamics.</p><p><strong>Methods: </strong>Reinforcement Learning (RL) provides a framework for modeling how agents adapt their behavior based on experience and feedback. However, implementing cognitively plausible RL in ABMs is challenging due to high-dimensional state spaces. We propose a novel framework based on Adaptive Control of Thought-Rational (ACT-R) principles and Instance-Based Learning (IBL), which enables agents to dynamically adapt their behavior using nonparametric RL without requiring extensive training on large datasets.</p><p><strong>Results: </strong>To demonstrate this framework, we model mask-wearing behavior during the COVID-19 pandemic, highlighting how individual decisions and social network structures influence disease transmission. Simulations reveal that local social cues drive tightly clustered masking behavior (slope = 0.54, Pearson <i>r</i> = 0.76), while reliance on global cues alone produces weakly disassortative patterns (slope = 0.05, Pearson <i>r</i> = 0.09), underscoring the role of local information in coordinating public health compliance.</p><p><strong>Discussion: </strong>Our results show that this framework provides a scalable and cognitively interpretable approach to integrating adaptive decision-making into epidemiological simulations, offering actionable insights for public health policy.</p>\",\"PeriodicalId\":73083,\"journal\":{\"name\":\"Frontiers in epidemiology\",\"volume\":\"5 \",\"pages\":\"1563731\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-07-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12336203/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in epidemiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fepid.2025.1563731\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fepid.2025.1563731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
导言:人类行为决定了传染病的传播,并决定了旨在减轻传播的公共卫生措施的有效性。为了准确地反映这些动态,流行病学模拟模型应该内生地考虑疾病传播和行为动态。传统的基于主体的模型(ABMs)通常依赖于简化的规则来表示行为,限制了它们捕捉复杂决策过程和认知动态的能力。方法:强化学习(RL)为智能体如何根据经验和反馈调整其行为提供了一个建模框架。然而,由于高维状态空间,在ABMs中实现认知上合理的强化学习是具有挑战性的。我们提出了一个基于思维理性自适应控制(ACT-R)原则和基于实例的学习(IBL)的新框架,该框架使智能体能够使用非参数强化学习动态适应其行为,而无需在大数据集上进行大量训练。结果:为了证明这一框架,我们对COVID-19大流行期间的戴口罩行为进行了建模,突出了个人决策和社会网络结构如何影响疾病传播。模拟结果显示,局部社会线索驱动紧密聚集的掩蔽行为(斜率= 0.54,Pearson r = 0.76),而仅依赖全局线索产生弱失配模式(斜率= 0.05,Pearson r = 0.09),强调了局部信息在协调公共卫生合规中的作用。讨论:我们的研究结果表明,该框架提供了一种可扩展和认知解释的方法,将适应性决策整合到流行病学模拟中,为公共卫生政策提供可操作的见解。
Cognitively-plausible reinforcement learning in epidemiological agent-based simulations.
Introduction: Human behavior shapes the transmission of infectious diseases and determines the effectiveness of public health measures designed to mitigate transmission. To accurately reflect these dynamics, epidemiological simulation models should endogenously account for both disease transmission and behavioral dynamics. Traditional agent-based models (ABMs) often rely on simplified rules to represent behavior, limiting their ability to capture complex decision-making processes and cognitive dynamics.
Methods: Reinforcement Learning (RL) provides a framework for modeling how agents adapt their behavior based on experience and feedback. However, implementing cognitively plausible RL in ABMs is challenging due to high-dimensional state spaces. We propose a novel framework based on Adaptive Control of Thought-Rational (ACT-R) principles and Instance-Based Learning (IBL), which enables agents to dynamically adapt their behavior using nonparametric RL without requiring extensive training on large datasets.
Results: To demonstrate this framework, we model mask-wearing behavior during the COVID-19 pandemic, highlighting how individual decisions and social network structures influence disease transmission. Simulations reveal that local social cues drive tightly clustered masking behavior (slope = 0.54, Pearson r = 0.76), while reliance on global cues alone produces weakly disassortative patterns (slope = 0.05, Pearson r = 0.09), underscoring the role of local information in coordinating public health compliance.
Discussion: Our results show that this framework provides a scalable and cognitively interpretable approach to integrating adaptive decision-making into epidemiological simulations, offering actionable insights for public health policy.