Optimizing spatiotemporal nonpharmaceutical interventions for influenza: An adaptive reinforcement learning approach for regional heterogeneity

IF 2.5 3区 医学 Q1 Medicine
Yuxing Tian , Xin Li , Hualing Wang , Heng Yuan , Tao Zhang
{"title":"Optimizing spatiotemporal nonpharmaceutical interventions for influenza: An adaptive reinforcement learning approach for regional heterogeneity","authors":"Yuxing Tian ,&nbsp;Xin Li ,&nbsp;Hualing Wang ,&nbsp;Heng Yuan ,&nbsp;Tao Zhang","doi":"10.1016/j.idm.2025.10.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Influenza remains a significant global public health challenge because of its high transmissibility, widespread circulation, and considerable societal impact. Conventional threshold-based nonpharmaceutical interventions (NPIs) provide valuable frameworks for outbreak control; however, these standardized approaches may not fully account for important regional heterogeneity. It remains difficult to weigh regional characteristics and accurately balance infection control and socioeconomic costs.</div></div><div><h3>Methods</h3><div>We propose a susceptible-exposed-infectious-quarantined-removed compartmental model-dueling deep Q-network (SEIQR-Dueling DQN) framework tailored to plains, hilly, and plateau cities. By integrating climatic, demographic, and health care resource data, the model captures regional differences in transmission and recovery dynamics. A multidimensional state space and discrete intervention set allow for the adaptive optimization of NPI strategies across varying epidemic and resource conditions. Model parameters were estimated by sequential Bayesian optimization, and bootstrap resampling was used to quantify uncertainty. In addition, the performance of the SEIQR-Dueling DQN strategy was compared with that of the threshold-based strategy in terms of the reduction in cumulative infections, peak prevalence and length of intervention periods.</div></div><div><h3>Results</h3><div>The threshold-based intervention policy reduced cumulative infections by 3.05 %–3.67 % and peak incidence by 8.26 %–12.58 % but showed limited responsiveness to regional variation, often resulting in either under- or over-control. The SEIQR-Dueling DQN framework dynamically adjusted intervention timing and combinations on the basis of local demographic structures and epidemic trends and reduced cumulative infections by 5.87 %, 5.99 %, and 5.21 % in plains, hilly, and plateau cities, respectively, while achieving peak reductions of 34.92 %, 22.23 %, and 8.12 %, respectively, with a balanced consideration of socioeconomic impact. To assess generalizability, the trained model was applied to cities with differing transmission dynamics and demonstrated consistent performance across settings.</div></div><div><h3>Conclusion</h3><div>The SEIQR-Dueling DQN framework supports tailored interventions across regions and shows promise for broader application in the management of regional heterogeneity and future emerging infectious diseases.</div></div>","PeriodicalId":36831,"journal":{"name":"Infectious Disease Modelling","volume":"11 1","pages":"Pages 203-217"},"PeriodicalIF":2.5000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infectious Disease Modelling","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468042725001046","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Influenza remains a significant global public health challenge because of its high transmissibility, widespread circulation, and considerable societal impact. Conventional threshold-based nonpharmaceutical interventions (NPIs) provide valuable frameworks for outbreak control; however, these standardized approaches may not fully account for important regional heterogeneity. It remains difficult to weigh regional characteristics and accurately balance infection control and socioeconomic costs.

Methods

We propose a susceptible-exposed-infectious-quarantined-removed compartmental model-dueling deep Q-network (SEIQR-Dueling DQN) framework tailored to plains, hilly, and plateau cities. By integrating climatic, demographic, and health care resource data, the model captures regional differences in transmission and recovery dynamics. A multidimensional state space and discrete intervention set allow for the adaptive optimization of NPI strategies across varying epidemic and resource conditions. Model parameters were estimated by sequential Bayesian optimization, and bootstrap resampling was used to quantify uncertainty. In addition, the performance of the SEIQR-Dueling DQN strategy was compared with that of the threshold-based strategy in terms of the reduction in cumulative infections, peak prevalence and length of intervention periods.

Results

The threshold-based intervention policy reduced cumulative infections by 3.05 %–3.67 % and peak incidence by 8.26 %–12.58 % but showed limited responsiveness to regional variation, often resulting in either under- or over-control. The SEIQR-Dueling DQN framework dynamically adjusted intervention timing and combinations on the basis of local demographic structures and epidemic trends and reduced cumulative infections by 5.87 %, 5.99 %, and 5.21 % in plains, hilly, and plateau cities, respectively, while achieving peak reductions of 34.92 %, 22.23 %, and 8.12 %, respectively, with a balanced consideration of socioeconomic impact. To assess generalizability, the trained model was applied to cities with differing transmission dynamics and demonstrated consistent performance across settings.

Conclusion

The SEIQR-Dueling DQN framework supports tailored interventions across regions and shows promise for broader application in the management of regional heterogeneity and future emerging infectious diseases.

Abstract Image

优化流感的时空非药物干预:区域异质性的适应性强化学习方法
流感由于其高传播性、广泛传播和相当大的社会影响,仍然是一项重大的全球公共卫生挑战。传统的基于阈值的非药物干预措施(npi)为疫情控制提供了有价值的框架;然而,这些标准化的方法可能不能完全解释重要的区域异质性。衡量区域特征并准确平衡感染控制和社会经济成本仍然很困难。方法针对平原、丘陵和高原城市,提出了一种易感-暴露-感染-隔离-去除的分区模型-决斗深度q -网络(seiqr -决斗DQN)框架。通过整合气候、人口和卫生保健资源数据,该模型捕捉到了传播和恢复动态方面的区域差异。多维状态空间和离散干预集允许在不同的流行病和资源条件下自适应优化NPI策略。采用序列贝叶斯优化方法估计模型参数,采用自举重采样方法量化不确定性。此外,将SEIQR-Dueling DQN策略与基于阈值的策略在减少累积感染、峰值流行率和干预期长度方面的表现进行了比较。结果基于阈值的干预政策使累计感染率降低3.05% ~ 3.67%,峰值发病率降低8.26% ~ 12.58%,但对区域差异的响应性有限,往往导致控制不足或控制过度。SEIQR-Dueling DQN框架根据当地人口结构和流行趋势动态调整干预时间和组合,平原、丘陵和高原城市的累计感染率分别降低了5.87%、5.99%和5.21%,在平衡考虑社会经济影响的情况下,峰值感染率分别降低了34.92%、22.23%和8.12%。为了评估通用性,将训练好的模型应用于具有不同传输动态的城市,并在不同设置中证明了一致的性能。SEIQR-Dueling DQN框架支持跨地区的量身定制干预措施,并有望在区域异质性和未来新发传染病管理中得到更广泛的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Infectious Disease Modelling
Infectious Disease Modelling Mathematics-Applied Mathematics
CiteScore
17.00
自引率
3.40%
发文量
73
审稿时长
17 weeks
期刊介绍: Infectious Disease Modelling is an open access journal that undergoes peer-review. Its main objective is to facilitate research that combines mathematical modelling, retrieval and analysis of infection disease data, and public health decision support. The journal actively encourages original research that improves this interface, as well as review articles that highlight innovative methodologies relevant to data collection, informatics, and policy making in the field of public health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信