Yuxing Tian , Xin Li , Hualing Wang , Heng Yuan , Tao Zhang
{"title":"优化流感的时空非药物干预:区域异质性的适应性强化学习方法","authors":"Yuxing Tian , Xin Li , Hualing Wang , Heng Yuan , Tao Zhang","doi":"10.1016/j.idm.2025.10.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Influenza remains a significant global public health challenge because of its high transmissibility, widespread circulation, and considerable societal impact. Conventional threshold-based nonpharmaceutical interventions (NPIs) provide valuable frameworks for outbreak control; however, these standardized approaches may not fully account for important regional heterogeneity. It remains difficult to weigh regional characteristics and accurately balance infection control and socioeconomic costs.</div></div><div><h3>Methods</h3><div>We propose a susceptible-exposed-infectious-quarantined-removed compartmental model-dueling deep Q-network (SEIQR-Dueling DQN) framework tailored to plains, hilly, and plateau cities. By integrating climatic, demographic, and health care resource data, the model captures regional differences in transmission and recovery dynamics. A multidimensional state space and discrete intervention set allow for the adaptive optimization of NPI strategies across varying epidemic and resource conditions. Model parameters were estimated by sequential Bayesian optimization, and bootstrap resampling was used to quantify uncertainty. In addition, the performance of the SEIQR-Dueling DQN strategy was compared with that of the threshold-based strategy in terms of the reduction in cumulative infections, peak prevalence and length of intervention periods.</div></div><div><h3>Results</h3><div>The threshold-based intervention policy reduced cumulative infections by 3.05 %–3.67 % and peak incidence by 8.26 %–12.58 % but showed limited responsiveness to regional variation, often resulting in either under- or over-control. The SEIQR-Dueling DQN framework dynamically adjusted intervention timing and combinations on the basis of local demographic structures and epidemic trends and reduced cumulative infections by 5.87 %, 5.99 %, and 5.21 % in plains, hilly, and plateau cities, respectively, while achieving peak reductions of 34.92 %, 22.23 %, and 8.12 %, respectively, with a balanced consideration of socioeconomic impact. To assess generalizability, the trained model was applied to cities with differing transmission dynamics and demonstrated consistent performance across settings.</div></div><div><h3>Conclusion</h3><div>The SEIQR-Dueling DQN framework supports tailored interventions across regions and shows promise for broader application in the management of regional heterogeneity and future emerging infectious diseases.</div></div>","PeriodicalId":36831,"journal":{"name":"Infectious Disease Modelling","volume":"11 1","pages":"Pages 203-217"},"PeriodicalIF":2.5000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing spatiotemporal nonpharmaceutical interventions for influenza: An adaptive reinforcement learning approach for regional heterogeneity\",\"authors\":\"Yuxing Tian , Xin Li , Hualing Wang , Heng Yuan , Tao Zhang\",\"doi\":\"10.1016/j.idm.2025.10.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Influenza remains a significant global public health challenge because of its high transmissibility, widespread circulation, and considerable societal impact. Conventional threshold-based nonpharmaceutical interventions (NPIs) provide valuable frameworks for outbreak control; however, these standardized approaches may not fully account for important regional heterogeneity. It remains difficult to weigh regional characteristics and accurately balance infection control and socioeconomic costs.</div></div><div><h3>Methods</h3><div>We propose a susceptible-exposed-infectious-quarantined-removed compartmental model-dueling deep Q-network (SEIQR-Dueling DQN) framework tailored to plains, hilly, and plateau cities. By integrating climatic, demographic, and health care resource data, the model captures regional differences in transmission and recovery dynamics. A multidimensional state space and discrete intervention set allow for the adaptive optimization of NPI strategies across varying epidemic and resource conditions. Model parameters were estimated by sequential Bayesian optimization, and bootstrap resampling was used to quantify uncertainty. In addition, the performance of the SEIQR-Dueling DQN strategy was compared with that of the threshold-based strategy in terms of the reduction in cumulative infections, peak prevalence and length of intervention periods.</div></div><div><h3>Results</h3><div>The threshold-based intervention policy reduced cumulative infections by 3.05 %–3.67 % and peak incidence by 8.26 %–12.58 % but showed limited responsiveness to regional variation, often resulting in either under- or over-control. The SEIQR-Dueling DQN framework dynamically adjusted intervention timing and combinations on the basis of local demographic structures and epidemic trends and reduced cumulative infections by 5.87 %, 5.99 %, and 5.21 % in plains, hilly, and plateau cities, respectively, while achieving peak reductions of 34.92 %, 22.23 %, and 8.12 %, respectively, with a balanced consideration of socioeconomic impact. To assess generalizability, the trained model was applied to cities with differing transmission dynamics and demonstrated consistent performance across settings.</div></div><div><h3>Conclusion</h3><div>The SEIQR-Dueling DQN framework supports tailored interventions across regions and shows promise for broader application in the management of regional heterogeneity and future emerging infectious diseases.</div></div>\",\"PeriodicalId\":36831,\"journal\":{\"name\":\"Infectious Disease Modelling\",\"volume\":\"11 1\",\"pages\":\"Pages 203-217\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Infectious Disease Modelling\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2468042725001046\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infectious Disease Modelling","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468042725001046","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
Optimizing spatiotemporal nonpharmaceutical interventions for influenza: An adaptive reinforcement learning approach for regional heterogeneity
Background
Influenza remains a significant global public health challenge because of its high transmissibility, widespread circulation, and considerable societal impact. Conventional threshold-based nonpharmaceutical interventions (NPIs) provide valuable frameworks for outbreak control; however, these standardized approaches may not fully account for important regional heterogeneity. It remains difficult to weigh regional characteristics and accurately balance infection control and socioeconomic costs.
Methods
We propose a susceptible-exposed-infectious-quarantined-removed compartmental model-dueling deep Q-network (SEIQR-Dueling DQN) framework tailored to plains, hilly, and plateau cities. By integrating climatic, demographic, and health care resource data, the model captures regional differences in transmission and recovery dynamics. A multidimensional state space and discrete intervention set allow for the adaptive optimization of NPI strategies across varying epidemic and resource conditions. Model parameters were estimated by sequential Bayesian optimization, and bootstrap resampling was used to quantify uncertainty. In addition, the performance of the SEIQR-Dueling DQN strategy was compared with that of the threshold-based strategy in terms of the reduction in cumulative infections, peak prevalence and length of intervention periods.
Results
The threshold-based intervention policy reduced cumulative infections by 3.05 %–3.67 % and peak incidence by 8.26 %–12.58 % but showed limited responsiveness to regional variation, often resulting in either under- or over-control. The SEIQR-Dueling DQN framework dynamically adjusted intervention timing and combinations on the basis of local demographic structures and epidemic trends and reduced cumulative infections by 5.87 %, 5.99 %, and 5.21 % in plains, hilly, and plateau cities, respectively, while achieving peak reductions of 34.92 %, 22.23 %, and 8.12 %, respectively, with a balanced consideration of socioeconomic impact. To assess generalizability, the trained model was applied to cities with differing transmission dynamics and demonstrated consistent performance across settings.
Conclusion
The SEIQR-Dueling DQN framework supports tailored interventions across regions and shows promise for broader application in the management of regional heterogeneity and future emerging infectious diseases.
期刊介绍:
Infectious Disease Modelling is an open access journal that undergoes peer-review. Its main objective is to facilitate research that combines mathematical modelling, retrieval and analysis of infection disease data, and public health decision support. The journal actively encourages original research that improves this interface, as well as review articles that highlight innovative methodologies relevant to data collection, informatics, and policy making in the field of public health.