COVID-19大流行后缓解呼吸道感染爆发的最佳适应性非药物干预措施:中国香港的深度强化学习研究

Yao Yao, Hanchu Zhou, Zhidong Cao, D. Zeng, Qingpeng Zhang
{"title":"COVID-19大流行后缓解呼吸道感染爆发的最佳适应性非药物干预措施:中国香港的深度强化学习研究","authors":"Yao Yao, Hanchu Zhou, Zhidong Cao, D. Zeng, Qingpeng Zhang","doi":"10.2139/ssrn.4243792","DOIUrl":null,"url":null,"abstract":"BACKGROUND\nLong-lasting nonpharmaceutical interventions (NPIs) suppressed the infection of COVID-19 but came at a substantial economic cost and the elevated risk of the outbreak of respiratory infectious diseases (RIDs) following the pandemic. Policymakers need data-driven evidence to guide the relaxation with adaptive NPIs that consider the risk of both COVID-19 and other RIDs outbreaks, as well as the available healthcare resources.\n\n\nMETHODS\nCombining the COVID-19 data of the sixth wave in Hong Kong between May 31, 2022 and August 28, 2022, 6-year epidemic data of other RIDs (2014-2019), and the healthcare resources data, we constructed compartment models to predict the epidemic curves of RIDs after the COVID-19-targeted NPIs. A deep reinforcement learning (DRL) model was developed to learn the optimal adaptive NPIs strategies to mitigate the outbreak of RIDs after COVID-19-targeted NPIs are lifted with minimal health and economic cost. The performance was validated by simulations of 1000 days starting August 29, 2022. We also extended the model to Beijing context.\n\n\nFINDINGS\nWithout any NPIs, Hong Kong experienced a major COVID-19 resurgence far exceeding the hospital bed capacity. Simulation results showed that the proposed DRL-based adaptive NPIs successfully suppressed the outbreak of COVID-19 and other RIDs to lower than capacity. DRL carefully controlled the epidemic curve to be close to the full capacity so that herd immunity can be reached in a relatively short period with minimal cost. DRL derived more stringent adaptive NPIs in Beijing.\n\n\nINTERPRETATION\nDRL is a feasible method to identify the optimal adaptive NPIs that lead to minimal health and economic cost by facilitating gradual herd immunity of COVID-19 and mitigating the other RIDs outbreaks without overwhelming the hospitals. The insights can be extended to other countries/regions.","PeriodicalId":236137,"journal":{"name":"Journal of the American Medical Informatics Association : JAMIA","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal adaptive nonpharmaceutical interventions to mitigate the outbreak of respiratory infections following the COVID-19 pandemic: a deep reinforcement learning study in Hong Kong, China\",\"authors\":\"Yao Yao, Hanchu Zhou, Zhidong Cao, D. Zeng, Qingpeng Zhang\",\"doi\":\"10.2139/ssrn.4243792\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BACKGROUND\\nLong-lasting nonpharmaceutical interventions (NPIs) suppressed the infection of COVID-19 but came at a substantial economic cost and the elevated risk of the outbreak of respiratory infectious diseases (RIDs) following the pandemic. Policymakers need data-driven evidence to guide the relaxation with adaptive NPIs that consider the risk of both COVID-19 and other RIDs outbreaks, as well as the available healthcare resources.\\n\\n\\nMETHODS\\nCombining the COVID-19 data of the sixth wave in Hong Kong between May 31, 2022 and August 28, 2022, 6-year epidemic data of other RIDs (2014-2019), and the healthcare resources data, we constructed compartment models to predict the epidemic curves of RIDs after the COVID-19-targeted NPIs. A deep reinforcement learning (DRL) model was developed to learn the optimal adaptive NPIs strategies to mitigate the outbreak of RIDs after COVID-19-targeted NPIs are lifted with minimal health and economic cost. The performance was validated by simulations of 1000 days starting August 29, 2022. We also extended the model to Beijing context.\\n\\n\\nFINDINGS\\nWithout any NPIs, Hong Kong experienced a major COVID-19 resurgence far exceeding the hospital bed capacity. Simulation results showed that the proposed DRL-based adaptive NPIs successfully suppressed the outbreak of COVID-19 and other RIDs to lower than capacity. DRL carefully controlled the epidemic curve to be close to the full capacity so that herd immunity can be reached in a relatively short period with minimal cost. DRL derived more stringent adaptive NPIs in Beijing.\\n\\n\\nINTERPRETATION\\nDRL is a feasible method to identify the optimal adaptive NPIs that lead to minimal health and economic cost by facilitating gradual herd immunity of COVID-19 and mitigating the other RIDs outbreaks without overwhelming the hospitals. The insights can be extended to other countries/regions.\",\"PeriodicalId\":236137,\"journal\":{\"name\":\"Journal of the American Medical Informatics Association : JAMIA\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the American Medical Informatics Association : JAMIA\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.4243792\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association : JAMIA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.4243792","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:长期非药物干预措施(npi)抑制了COVID-19的感染,但付出了巨大的经济代价,并增加了大流行后呼吸道传染病(rid)爆发的风险。政策制定者需要数据驱动的证据,以自适应的npi来指导放松,这些npi要考虑到COVID-19和其他传染病爆发的风险,以及可用的医疗资源。方法结合2022年5月31日至2022年8月28日香港第六波新冠肺炎疫情数据、其他地区6年(2014-2019年)疫情数据以及医疗资源数据,构建隔室模型,预测新冠肺炎靶向性npi实施后香港地区新冠肺炎疫情曲线。建立了一个深度强化学习(DRL)模型,在以最小的健康和经济成本解除针对covid -19的npi后,学习最优自适应npi策略,以减轻rid的爆发。从2022年8月29日开始,通过1000天的模拟验证了该性能。我们还将该模型扩展到北京。调查结果在没有任何非营利机构的情况下,香港经历了COVID-19的大规模复苏,远远超过了医院病床的容量。仿真结果表明,基于drl的自适应npi成功地抑制了COVID-19和其他rid的爆发,使其低于容量。DRL仔细控制流行曲线,使其接近满负荷,以便在相对较短的时间内以最小的成本实现群体免疫。DRL在北京得到了更严格的适应性npi。drl是一种可行的方法,可以通过促进COVID-19的逐步群体免疫和减轻其他rid暴发而不使医院超负荷,从而确定导致最小健康和经济成本的最佳适应性npi。这些见解可以扩展到其他国家/地区。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Optimal adaptive nonpharmaceutical interventions to mitigate the outbreak of respiratory infections following the COVID-19 pandemic: a deep reinforcement learning study in Hong Kong, China
BACKGROUND Long-lasting nonpharmaceutical interventions (NPIs) suppressed the infection of COVID-19 but came at a substantial economic cost and the elevated risk of the outbreak of respiratory infectious diseases (RIDs) following the pandemic. Policymakers need data-driven evidence to guide the relaxation with adaptive NPIs that consider the risk of both COVID-19 and other RIDs outbreaks, as well as the available healthcare resources. METHODS Combining the COVID-19 data of the sixth wave in Hong Kong between May 31, 2022 and August 28, 2022, 6-year epidemic data of other RIDs (2014-2019), and the healthcare resources data, we constructed compartment models to predict the epidemic curves of RIDs after the COVID-19-targeted NPIs. A deep reinforcement learning (DRL) model was developed to learn the optimal adaptive NPIs strategies to mitigate the outbreak of RIDs after COVID-19-targeted NPIs are lifted with minimal health and economic cost. The performance was validated by simulations of 1000 days starting August 29, 2022. We also extended the model to Beijing context. FINDINGS Without any NPIs, Hong Kong experienced a major COVID-19 resurgence far exceeding the hospital bed capacity. Simulation results showed that the proposed DRL-based adaptive NPIs successfully suppressed the outbreak of COVID-19 and other RIDs to lower than capacity. DRL carefully controlled the epidemic curve to be close to the full capacity so that herd immunity can be reached in a relatively short period with minimal cost. DRL derived more stringent adaptive NPIs in Beijing. INTERPRETATION DRL is a feasible method to identify the optimal adaptive NPIs that lead to minimal health and economic cost by facilitating gradual herd immunity of COVID-19 and mitigating the other RIDs outbreaks without overwhelming the hospitals. The insights can be extended to other countries/regions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信