A data science pipeline applied to Australia's 2022 COVID-19 Omicron waves

IF 8.8 3区 医学 Q1 Medicine
James M. Trauer , Angus E. Hughes , David S. Shipman , Michael T. Meehan , Alec S. Henderson , Emma S. McBryde , Romain Ragonnet
{"title":"A data science pipeline applied to Australia's 2022 COVID-19 Omicron waves","authors":"James M. Trauer ,&nbsp;Angus E. Hughes ,&nbsp;David S. Shipman ,&nbsp;Michael T. Meehan ,&nbsp;Alec S. Henderson ,&nbsp;Emma S. McBryde ,&nbsp;Romain Ragonnet","doi":"10.1016/j.idm.2024.08.005","DOIUrl":null,"url":null,"abstract":"<div><p>The field of software engineering is advancing at astonishing speed, with packages now available to support many stages of data science pipelines. These packages can support infectious disease modelling to be more robust, efficient and transparent, which has been particularly important during the COVID-19 pandemic. We developed a package for the construction of infectious disease models, integrated it with several open-source libraries and applied this composite pipeline to multiple data sources that provided insights into Australia's 2022 COVID-19 epidemic. We aimed to identify the key processes relevant to COVID-19 transmission dynamics and thereby develop a model that could quantify relevant epidemiological parameters.</p><p>The pipeline's advantages include markedly increased speed, an expressive application programming interface, the transparency of open-source development, easy access to a broad range of calibration and optimisation tools and consideration of the full workflow from input manipulation through to algorithmic generation of the publication materials. Extending the base model to include mobility effects slightly improved model fit to data, with this approach selected as the model configuration for further epidemiological inference. Under our assumption of widespread immunity against severe outcomes from recent vaccination, incorporating an additional effect of the main vaccination programs rolled out during 2022 on transmission did not further improve model fit. Our simulations suggested that one in every two to six COVID-19 episodes were detected, subsequently emerging Omicron subvariants escaped 30–60% of recently acquired natural immunity and that natural immunity lasted only one to eight months on average. We documented our analyses algorithmically and present our methods in conjunction with interactive online code notebooks and plots.</p><p>We demonstrate the feasibility of integrating a flexible domain-specific syntax library with state-of-the-art packages in high performance computing, calibration, optimisation and visualisation to create an end-to-end pipeline for infectious disease modelling. We used the resulting platform to demonstrate key epidemiological characteristics of the transition from the emergency to the endemic phase of the COVID-19 pandemic.</p></div>","PeriodicalId":36831,"journal":{"name":"Infectious Disease Modelling","volume":"10 1","pages":"Pages 99-109"},"PeriodicalIF":8.8000,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2468042724000988/pdfft?md5=4f608618181fc13e1498c89ce78afb35&pid=1-s2.0-S2468042724000988-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infectious Disease Modelling","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468042724000988","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

The field of software engineering is advancing at astonishing speed, with packages now available to support many stages of data science pipelines. These packages can support infectious disease modelling to be more robust, efficient and transparent, which has been particularly important during the COVID-19 pandemic. We developed a package for the construction of infectious disease models, integrated it with several open-source libraries and applied this composite pipeline to multiple data sources that provided insights into Australia's 2022 COVID-19 epidemic. We aimed to identify the key processes relevant to COVID-19 transmission dynamics and thereby develop a model that could quantify relevant epidemiological parameters.

The pipeline's advantages include markedly increased speed, an expressive application programming interface, the transparency of open-source development, easy access to a broad range of calibration and optimisation tools and consideration of the full workflow from input manipulation through to algorithmic generation of the publication materials. Extending the base model to include mobility effects slightly improved model fit to data, with this approach selected as the model configuration for further epidemiological inference. Under our assumption of widespread immunity against severe outcomes from recent vaccination, incorporating an additional effect of the main vaccination programs rolled out during 2022 on transmission did not further improve model fit. Our simulations suggested that one in every two to six COVID-19 episodes were detected, subsequently emerging Omicron subvariants escaped 30–60% of recently acquired natural immunity and that natural immunity lasted only one to eight months on average. We documented our analyses algorithmically and present our methods in conjunction with interactive online code notebooks and plots.

We demonstrate the feasibility of integrating a flexible domain-specific syntax library with state-of-the-art packages in high performance computing, calibration, optimisation and visualisation to create an end-to-end pipeline for infectious disease modelling. We used the resulting platform to demonstrate key epidemiological characteristics of the transition from the emergency to the endemic phase of the COVID-19 pandemic.

应用于澳大利亚 2022 年 COVID-19 Omicron 波的数据科学管道
软件工程领域正以惊人的速度向前发展,现在已有软件包可支持数据科学管道的许多阶段。这些软件包可以支持传染病建模,使其更加稳健、高效和透明,这在 COVID-19 大流行期间尤为重要。我们开发了一个用于构建传染病模型的软件包,将其与多个开源库集成,并将这一复合管道应用于多个数据源,从而深入了解澳大利亚 2022 年的 COVID-19 疫情。我们的目标是确定与 COVID-19 传播动态相关的关键过程,从而开发出一种能够量化相关流行病学参数的模型。该管道的优势包括:速度明显提高、应用编程接口表现力强、开源开发透明、可轻松访问广泛的校准和优化工具,以及考虑了从输入操作到算法生成出版材料的整个工作流程。对基础模型进行扩展以包括流动效应后,模型与数据的拟合程度略有提高,因此我们选择了这种方法作为进一步流行病学推断的模型配置。我们假定近期接种的疫苗会对严重后果产生广泛的免疫力,因此将 2022 年期间推出的主要疫苗接种计划对传播的额外影响纳入模型并不会进一步改善模型的拟合度。我们的模拟结果表明,每两到六次 COVID-19 事件中就有一次被检测到,随后出现的 Omicron 亚变异体逃脱了 30-60% 近期获得的自然免疫,自然免疫平均只持续一到八个月。我们展示了将灵活的特定领域语法库与最先进的高性能计算、校准、优化和可视化软件包整合在一起以创建端到端传染病建模管道的可行性。我们利用该平台展示了 COVID-19 大流行从紧急阶段向流行阶段过渡的关键流行病学特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Infectious Disease Modelling
Infectious Disease Modelling Mathematics-Applied Mathematics
CiteScore
17.00
自引率
3.40%
发文量
73
审稿时长
17 weeks
期刊介绍: Infectious Disease Modelling is an open access journal that undergoes peer-review. Its main objective is to facilitate research that combines mathematical modelling, retrieval and analysis of infection disease data, and public health decision support. The journal actively encourages original research that improves this interface, as well as review articles that highlight innovative methodologies relevant to data collection, informatics, and policy making in the field of public health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信