Isaac Ogi-Gittins, Nicholas Steyn, Jonathan Polonsky, William S Hart, Mory Keita, Steve Ahuka-Mundeke, Edward M Hill, Robin N Thompson
{"title":"Simulation-based inference of the time-dependent reproduction number from temporally aggregated and under-reported disease incidence time series data.","authors":"Isaac Ogi-Gittins, Nicholas Steyn, Jonathan Polonsky, William S Hart, Mory Keita, Steve Ahuka-Mundeke, Edward M Hill, Robin N Thompson","doi":"10.1098/rsta.2024.0412","DOIUrl":null,"url":null,"abstract":"<p><p>During infectious disease outbreaks, the time-dependent reproduction number ([Formula: see text]) can be estimated to monitor pathogen transmission. In previous work, we developed a simulation-based method for estimating [Formula: see text] from temporally aggregated disease incidence data (e.g. weekly case reports). While that approach is straightforward to use, it assumes implicitly that all cases are reported and the computation can be slow when applied to large datasets. In this article, we extend our previous approach and develop a computationally efficient simulation-based method for estimating [Formula: see text] in real-time accounting for both temporal aggregation of incidence data and under-reporting (with a fixed reporting probability per case). Using simulated data, we show that failing to consider stochastic under-reporting can lead to inappropriately precise estimates, including scenarios in which the true [Formula: see text] value lies outside inferred credible intervals more often than expected. We then apply our approach to data from the 2018 to 2020 Ebola outbreak in the Democratic Republic of the Congo (DRC), again exploring the effects of case under-reporting. Finally, we show how our method can be extended to account for temporal variations in reporting. Given information about the level of case reporting, our framework can be used to estimate [Formula: see text] during future outbreaks with under-reported and temporally aggregated case data.This article is part of the theme issue 'Uncertainty quantification for healthcare and biological systems (Part 2)'.</p>","PeriodicalId":19879,"journal":{"name":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","volume":"383 2293","pages":"20240412"},"PeriodicalIF":4.3000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsta.2024.0412","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
During infectious disease outbreaks, the time-dependent reproduction number ([Formula: see text]) can be estimated to monitor pathogen transmission. In previous work, we developed a simulation-based method for estimating [Formula: see text] from temporally aggregated disease incidence data (e.g. weekly case reports). While that approach is straightforward to use, it assumes implicitly that all cases are reported and the computation can be slow when applied to large datasets. In this article, we extend our previous approach and develop a computationally efficient simulation-based method for estimating [Formula: see text] in real-time accounting for both temporal aggregation of incidence data and under-reporting (with a fixed reporting probability per case). Using simulated data, we show that failing to consider stochastic under-reporting can lead to inappropriately precise estimates, including scenarios in which the true [Formula: see text] value lies outside inferred credible intervals more often than expected. We then apply our approach to data from the 2018 to 2020 Ebola outbreak in the Democratic Republic of the Congo (DRC), again exploring the effects of case under-reporting. Finally, we show how our method can be extended to account for temporal variations in reporting. Given information about the level of case reporting, our framework can be used to estimate [Formula: see text] during future outbreaks with under-reported and temporally aggregated case data.This article is part of the theme issue 'Uncertainty quantification for healthcare and biological systems (Part 2)'.
期刊介绍:
Continuing its long history of influential scientific publishing, Philosophical Transactions A publishes high-quality theme issues on topics of current importance and general interest within the physical, mathematical and engineering sciences, guest-edited by leading authorities and comprising new research, reviews and opinions from prominent researchers.