Estimating the size of undetected cases of the COVID-19 outbreak in Europe: an upper bound estimator

Q3 Mathematics
Irene Rocchetti, D. Böhning, H. Holling, A. Maruotti
{"title":"Estimating the size of undetected cases of the COVID-19 outbreak in Europe: an upper bound estimator","authors":"Irene Rocchetti, D. Böhning, H. Holling, A. Maruotti","doi":"10.1515/em-2020-0024","DOIUrl":null,"url":null,"abstract":"Abstract Background While the number of detected COVID-19 infections are widely available, an understanding of the extent of undetected cases is urgently needed for an effective tackling of the pandemic. The aim of this work is to estimate the true number of COVID-19 (detected and undetected) infections in several European countries. The question being asked is: How many cases have actually occurred? Methods We propose an upper bound estimator under cumulative data distributions, in an open population, based on a day-wise estimator that allows for heterogeneity. The estimator is data-driven and can be easily computed from the distributions of daily cases and deaths. Uncertainty surrounding the estimates is obtained using bootstrap methods. Results We focus on the ratio of the total estimated cases to the observed cases at April 17th. Differences arise at the country level, and we get estimates ranging from the 3.93 times of Norway to the 7.94 times of France. Accurate estimates are obtained, as bootstrap-based intervals are rather narrow. Conclusions Many parametric or semi-parametric models have been developed to estimate the population size from aggregated counts leading to an approximation of the missed population and/or to the estimate of the threshold under which the number of missed people cannot fall (i.e. a lower bound). Here, we provide a methodological contribution introducing an upper bound estimator and provide reliable estimates on the dark number, i.e. how many undetected cases are going around for several European countries, where the epidemic spreads differently.","PeriodicalId":37999,"journal":{"name":"Epidemiologic Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epidemiologic Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/em-2020-0024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 12

Abstract

Abstract Background While the number of detected COVID-19 infections are widely available, an understanding of the extent of undetected cases is urgently needed for an effective tackling of the pandemic. The aim of this work is to estimate the true number of COVID-19 (detected and undetected) infections in several European countries. The question being asked is: How many cases have actually occurred? Methods We propose an upper bound estimator under cumulative data distributions, in an open population, based on a day-wise estimator that allows for heterogeneity. The estimator is data-driven and can be easily computed from the distributions of daily cases and deaths. Uncertainty surrounding the estimates is obtained using bootstrap methods. Results We focus on the ratio of the total estimated cases to the observed cases at April 17th. Differences arise at the country level, and we get estimates ranging from the 3.93 times of Norway to the 7.94 times of France. Accurate estimates are obtained, as bootstrap-based intervals are rather narrow. Conclusions Many parametric or semi-parametric models have been developed to estimate the population size from aggregated counts leading to an approximation of the missed population and/or to the estimate of the threshold under which the number of missed people cannot fall (i.e. a lower bound). Here, we provide a methodological contribution introducing an upper bound estimator and provide reliable estimates on the dark number, i.e. how many undetected cases are going around for several European countries, where the epidemic spreads differently.
估计欧洲未发现的COVID-19暴发病例的规模:上界估计值
抽象背景而发现COVID-19感染广泛可用的、未被发现的情况下的程度的理解是迫切需要一个有效的解决的大流行。这项工作的目的是估计几个欧洲国家COVID-19(已发现和未发现)感染的真实数量。问题是:实际发生了多少病例?方法:在开放人群中,基于允许异质性的日估计量,我们提出了累积数据分布下的上界估计量。估算器是数据驱动的,可以很容易地从每日病例和死亡的分布中计算出来。估计的不确定性是用自举法得到的。结果重点关注4月17日估计病例数与观察病例数之比。在国家层面上存在差异,我们得到的估计从挪威的3.93倍到法国的7.94倍不等。由于基于bootstrap的区间相当窄,得到了准确的估计。已经开发了许多参数或半参数模型来从汇总计数估计人口规模,从而近似估计错过的人口和/或估计错过的人数不能下降的阈值(即下界)。在这里,我们提供了方法上的贡献,引入了上界估计量,并提供了关于暗数字的可靠估计,即在几个流行病传播不同的欧洲国家有多少未被发现的病例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Epidemiologic Methods
Epidemiologic Methods Mathematics-Applied Mathematics
CiteScore
2.10
自引率
0.00%
发文量
7
期刊介绍: Epidemiologic Methods (EM) seeks contributions comparable to those of the leading epidemiologic journals, but also invites papers that may be more technical or of greater length than what has traditionally been allowed by journals in epidemiology. Applications and examples with real data to illustrate methodology are strongly encouraged but not required. Topics. genetic epidemiology, infectious disease, pharmaco-epidemiology, ecologic studies, environmental exposures, screening, surveillance, social networks, comparative effectiveness, statistical modeling, causal inference, measurement error, study design, meta-analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信