COVID-19 and Statistical Challenges

S. Kheiri
{"title":"COVID-19 and Statistical Challenges","authors":"S. Kheiri","doi":"10.34172/ijer.2022.01","DOIUrl":null,"url":null,"abstract":"Since the outbreak of COVID-19 in December 2019, there has been an explosion of statistics and information about the disease, the number of cases, the number of deaths, and the number of recoveries. During this period several statistical and mathematical models have been developed and used to predict the disease. Much of this information has been helpful and paved the way for disease control; however, inaccurate or ambiguous information has been published in some cases, which can briefly be divided into three main categories. The first category is related to the publication of official statistics by governmental centers in countries, which has faced many errors. Although some of these errors are unintentional due to the definition of the disease based on definitive polymerase chain reaction (PCR) testing, death within 28 days of infection, or due to the similarity of the disease outcomes with other diseases,1 in many cases, the statistics regarding the disease and its consequences have been presented by governments with a manipulation, mainly on the small number of patients with COVID-19.2 This issue caused a great deal of controversy among academic centers, and many of them attempted to explore the differences between the official statistics provided by countries and other types of data based on other sources. Fewer reports on the actual number of positive cases of the disease may lead to false optimism and thus negligence in dealing with COVID-19 and an increase in the number of patients. The second category contributes to incorrect or incomplete use of statistical indexes. During this period, some information has occasionally been published by some public media in which statistical indexes have not been used correctly and appropriately. For example, mentioning the percentage of increase in positive cases over a specified period of time without mentioning the base number or the percentage of deaths without mentioning the number, can provide incorrect information to the audience. The presentation of incomplete statistics and information is considered among the examples of lying with statistics, indicating that academic researchers need to tackle and challenge this disagreeable phenomenon.3 The third category belongs to the challenges of epidemiological modeling in COVID-19. With the advent of COVID-19, many models emerged to predict its incidence and consequences. Although many of the predictions were true, some of them were incorrect or inaccurate. The main reasons for the inaccuracy of these predictions were the consideration of incorrect or weak assumptions, the existence of incomplete data, the shortterm view, the use of point estimates instead of interval estimates, and the lack of a multidimensional view of the problem. Accordingly, considering the above-mentioned points and findings from the observed problems in predicting the course of the disease and the resulting mortality during the COVID-19 pandemic, one can hope for the ability to model the prediction of similar diseases in the future.4","PeriodicalId":73448,"journal":{"name":"International journal of epidemiologic research","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of epidemiologic research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34172/ijer.2022.01","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Since the outbreak of COVID-19 in December 2019, there has been an explosion of statistics and information about the disease, the number of cases, the number of deaths, and the number of recoveries. During this period several statistical and mathematical models have been developed and used to predict the disease. Much of this information has been helpful and paved the way for disease control; however, inaccurate or ambiguous information has been published in some cases, which can briefly be divided into three main categories. The first category is related to the publication of official statistics by governmental centers in countries, which has faced many errors. Although some of these errors are unintentional due to the definition of the disease based on definitive polymerase chain reaction (PCR) testing, death within 28 days of infection, or due to the similarity of the disease outcomes with other diseases,1 in many cases, the statistics regarding the disease and its consequences have been presented by governments with a manipulation, mainly on the small number of patients with COVID-19.2 This issue caused a great deal of controversy among academic centers, and many of them attempted to explore the differences between the official statistics provided by countries and other types of data based on other sources. Fewer reports on the actual number of positive cases of the disease may lead to false optimism and thus negligence in dealing with COVID-19 and an increase in the number of patients. The second category contributes to incorrect or incomplete use of statistical indexes. During this period, some information has occasionally been published by some public media in which statistical indexes have not been used correctly and appropriately. For example, mentioning the percentage of increase in positive cases over a specified period of time without mentioning the base number or the percentage of deaths without mentioning the number, can provide incorrect information to the audience. The presentation of incomplete statistics and information is considered among the examples of lying with statistics, indicating that academic researchers need to tackle and challenge this disagreeable phenomenon.3 The third category belongs to the challenges of epidemiological modeling in COVID-19. With the advent of COVID-19, many models emerged to predict its incidence and consequences. Although many of the predictions were true, some of them were incorrect or inaccurate. The main reasons for the inaccuracy of these predictions were the consideration of incorrect or weak assumptions, the existence of incomplete data, the shortterm view, the use of point estimates instead of interval estimates, and the lack of a multidimensional view of the problem. Accordingly, considering the above-mentioned points and findings from the observed problems in predicting the course of the disease and the resulting mortality during the COVID-19 pandemic, one can hope for the ability to model the prediction of similar diseases in the future.4
COVID-19和统计挑战
自2019年12月新冠肺炎爆发以来,有关该疾病、病例数、死亡人数和康复人数的统计数据和信息激增。在此期间,已经开发了几个统计和数学模型,并用于预测该疾病。这些信息中的大部分都很有帮助,为疾病控制铺平了道路;然而,在某些情况下,发布了不准确或模棱两可的信息,可以简单地分为三大类。第一类与各国政府中心发布的官方统计数据有关,这些数据存在许多错误。尽管这些错误中的一些是无意的,因为疾病的定义是基于明确的聚合酶链式反应(PCR)检测,感染后28天内死亡,或者由于疾病结果与其他疾病相似,1在许多情况下,有关疾病及其后果的统计数据是由政府篡改的,主要是关于少数新冠肺炎患者。2这个问题在学术中心之间引起了很大争议,他们中的许多人试图探讨各国提供的官方统计数据与基于其他来源的其他类型数据之间的差异。关于该疾病阳性病例实际数量的报告减少可能会导致错误的乐观情绪,从而忽视应对新冠肺炎和患者数量的增加。第二类导致统计指数的使用不正确或不完整。在此期间,一些公共媒体偶尔会发布一些信息,其中没有正确和适当地使用统计指数。例如,在不提及基数的情况下提及特定时间段内阳性病例的增加百分比,或在不提及数字的情况下提到死亡百分比,可能会向观众提供不正确的信息。不完全统计和信息的呈现被认为是对统计撒谎的例子之一,这表明学术研究人员需要应对和挑战这一令人不快的现象。3第三类属于新冠肺炎流行病学建模的挑战。随着新冠肺炎的出现,出现了许多预测其发病率和后果的模型。尽管许多预测是真实的,但其中一些预测是不正确或不准确的。这些预测不准确的主要原因是考虑了不正确或薄弱的假设、不完整数据的存在、短期观点、使用点估计而不是区间估计,以及缺乏对问题的多维看法。因此,考虑到上述要点和新冠肺炎大流行期间预测疾病过程和由此导致的死亡率方面观察到的问题的发现,人们可以希望未来有能力对类似疾病的预测进行建模。4
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
6 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信