{"title":"COVID-19和统计挑战","authors":"S. Kheiri","doi":"10.34172/ijer.2022.01","DOIUrl":null,"url":null,"abstract":"Since the outbreak of COVID-19 in December 2019, there has been an explosion of statistics and information about the disease, the number of cases, the number of deaths, and the number of recoveries. During this period several statistical and mathematical models have been developed and used to predict the disease. Much of this information has been helpful and paved the way for disease control; however, inaccurate or ambiguous information has been published in some cases, which can briefly be divided into three main categories. The first category is related to the publication of official statistics by governmental centers in countries, which has faced many errors. Although some of these errors are unintentional due to the definition of the disease based on definitive polymerase chain reaction (PCR) testing, death within 28 days of infection, or due to the similarity of the disease outcomes with other diseases,1 in many cases, the statistics regarding the disease and its consequences have been presented by governments with a manipulation, mainly on the small number of patients with COVID-19.2 This issue caused a great deal of controversy among academic centers, and many of them attempted to explore the differences between the official statistics provided by countries and other types of data based on other sources. Fewer reports on the actual number of positive cases of the disease may lead to false optimism and thus negligence in dealing with COVID-19 and an increase in the number of patients. The second category contributes to incorrect or incomplete use of statistical indexes. During this period, some information has occasionally been published by some public media in which statistical indexes have not been used correctly and appropriately. For example, mentioning the percentage of increase in positive cases over a specified period of time without mentioning the base number or the percentage of deaths without mentioning the number, can provide incorrect information to the audience. The presentation of incomplete statistics and information is considered among the examples of lying with statistics, indicating that academic researchers need to tackle and challenge this disagreeable phenomenon.3 The third category belongs to the challenges of epidemiological modeling in COVID-19. With the advent of COVID-19, many models emerged to predict its incidence and consequences. Although many of the predictions were true, some of them were incorrect or inaccurate. The main reasons for the inaccuracy of these predictions were the consideration of incorrect or weak assumptions, the existence of incomplete data, the shortterm view, the use of point estimates instead of interval estimates, and the lack of a multidimensional view of the problem. Accordingly, considering the above-mentioned points and findings from the observed problems in predicting the course of the disease and the resulting mortality during the COVID-19 pandemic, one can hope for the ability to model the prediction of similar diseases in the future.4","PeriodicalId":73448,"journal":{"name":"International journal of epidemiologic research","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"COVID-19 and Statistical Challenges\",\"authors\":\"S. Kheiri\",\"doi\":\"10.34172/ijer.2022.01\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Since the outbreak of COVID-19 in December 2019, there has been an explosion of statistics and information about the disease, the number of cases, the number of deaths, and the number of recoveries. During this period several statistical and mathematical models have been developed and used to predict the disease. Much of this information has been helpful and paved the way for disease control; however, inaccurate or ambiguous information has been published in some cases, which can briefly be divided into three main categories. The first category is related to the publication of official statistics by governmental centers in countries, which has faced many errors. Although some of these errors are unintentional due to the definition of the disease based on definitive polymerase chain reaction (PCR) testing, death within 28 days of infection, or due to the similarity of the disease outcomes with other diseases,1 in many cases, the statistics regarding the disease and its consequences have been presented by governments with a manipulation, mainly on the small number of patients with COVID-19.2 This issue caused a great deal of controversy among academic centers, and many of them attempted to explore the differences between the official statistics provided by countries and other types of data based on other sources. Fewer reports on the actual number of positive cases of the disease may lead to false optimism and thus negligence in dealing with COVID-19 and an increase in the number of patients. The second category contributes to incorrect or incomplete use of statistical indexes. During this period, some information has occasionally been published by some public media in which statistical indexes have not been used correctly and appropriately. For example, mentioning the percentage of increase in positive cases over a specified period of time without mentioning the base number or the percentage of deaths without mentioning the number, can provide incorrect information to the audience. The presentation of incomplete statistics and information is considered among the examples of lying with statistics, indicating that academic researchers need to tackle and challenge this disagreeable phenomenon.3 The third category belongs to the challenges of epidemiological modeling in COVID-19. With the advent of COVID-19, many models emerged to predict its incidence and consequences. Although many of the predictions were true, some of them were incorrect or inaccurate. The main reasons for the inaccuracy of these predictions were the consideration of incorrect or weak assumptions, the existence of incomplete data, the shortterm view, the use of point estimates instead of interval estimates, and the lack of a multidimensional view of the problem. Accordingly, considering the above-mentioned points and findings from the observed problems in predicting the course of the disease and the resulting mortality during the COVID-19 pandemic, one can hope for the ability to model the prediction of similar diseases in the future.4\",\"PeriodicalId\":73448,\"journal\":{\"name\":\"International journal of epidemiologic research\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of epidemiologic research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.34172/ijer.2022.01\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of epidemiologic research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34172/ijer.2022.01","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Since the outbreak of COVID-19 in December 2019, there has been an explosion of statistics and information about the disease, the number of cases, the number of deaths, and the number of recoveries. During this period several statistical and mathematical models have been developed and used to predict the disease. Much of this information has been helpful and paved the way for disease control; however, inaccurate or ambiguous information has been published in some cases, which can briefly be divided into three main categories. The first category is related to the publication of official statistics by governmental centers in countries, which has faced many errors. Although some of these errors are unintentional due to the definition of the disease based on definitive polymerase chain reaction (PCR) testing, death within 28 days of infection, or due to the similarity of the disease outcomes with other diseases,1 in many cases, the statistics regarding the disease and its consequences have been presented by governments with a manipulation, mainly on the small number of patients with COVID-19.2 This issue caused a great deal of controversy among academic centers, and many of them attempted to explore the differences between the official statistics provided by countries and other types of data based on other sources. Fewer reports on the actual number of positive cases of the disease may lead to false optimism and thus negligence in dealing with COVID-19 and an increase in the number of patients. The second category contributes to incorrect or incomplete use of statistical indexes. During this period, some information has occasionally been published by some public media in which statistical indexes have not been used correctly and appropriately. For example, mentioning the percentage of increase in positive cases over a specified period of time without mentioning the base number or the percentage of deaths without mentioning the number, can provide incorrect information to the audience. The presentation of incomplete statistics and information is considered among the examples of lying with statistics, indicating that academic researchers need to tackle and challenge this disagreeable phenomenon.3 The third category belongs to the challenges of epidemiological modeling in COVID-19. With the advent of COVID-19, many models emerged to predict its incidence and consequences. Although many of the predictions were true, some of them were incorrect or inaccurate. The main reasons for the inaccuracy of these predictions were the consideration of incorrect or weak assumptions, the existence of incomplete data, the shortterm view, the use of point estimates instead of interval estimates, and the lack of a multidimensional view of the problem. Accordingly, considering the above-mentioned points and findings from the observed problems in predicting the course of the disease and the resulting mortality during the COVID-19 pandemic, one can hope for the ability to model the prediction of similar diseases in the future.4