用于模拟巴西COVID-19病例总数的Morgan-Mercer-Flodin (MMF)模型残差的离群值和正态性检验

Bioremediation Science and Technology Research Pub Date : 2021-07-31 DOI:10.54987/bstr.v9i1.592

G. Uba, Nuhu Danlahi Zandam, A. Mansur, M. Shukor

{"title":"用于模拟巴西COVID-19病例总数的Morgan-Mercer-Flodin (MMF)模型残差的离群值和正态性检验","authors":"G. Uba, Nuhu Danlahi Zandam, A. Mansur, M. Shukor","doi":"10.54987/bstr.v9i1.592","DOIUrl":null,"url":null,"abstract":"Traditionally, testing for outliers is performed by first creating a null hypothesis, H0, indicating that the suspected results do not differ significantly from those of other members of the data set, and then rejecting it if the likelihood of getting the experimental results is extremely low (e.g., p=0.05). Similarly, if H0 can be rejected, the questionable findings may be discarded as outliers as well. If H0 is retained in the data set, it is important to keep the dubious findings in the data set. In general, in nonlinear regression, the residuals of the curve must be normally distributed before any test for the existence of outliers is performed. This is often accomplished through the use of normalcy tests such as the Kolmogorov-Smirnov, Wilks-Shapiro, D'Agostino-Pearson, and Grubb's tests, the latter of which checks for the presence of an outlier and is the subject of this study. Normality tests for residues used in general nonlinear regression revealed that the usage of the Morgan-Mercer-Flodin (MMF) Model used for Modelling the Total Number of COVID-19 Cases for Brazil was adequate due to lack of an outlier. The critical value of Z from statistical table for Grubbsâ€™ test for a single outlier using mean and SD was 0.114 (n=50). The Grubbs (Alpha = 0.05) g value was 3.597. Individual Z value indicates that the residual with a value of -3 (row 3) was far from the rest and is deemed a significant outlier (p < 0.05). This outlier was removed, and subsequent Grubbâ€™s test show the absence of other outliers. As the Grubbsâ€™ test require for the normality of the residuals, several normality tests (Kolmogorov-Smirnov, Wilks-Shapiro, Anderson-Darling and the D'Agostino-Pearson omnibus K2 test) were carried out and the results were found to conform to normality. In addition, a visual inspection of the modelâ€™s normal probability or Q-Q plot shows a nearly straight and appeared to exhibit no underlying pattern. The resulting histogram overlaid with the ensuing normal distribution curve also reveals that the residuals were truly random and that the model used was adequately fitted.","PeriodicalId":436607,"journal":{"name":"Bioremediation Science and Technology Research","volume":"133 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Outlier and Normality Testing of the Residuals for the Morgan-Mercer-Flodin (MMF) Model Used for Modelling the Total Number of COVID-19 Cases for Brazil\",\"authors\":\"G. Uba, Nuhu Danlahi Zandam, A. Mansur, M. Shukor\",\"doi\":\"10.54987/bstr.v9i1.592\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditionally, testing for outliers is performed by first creating a null hypothesis, H0, indicating that the suspected results do not differ significantly from those of other members of the data set, and then rejecting it if the likelihood of getting the experimental results is extremely low (e.g., p=0.05). Similarly, if H0 can be rejected, the questionable findings may be discarded as outliers as well. If H0 is retained in the data set, it is important to keep the dubious findings in the data set. In general, in nonlinear regression, the residuals of the curve must be normally distributed before any test for the existence of outliers is performed. This is often accomplished through the use of normalcy tests such as the Kolmogorov-Smirnov, Wilks-Shapiro, D'Agostino-Pearson, and Grubb's tests, the latter of which checks for the presence of an outlier and is the subject of this study. Normality tests for residues used in general nonlinear regression revealed that the usage of the Morgan-Mercer-Flodin (MMF) Model used for Modelling the Total Number of COVID-19 Cases for Brazil was adequate due to lack of an outlier. The critical value of Z from statistical table for Grubbsâ€™ test for a single outlier using mean and SD was 0.114 (n=50). The Grubbs (Alpha = 0.05) g value was 3.597. Individual Z value indicates that the residual with a value of -3 (row 3) was far from the rest and is deemed a significant outlier (p < 0.05). This outlier was removed, and subsequent Grubbâ€™s test show the absence of other outliers. As the Grubbsâ€™ test require for the normality of the residuals, several normality tests (Kolmogorov-Smirnov, Wilks-Shapiro, Anderson-Darling and the D'Agostino-Pearson omnibus K2 test) were carried out and the results were found to conform to normality. In addition, a visual inspection of the modelâ€™s normal probability or Q-Q plot shows a nearly straight and appeared to exhibit no underlying pattern. The resulting histogram overlaid with the ensuing normal distribution curve also reveals that the residuals were truly random and that the model used was adequately fitted.\",\"PeriodicalId\":436607,\"journal\":{\"name\":\"Bioremediation Science and Technology Research\",\"volume\":\"133 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioremediation Science and Technology Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54987/bstr.v9i1.592\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioremediation Science and Technology Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54987/bstr.v9i1.592","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

传统上，对异常值的检验是通过首先创建一个零假设H0来执行的，这表明怀疑的结果与数据集中其他成员的结果没有显著差异，然后如果获得实验结果的可能性极低(例如，p=0.05)，则拒绝它。同样，如果H0可以被拒绝，那么可疑的发现也可以作为异常值被丢弃。如果H0保留在数据集中，那么将可疑的发现保留在数据集中是很重要的。一般来说，在非线性回归中，在检验异常值是否存在之前，曲线的残差必须是正态分布的。这通常是通过使用正常测试，如Kolmogorov-Smirnov, Wilks-Shapiro, D'Agostino-Pearson和Grubb的测试来完成的，后者检查异常值的存在，是本研究的主题。一般非线性回归中使用的残差正态性检验表明，由于缺乏离群值，使用摩根-美世-弗洛丁(MMF)模型对巴西COVID-19病例总数进行建模是足够的。使用平均值和SD对单个异常值进行grubbs€™检验的统计表Z的临界值为0.114 (n=50)。Grubbs (Alpha = 0.05) g值为3.597。个别Z值表示残差值为-3(第3行)与其他残差相差甚远，认为是显著异常值(p < 0.05)。该异常值被删除，随后的grubb测试显示没有其他异常值。由于grubbs€™检验要求残差的正态性，因此进行了几种正态性检验(Kolmogorov-Smirnov, Wilks-Shapiro, Anderson-Darling和D'Agostino-Pearson综合K2检验)，发现结果符合正态性。此外，对模型的正态概率或Q-Q图的视觉检查显示几乎是直线的，似乎没有显示出潜在的模式。与随后的正态分布曲线叠加的结果直方图也表明，残差是真正随机的，并且所使用的模型是充分拟合的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Outlier and Normality Testing of the Residuals for the Morgan-Mercer-Flodin (MMF) Model Used for Modelling the Total Number of COVID-19 Cases for Brazil

Traditionally, testing for outliers is performed by first creating a null hypothesis, H0, indicating that the suspected results do not differ significantly from those of other members of the data set, and then rejecting it if the likelihood of getting the experimental results is extremely low (e.g., p=0.05). Similarly, if H0 can be rejected, the questionable findings may be discarded as outliers as well. If H0 is retained in the data set, it is important to keep the dubious findings in the data set. In general, in nonlinear regression, the residuals of the curve must be normally distributed before any test for the existence of outliers is performed. This is often accomplished through the use of normalcy tests such as the Kolmogorov-Smirnov, Wilks-Shapiro, D'Agostino-Pearson, and Grubb's tests, the latter of which checks for the presence of an outlier and is the subject of this study. Normality tests for residues used in general nonlinear regression revealed that the usage of the Morgan-Mercer-Flodin (MMF) Model used for Modelling the Total Number of COVID-19 Cases for Brazil was adequate due to lack of an outlier. The critical value of Z from statistical table for Grubbsâ€™ test for a single outlier using mean and SD was 0.114 (n=50). The Grubbs (Alpha = 0.05) g value was 3.597. Individual Z value indicates that the residual with a value of -3 (row 3) was far from the rest and is deemed a significant outlier (p < 0.05). This outlier was removed, and subsequent Grubbâ€™s test show the absence of other outliers. As the Grubbsâ€™ test require for the normality of the residuals, several normality tests (Kolmogorov-Smirnov, Wilks-Shapiro, Anderson-Darling and the D'Agostino-Pearson omnibus K2 test) were carried out and the results were found to conform to normality. In addition, a visual inspection of the modelâ€™s normal probability or Q-Q plot shows a nearly straight and appeared to exhibit no underlying pattern. The resulting histogram overlaid with the ensuing normal distribution curve also reveals that the residuals were truly random and that the model used was adequately fitted.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Bioremediation Science and Technology Research

自引率

0.00%

发文量