Revealing Behavior Patterns of SARS-CoV-2 using Clustering Analysis and XGBoost Error Forecasting Models

Q4 Medicine
Nasrin Talkhi, Narges Akhavan Fatemi, M. Jabbari Nooghabi
DOI: 10.30699/ijmm.16.3.221 (https://doi.org/10.30699/ijmm.16.3.221)
Iranian Journal of Medical Microbiology. Journal Article. Published 2022-05-01.

Abstract

Background and Aim: COVID-19 is a highly contagious infectious disease that has disrupted daily life and raised serious concern among governments and public health officials. Forecasting its future behavior can help in allocating medical resources and defining effective disease-control strategies. Materials and Methods: The data comprised the cumulative and absolute numbers of confirmed cases, deaths, and recoveries from COVID-19 from February 20 to July 3, 2021. We applied hierarchical cluster analysis. To forecast the future behavior of COVID-19, we used the Auto-Regressive Integrated Moving Average (ARIMA), Exponential Smoothing (ETS), Automatic Forecasting Procedure (Prophet), Naive, Seasonal Naive (s-Naive), boosted ARIMA, and boosted Prophet models. Results: Clustering showed that the coronavirus behaved similarly in Iran and in other countries, including France, Russia, Turkey, the United Kingdom (UK), Argentina, Colombia, Italy, Spain, Germany, Poland, Mexico, and Indonesia. It also revealed similar patterns of SARS-CoV-2 for the same countries in six groups. The XGBoost family of models was more accurate than the other models. Conclusion: In Iran, COVID-19 showed behavior patterns similar to those in the developed countries studied. The XGBoost family of models produced practical results and high precision in forecasting the behavior patterns of the virus. Given the rapid worldwide spread of the virus, these models can be used to forecast the behavior patterns of SARS-CoV-2. Preventing the spread of the coronavirus, controlling the disease, and breaking its chain of transmission require community assistance, and the role of statisticians in this mission should not be neglected.
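The Naive and Seasonal Naive (s-Naive) baselines named in the Materials and Methods can be illustrated with a minimal sketch. This is not the authors' code: the weekly season length, the function names, and the toy series are assumptions made for illustration, and mean absolute error is used only as one plausible accuracy measure, since the abstract does not say which metric was reported.

```python
from statistics import mean


def naive_forecast(history, horizon):
    """Naive baseline: repeat the last observed value for every future step."""
    return [history[-1]] * horizon


def seasonal_naive_forecast(history, horizon, season_length=7):
    """Seasonal naive baseline: repeat the value seen one season earlier.

    season_length=7 (weekly pattern in daily counts) is an assumption.
    """
    if len(history) < season_length:
        raise ValueError("history must cover at least one full season")
    last_season = history[-season_length:]
    return [last_season[h % season_length] for h in range(horizon)]


def mean_absolute_error(actual, predicted):
    """Average absolute difference between observed and forecast values."""
    return mean(abs(a - p) for a, p in zip(actual, predicted))


# Hypothetical daily counts with a weekly reporting dip (illustrative only,
# not data from the study).
train = [100, 120, 130, 125, 110, 60, 50] * 3
test = [100, 120, 130, 125, 110, 60, 50]

naive_mae = mean_absolute_error(test, naive_forecast(train, len(test)))
snaive_mae = mean_absolute_error(test, seasonal_naive_forecast(train, len(test)))
print(f"Naive MAE: {naive_mae:.2f}, s-Naive MAE: {snaive_mae:.2f}")
```

On a series with a strong weekly cycle like this toy one, the seasonal baseline tracks the pattern while the plain naive forecast does not. That is why such baselines are a useful yardstick for richer models such as ARIMA, Prophet, and their boosted variants.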
Source journal: Iranian Journal of Medical Microbiology (Medicine, Infectious Diseases). CiteScore: 1.60. Self-citation rate: 0.00%. Articles published: 70. Review time: 8 weeks.