基于机器学习的COVID-19病例筛查解决方案:社会人口统计学和行为因素分析与COVID-19检测

K. M. Aslam Uddin, Farida Siddiqi Prity, Maisha Tasnim, Sumiya Nur Jannat, Mohammad Omar Faruk, Jahirul Islam, Saydul Akbar Murad, Apurba Adhikary, Anupam Kumar Bairagi
{"title":"基于机器学习的COVID-19病例筛查解决方案:社会人口统计学和行为因素分析与COVID-19检测","authors":"K. M. Aslam Uddin, Farida Siddiqi Prity, Maisha Tasnim, Sumiya Nur Jannat, Mohammad Omar Faruk, Jahirul Islam, Saydul Akbar Murad, Apurba Adhikary, Anupam Kumar Bairagi","doi":"10.1007/s44230-023-00049-9","DOIUrl":null,"url":null,"abstract":"Abstract The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encompass densely populated areas, adherence to mask-wearing protocols, inadequate awareness levels, and various behavioral and social practices. Despite the extensive research surrounding COVID-19 detection, an unfortunate dearth of studies has emerged to meticulously evaluate the intricate interplay between socio-demographic and behavioral factors and the likelihood of COVID-19 infection. Thus, a comprehensive online-based cross-sectional survey was methodically orchestrated, amassing data from a substantial sample size of 500 respondents. The precisely designed survey questionnaire encompassed various variables encompassing socio-demographics, behaviors, and social factors. The Bivariate Pearson’s Chi-square association test was deftly employed to unravel the complex associations between the explanatory variables and COVID-19 infection. The feature importance approach was also introduced to discern the utmost critical features underpinning this infectious predicament. Four distinct Machine Learning (ML) algorithms, specifically Decision Tree, Random Forest, CatBoost, and XGBoost, were employed to accurately predict COVID-19 infection based on a comprehensive analysis of socio-demographic and behavioral factors. The performance of these models was rigorously assessed using a range of evaluation metrics, including accuracy, recall, precision, ROC-AUC score, and F1 score. Pearson’s Chi-square test revealed a statistically significant association between vaccination status and COVID-19 infection. The use of sanitizer and masks, the timing of infection, and the interval between the first and second vaccine doses were significantly correlated with the likelihood of contracting the COVID-19 virus. Among the ML models tested, the XGBoost classifier demonstrated the highest classification accuracy, achieving an impressive 97.6%. These findings provide valuable insights for individuals, communities, and policymakers to implement targeted strategies aimed at mitigating the impact of the COVID-19 pandemic.","PeriodicalId":303535,"journal":{"name":"Human-Centric Intelligent Systems","volume":"10 9","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection\",\"authors\":\"K. M. Aslam Uddin, Farida Siddiqi Prity, Maisha Tasnim, Sumiya Nur Jannat, Mohammad Omar Faruk, Jahirul Islam, Saydul Akbar Murad, Apurba Adhikary, Anupam Kumar Bairagi\",\"doi\":\"10.1007/s44230-023-00049-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encompass densely populated areas, adherence to mask-wearing protocols, inadequate awareness levels, and various behavioral and social practices. Despite the extensive research surrounding COVID-19 detection, an unfortunate dearth of studies has emerged to meticulously evaluate the intricate interplay between socio-demographic and behavioral factors and the likelihood of COVID-19 infection. Thus, a comprehensive online-based cross-sectional survey was methodically orchestrated, amassing data from a substantial sample size of 500 respondents. The precisely designed survey questionnaire encompassed various variables encompassing socio-demographics, behaviors, and social factors. The Bivariate Pearson’s Chi-square association test was deftly employed to unravel the complex associations between the explanatory variables and COVID-19 infection. The feature importance approach was also introduced to discern the utmost critical features underpinning this infectious predicament. Four distinct Machine Learning (ML) algorithms, specifically Decision Tree, Random Forest, CatBoost, and XGBoost, were employed to accurately predict COVID-19 infection based on a comprehensive analysis of socio-demographic and behavioral factors. The performance of these models was rigorously assessed using a range of evaluation metrics, including accuracy, recall, precision, ROC-AUC score, and F1 score. Pearson’s Chi-square test revealed a statistically significant association between vaccination status and COVID-19 infection. The use of sanitizer and masks, the timing of infection, and the interval between the first and second vaccine doses were significantly correlated with the likelihood of contracting the COVID-19 virus. Among the ML models tested, the XGBoost classifier demonstrated the highest classification accuracy, achieving an impressive 97.6%. These findings provide valuable insights for individuals, communities, and policymakers to implement targeted strategies aimed at mitigating the impact of the COVID-19 pandemic.\",\"PeriodicalId\":303535,\"journal\":{\"name\":\"Human-Centric Intelligent Systems\",\"volume\":\"10 9\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human-Centric Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s44230-023-00049-9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human-Centric Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s44230-023-00049-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

2019冠状病毒病大流行引发了一场前所未有的全球危机,引发了前所未有的疾病、死亡和经济混乱浪潮。许多社会和行为方面共同助长了COVID-19在全球的猖獗传播。这些因素包括人口稠密地区、遵守佩戴口罩协议、意识水平不足以及各种行为和社会习俗。尽管围绕COVID-19检测进行了广泛的研究,但不幸的是,缺乏研究来仔细评估社会人口和行为因素与COVID-19感染可能性之间复杂的相互作用。因此,我们有条不紊地进行了一项全面的在线横断面调查,从500名受访者的大量样本中收集了数据。精确设计的调查问卷包含各种变量,包括社会人口统计、行为和社会因素。双变量皮尔逊卡方关联检验被巧妙地用于揭示解释变量与COVID-19感染之间的复杂关联。还引入了特征重要性方法来辨别支撑这种传染性困境的最关键特征。采用四种不同的机器学习算法,即决策树、随机森林、CatBoost和XGBoost,在综合分析社会人口和行为因素的基础上,准确预测COVID-19感染。使用一系列评估指标对这些模型的性能进行了严格评估,包括准确性、召回率、精度、ROC-AUC评分和F1评分。皮尔逊卡方检验显示,疫苗接种状况与COVID-19感染之间存在统计学上显著的关联。使用消毒剂和口罩、感染时间以及第一次和第二次接种疫苗的间隔与感染COVID-19病毒的可能性显著相关。在测试的ML模型中,XGBoost分类器的分类准确率最高,达到了令人印象深刻的97.6%。这些发现为个人、社区和政策制定者实施旨在减轻COVID-19大流行影响的有针对性战略提供了宝贵见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
Abstract The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encompass densely populated areas, adherence to mask-wearing protocols, inadequate awareness levels, and various behavioral and social practices. Despite the extensive research surrounding COVID-19 detection, an unfortunate dearth of studies has emerged to meticulously evaluate the intricate interplay between socio-demographic and behavioral factors and the likelihood of COVID-19 infection. Thus, a comprehensive online-based cross-sectional survey was methodically orchestrated, amassing data from a substantial sample size of 500 respondents. The precisely designed survey questionnaire encompassed various variables encompassing socio-demographics, behaviors, and social factors. The Bivariate Pearson’s Chi-square association test was deftly employed to unravel the complex associations between the explanatory variables and COVID-19 infection. The feature importance approach was also introduced to discern the utmost critical features underpinning this infectious predicament. Four distinct Machine Learning (ML) algorithms, specifically Decision Tree, Random Forest, CatBoost, and XGBoost, were employed to accurately predict COVID-19 infection based on a comprehensive analysis of socio-demographic and behavioral factors. The performance of these models was rigorously assessed using a range of evaluation metrics, including accuracy, recall, precision, ROC-AUC score, and F1 score. Pearson’s Chi-square test revealed a statistically significant association between vaccination status and COVID-19 infection. The use of sanitizer and masks, the timing of infection, and the interval between the first and second vaccine doses were significantly correlated with the likelihood of contracting the COVID-19 virus. Among the ML models tested, the XGBoost classifier demonstrated the highest classification accuracy, achieving an impressive 97.6%. These findings provide valuable insights for individuals, communities, and policymakers to implement targeted strategies aimed at mitigating the impact of the COVID-19 pandemic.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信