基于队列数据的20种机器学习分类算法的COVID-19住院预测建模

Zeynab Salehnasab, A. Mousavizadeh, Ghasem Ghalamfarsa, A. Garavand, C. Salehnasab
{"title":"基于队列数据的20种机器学习分类算法的COVID-19住院预测建模","authors":"Zeynab Salehnasab, A. Mousavizadeh, Ghasem Ghalamfarsa, A. Garavand, C. Salehnasab","doi":"10.30699/fhi.v12i0.473","DOIUrl":null,"url":null,"abstract":"Introduction: The global COVID-19 pandemic has led to a health crisis, emphasizing the need to identify high-risk patients for effective resource allocation and prioritized hospitalization. Previous studies have been limited in their use of algorithms and variables, while this research expands to include lifestyle factors and optimizes hyperparameters for twenty machine learning algorithms, enhancing prediction accuracy and identifying key predictors.Material and Methods: In this cross-sectional study, we analyzed data from 207 COVID-19 patients. The Boruta algorithm was used to select the best features for twenty classification algorithms, and RandomizedSearchCV was utilized to optimize hyperparameters. The models were evaluated using performance metrics such as accuracy, f-measure, and area under the curve (AUC).Results: The study identified eight key predictors of COVID-19 hospitalization, which include gamma-glutamyl transpeptidase, alkaline phosphatase, diagnosis by CT scan, mean platelet volume, mean corpuscular volume, fasting blood sugar, red blood cell count, and mean corpuscular hemoglobin concentration. By optimizing the hyperparameters of twenty machine learning algorithms, the accuracy and AUC were improved. With an outstanding AUC of 81.25, the XGBClassifier model exhibited superior performance.Conclusion: The findings of this study can assist clinicians in allocating resources effectively and improving patient care. Additionally, this approach can aid healthcare researchers in leveraging artificial intelligence to manage diseases.","PeriodicalId":154611,"journal":{"name":"Frontiers in Health Informatics","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predictive Modeling of COVID-19 Hospitalization Using Twenty Machine Learning Classification Algorithms on Cohort Data\",\"authors\":\"Zeynab Salehnasab, A. Mousavizadeh, Ghasem Ghalamfarsa, A. Garavand, C. Salehnasab\",\"doi\":\"10.30699/fhi.v12i0.473\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: The global COVID-19 pandemic has led to a health crisis, emphasizing the need to identify high-risk patients for effective resource allocation and prioritized hospitalization. Previous studies have been limited in their use of algorithms and variables, while this research expands to include lifestyle factors and optimizes hyperparameters for twenty machine learning algorithms, enhancing prediction accuracy and identifying key predictors.Material and Methods: In this cross-sectional study, we analyzed data from 207 COVID-19 patients. The Boruta algorithm was used to select the best features for twenty classification algorithms, and RandomizedSearchCV was utilized to optimize hyperparameters. The models were evaluated using performance metrics such as accuracy, f-measure, and area under the curve (AUC).Results: The study identified eight key predictors of COVID-19 hospitalization, which include gamma-glutamyl transpeptidase, alkaline phosphatase, diagnosis by CT scan, mean platelet volume, mean corpuscular volume, fasting blood sugar, red blood cell count, and mean corpuscular hemoglobin concentration. By optimizing the hyperparameters of twenty machine learning algorithms, the accuracy and AUC were improved. With an outstanding AUC of 81.25, the XGBClassifier model exhibited superior performance.Conclusion: The findings of this study can assist clinicians in allocating resources effectively and improving patient care. Additionally, this approach can aid healthcare researchers in leveraging artificial intelligence to manage diseases.\",\"PeriodicalId\":154611,\"journal\":{\"name\":\"Frontiers in Health Informatics\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30699/fhi.v12i0.473\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30699/fhi.v12i0.473","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

导语:2019冠状病毒病(COVID-19)全球大流行导致了一场健康危机,强调了识别高风险患者以进行有效资源分配和优先住院治疗的必要性。以往的研究在算法和变量的使用上受到限制,而本研究扩展到包括生活方式因素,并优化了20种机器学习算法的超参数,提高了预测精度并识别了关键预测因子。材料和方法:在这项横断面研究中,我们分析了207例COVID-19患者的数据。使用Boruta算法为20种分类算法选择最佳特征,并使用RandomizedSearchCV对超参数进行优化。使用精度、f-measure和曲线下面积(AUC)等性能指标对模型进行评估。结果:本研究确定了γ -谷氨酰转肽酶、碱性磷酸酶、CT扫描诊断、平均血小板体积、平均红细胞体积、空腹血糖、红细胞计数、平均红细胞血红蛋白浓度等8个预测COVID-19住院的关键因素。通过对20种机器学习算法的超参数进行优化,提高了精度和AUC。XGBClassifier模型的AUC为81.25,表现出优异的性能。结论:本研究结果有助于临床医生有效地分配资源,改善患者护理。此外,这种方法可以帮助医疗保健研究人员利用人工智能来管理疾病。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predictive Modeling of COVID-19 Hospitalization Using Twenty Machine Learning Classification Algorithms on Cohort Data
Introduction: The global COVID-19 pandemic has led to a health crisis, emphasizing the need to identify high-risk patients for effective resource allocation and prioritized hospitalization. Previous studies have been limited in their use of algorithms and variables, while this research expands to include lifestyle factors and optimizes hyperparameters for twenty machine learning algorithms, enhancing prediction accuracy and identifying key predictors.Material and Methods: In this cross-sectional study, we analyzed data from 207 COVID-19 patients. The Boruta algorithm was used to select the best features for twenty classification algorithms, and RandomizedSearchCV was utilized to optimize hyperparameters. The models were evaluated using performance metrics such as accuracy, f-measure, and area under the curve (AUC).Results: The study identified eight key predictors of COVID-19 hospitalization, which include gamma-glutamyl transpeptidase, alkaline phosphatase, diagnosis by CT scan, mean platelet volume, mean corpuscular volume, fasting blood sugar, red blood cell count, and mean corpuscular hemoglobin concentration. By optimizing the hyperparameters of twenty machine learning algorithms, the accuracy and AUC were improved. With an outstanding AUC of 81.25, the XGBClassifier model exhibited superior performance.Conclusion: The findings of this study can assist clinicians in allocating resources effectively and improving patient care. Additionally, this approach can aid healthcare researchers in leveraging artificial intelligence to manage diseases.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信