Predicting Clinical Outcomes in COVID-19 and Pneumonia Patients: A Machine Learning Approach.

IF 3.8 3区 医学 Q2 VIROLOGY
Viruses-Basel Pub Date : 2024-10-17 DOI:10.3390/v16101624
Kaida Cai, Zhengyan Wang, Xiaofang Yang, Wenzhi Fu, Xin Zhao
{"title":"Predicting Clinical Outcomes in COVID-19 and Pneumonia Patients: A Machine Learning Approach.","authors":"Kaida Cai, Zhengyan Wang, Xiaofang Yang, Wenzhi Fu, Xin Zhao","doi":"10.3390/v16101624","DOIUrl":null,"url":null,"abstract":"<p><p>In the clinical diagnosis of pneumonia, particularly during the COVID-19 pandemic, individuals who progress to a critical stage requiring mechanical ventilation are classified as mechanically ventilated critically ill patients. Accurately predicting the discharge outcomes for this specific cohort, especially those with COVID-19, is of paramount clinical importance. Missing data, a common issue in medical research, can significantly impact the validity of analyses. In this work, we address this challenge by employing two missing data imputation techniques: multiple imputation and missForest, to enhance data completeness. Additionally, we utilize the smoothly clipped absolute deviation (SCAD) penalized logistic regression method to select significant features. Our real data analysis compares the predictive performances of extreme learning machines, random forests, support vector machines, and XGBoost using 10-fold cross-validation. The results consistently show that XGBoost outperforms the other methods in predicting discharge outcomes, making it a reliable tool for clinical decision-making in the treatment of severe pneumonia, including COVID-19 cases. Within this context, the random forest imputation method generally enhances performance, underscoring its effectiveness in managing missing data compared to multiple imputation.</p>","PeriodicalId":49328,"journal":{"name":"Viruses-Basel","volume":"16 10","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512216/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Viruses-Basel","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/v16101624","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"VIROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

In the clinical diagnosis of pneumonia, particularly during the COVID-19 pandemic, individuals who progress to a critical stage requiring mechanical ventilation are classified as mechanically ventilated critically ill patients. Accurately predicting the discharge outcomes for this specific cohort, especially those with COVID-19, is of paramount clinical importance. Missing data, a common issue in medical research, can significantly impact the validity of analyses. In this work, we address this challenge by employing two missing data imputation techniques: multiple imputation and missForest, to enhance data completeness. Additionally, we utilize the smoothly clipped absolute deviation (SCAD) penalized logistic regression method to select significant features. Our real data analysis compares the predictive performances of extreme learning machines, random forests, support vector machines, and XGBoost using 10-fold cross-validation. The results consistently show that XGBoost outperforms the other methods in predicting discharge outcomes, making it a reliable tool for clinical decision-making in the treatment of severe pneumonia, including COVID-19 cases. Within this context, the random forest imputation method generally enhances performance, underscoring its effectiveness in managing missing data compared to multiple imputation.

预测 COVID-19 和肺炎患者的临床结果:机器学习方法
在肺炎的临床诊断中,尤其是在 COVID-19 大流行期间,进展到需要机械通气的危重阶段的患者被归类为机械通气重症患者。准确预测这一特殊群体(尤其是 COVID-19 患者)的出院预后具有极其重要的临床意义。缺失数据是医学研究中常见的问题,会严重影响分析的有效性。在这项工作中,我们采用了两种缺失数据估算技术:多重估算和 missForest,以提高数据的完整性,从而应对这一挑战。此外,我们还利用平滑剪切绝对偏差(SCAD)惩罚逻辑回归方法来选择重要特征。我们的真实数据分析使用 10 倍交叉验证比较了极限学习机、随机森林、支持向量机和 XGBoost 的预测性能。结果一致表明,XGBoost 在预测出院结果方面优于其他方法,是治疗重症肺炎(包括 COVID-19 病例)的可靠临床决策工具。在此背景下,随机森林归因法普遍提高了性能,与多重归因法相比,凸显了其在管理缺失数据方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Viruses-Basel
Viruses-Basel VIROLOGY-
CiteScore
7.30
自引率
12.80%
发文量
2445
审稿时长
1 months
期刊介绍: Viruses (ISSN 1999-4915) is an open access journal which provides an advanced forum for studies of viruses. It publishes reviews, regular research papers, communications, conference reports and short notes. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced. We also encourage the publication of timely reviews and commentaries on topics of interest to the virology community and feature highlights from the virology literature in the ''News and Views'' section. Electronic files or software regarding the full details of the calculation and experimental procedure, if unable to be published in a normal way, can be deposited as supplementary material.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信