重症COVID-19 ICU入院和死亡率预测:机器学习方法

G. Crowley, S. Kwon, L. Mengling, A. Nolan
{"title":"重症COVID-19 ICU入院和死亡率预测:机器学习方法","authors":"G. Crowley, S. Kwon, L. Mengling, A. Nolan","doi":"10.1164/AJRCCM-CONFERENCE.2021.203.1_MEETINGABSTRACTS.A2564","DOIUrl":null,"url":null,"abstract":"RATIONALE. Coronavirus Disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), emerged in Wuhan, China in late 2019 and has expanded into a global pandemic. This mass casualty triage, while requiring tremendous resources, also required early decision-making and ultimately allocation of critical care resources. Primary Objective was to ascertain risk factors of COVID-19 patients requiring ICU admission. Identify biomarkers, clinical risk factors, and comorbid conditions that can predict clinical outcome. METHODS. Setting and Study Design. We performed a natural history/retrospective chart review of patient admissions (n=5,568) at our facility for patients admitted between March 1-May 1, 2020, over 18 years of age, with a positive test less than 15 days before admission. We examined their inflammatory biomarker profile and clinical phenotype collected as part of their standard of care (n=90 variables). Analysis. Random forests used as a variable selector were assessed via a modified hamming distance between variable importance rankings of models with identical hyperparameters. The top 10% of variables (n=9) by mean decrease accuracy (MDA) were then included in a gradient-boosted tree model (xgboost package, R-Project) to build classifiers of ICU admissions and mortality. A random hyperparameter space search determined a final model that maximized 5-fold cross-validated AUCROC. All data was collected in compliance with the Code of Federal Regulations, Title 21, Part 11 and approved by the NYU IRB#20-00473. RESULTS. The classifiers of ICU admission and mortality had AUCROC = 0.93 and 0.90, and classification error = 15.6% and 20.2% based on the Youden's index-optimal probability threshold, respectively. Variables in the final predictive models of ICU admission and mortality are shown by rank (by MDA) in each model, with rank of 1 being the most important, Figure 1. In predicting ICU admission, the three most important variables were triglycerides, procalcitonin, and c-reactive protein;age, initial O2 flow (L/min), and blood O2 saturation were the three most important predictors of mortality. Procalcitonin, blood O2 saturation, lactate, and initial O2 flow (L/min) were predicted both ICU admission and mortality. CONCLUSION. Our models will be included in an online calculator that will be made available and can be used at point of care by providers to assist risk assessment and triage. Our analysis suggests that novel biomarker combinations may be important in assessment of COVID-19 severity. Future work will include validation of these models in other populations.","PeriodicalId":111156,"journal":{"name":"TP49. TP049 COVID: ARDS AND ICU MANAGEMENT","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"ICU Admission and Mortality Prediction in Severe COVID-19: A Machine Learning Approach\",\"authors\":\"G. Crowley, S. Kwon, L. Mengling, A. Nolan\",\"doi\":\"10.1164/AJRCCM-CONFERENCE.2021.203.1_MEETINGABSTRACTS.A2564\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"RATIONALE. Coronavirus Disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), emerged in Wuhan, China in late 2019 and has expanded into a global pandemic. This mass casualty triage, while requiring tremendous resources, also required early decision-making and ultimately allocation of critical care resources. Primary Objective was to ascertain risk factors of COVID-19 patients requiring ICU admission. Identify biomarkers, clinical risk factors, and comorbid conditions that can predict clinical outcome. METHODS. Setting and Study Design. We performed a natural history/retrospective chart review of patient admissions (n=5,568) at our facility for patients admitted between March 1-May 1, 2020, over 18 years of age, with a positive test less than 15 days before admission. We examined their inflammatory biomarker profile and clinical phenotype collected as part of their standard of care (n=90 variables). Analysis. Random forests used as a variable selector were assessed via a modified hamming distance between variable importance rankings of models with identical hyperparameters. The top 10% of variables (n=9) by mean decrease accuracy (MDA) were then included in a gradient-boosted tree model (xgboost package, R-Project) to build classifiers of ICU admissions and mortality. A random hyperparameter space search determined a final model that maximized 5-fold cross-validated AUCROC. All data was collected in compliance with the Code of Federal Regulations, Title 21, Part 11 and approved by the NYU IRB#20-00473. RESULTS. The classifiers of ICU admission and mortality had AUCROC = 0.93 and 0.90, and classification error = 15.6% and 20.2% based on the Youden's index-optimal probability threshold, respectively. Variables in the final predictive models of ICU admission and mortality are shown by rank (by MDA) in each model, with rank of 1 being the most important, Figure 1. In predicting ICU admission, the three most important variables were triglycerides, procalcitonin, and c-reactive protein;age, initial O2 flow (L/min), and blood O2 saturation were the three most important predictors of mortality. Procalcitonin, blood O2 saturation, lactate, and initial O2 flow (L/min) were predicted both ICU admission and mortality. CONCLUSION. Our models will be included in an online calculator that will be made available and can be used at point of care by providers to assist risk assessment and triage. Our analysis suggests that novel biomarker combinations may be important in assessment of COVID-19 severity. Future work will include validation of these models in other populations.\",\"PeriodicalId\":111156,\"journal\":{\"name\":\"TP49. TP049 COVID: ARDS AND ICU MANAGEMENT\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"TP49. TP049 COVID: ARDS AND ICU MANAGEMENT\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1164/AJRCCM-CONFERENCE.2021.203.1_MEETINGABSTRACTS.A2564\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"TP49. TP049 COVID: ARDS AND ICU MANAGEMENT","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1164/AJRCCM-CONFERENCE.2021.203.1_MEETINGABSTRACTS.A2564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

基本原理。2019冠状病毒病(COVID-19)是由严重急性呼吸系统综合征冠状病毒2 (SARS-CoV-2)引起的,于2019年底在中国武汉出现,现已扩大为全球大流行。这种大规模伤亡分诊虽然需要大量资源,但也需要早期决策和最终分配重症护理资源。主要目的是了解COVID-19患者需要进入ICU的危险因素。识别可以预测临床结果的生物标志物、临床危险因素和合并症。方法。环境和研究设计。我们对2020年3月1日至5月1日期间入院的患者(n= 5568)进行了自然病史/回顾性图表回顾,这些患者年龄超过18岁,入院前不到15天检测呈阳性。我们检查了他们的炎症生物标志物概况和临床表型,作为他们的标准护理的一部分(n=90个变量)。分析。随机森林用作变量选择器,通过具有相同超参数的模型的变量重要性排名之间的修改汉明距离进行评估。然后将平均降低准确率(MDA)前10%的变量(n=9)纳入梯度增强树模型(xgboost package, R-Project),以构建ICU入院和死亡率分类器。随机超参数空间搜索确定最终模型,最大化5倍交叉验证的AUCROC。所有数据的收集符合联邦法规第21篇第11部分,并经纽约大学IRB#20-00473批准。结果。根据约登指数-最优概率阈值,ICU住院和死亡率的分类器AUCROC分别为0.93和0.90,分类误差分别为15.6%和20.2%。ICU住院率和死亡率最终预测模型中的变量按每个模型中的等级(通过MDA)表示,其中等级为1的变量最重要,见图1。在预测ICU入院时,三个最重要的变量是甘油三酯、降钙素原和c反应蛋白;年龄、初始氧流量(L/min)和血氧饱和度是死亡率的三个最重要的预测因素。预测降钙素原、血氧饱和度、乳酸、初始血氧流量(L/min)及死亡率。结论。我们的模型将包含在一个在线计算器中,该计算器将提供,并可由提供者在护理点使用,以协助风险评估和分诊。我们的分析表明,新的生物标志物组合可能对评估COVID-19严重程度很重要。未来的工作将包括在其他人群中验证这些模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
ICU Admission and Mortality Prediction in Severe COVID-19: A Machine Learning Approach
RATIONALE. Coronavirus Disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), emerged in Wuhan, China in late 2019 and has expanded into a global pandemic. This mass casualty triage, while requiring tremendous resources, also required early decision-making and ultimately allocation of critical care resources. Primary Objective was to ascertain risk factors of COVID-19 patients requiring ICU admission. Identify biomarkers, clinical risk factors, and comorbid conditions that can predict clinical outcome. METHODS. Setting and Study Design. We performed a natural history/retrospective chart review of patient admissions (n=5,568) at our facility for patients admitted between March 1-May 1, 2020, over 18 years of age, with a positive test less than 15 days before admission. We examined their inflammatory biomarker profile and clinical phenotype collected as part of their standard of care (n=90 variables). Analysis. Random forests used as a variable selector were assessed via a modified hamming distance between variable importance rankings of models with identical hyperparameters. The top 10% of variables (n=9) by mean decrease accuracy (MDA) were then included in a gradient-boosted tree model (xgboost package, R-Project) to build classifiers of ICU admissions and mortality. A random hyperparameter space search determined a final model that maximized 5-fold cross-validated AUCROC. All data was collected in compliance with the Code of Federal Regulations, Title 21, Part 11 and approved by the NYU IRB#20-00473. RESULTS. The classifiers of ICU admission and mortality had AUCROC = 0.93 and 0.90, and classification error = 15.6% and 20.2% based on the Youden's index-optimal probability threshold, respectively. Variables in the final predictive models of ICU admission and mortality are shown by rank (by MDA) in each model, with rank of 1 being the most important, Figure 1. In predicting ICU admission, the three most important variables were triglycerides, procalcitonin, and c-reactive protein;age, initial O2 flow (L/min), and blood O2 saturation were the three most important predictors of mortality. Procalcitonin, blood O2 saturation, lactate, and initial O2 flow (L/min) were predicted both ICU admission and mortality. CONCLUSION. Our models will be included in an online calculator that will be made available and can be used at point of care by providers to assist risk assessment and triage. Our analysis suggests that novel biomarker combinations may be important in assessment of COVID-19 severity. Future work will include validation of these models in other populations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信