Multicenter Development and Prospective Validation of eCARTv5: A Gradient Boosted Machine Learning Early Warning Score

medRxiv - Intensive Care and Critical Care Medicine Pub Date : 2024-03-19 DOI:10.1101/2024.03.18.24304462

Matthew M. Churpek, Kyle A. Carey, Ashley Snyder, Christopher J. Winslow, Emily R. Gilbert, Nirav S. Shah, Brian W. Patterson, Majid Afshar, Alan Weiss, Devendra N. Amin, Deborah J. Rhodes, Dana P. Edelson

{"title":"Multicenter Development and Prospective Validation of eCARTv5: A Gradient Boosted Machine Learning Early Warning Score","authors":"Matthew M. Churpek, Kyle A. Carey, Ashley Snyder, Christopher J. Winslow, Emily R. Gilbert, Nirav S. Shah, Brian W. Patterson, Majid Afshar, Alan Weiss, Devendra N. Amin, Deborah J. Rhodes, Dana P. Edelson","doi":"10.1101/2024.03.18.24304462","DOIUrl":null,"url":null,"abstract":"Rationale: Early detection of clinical deterioration using early warning scores may improve outcomes. However, most implemented scores were developed using logistic regression, only underwent retrospective internal validation, and were not tested in important patient subgroups.\nObjectives: To develop a gradient boosted machine model (eCARTv5) for identifying clinical deterioration and then validate externally, test prospectively, and evaluate across patient subgroups. Methods: All adult patients hospitalized on the wards in seven hospitals from 2008-2022 were used to develop eCARTv5, with demographics, vital signs, clinician documentation, and laboratory values utilized to predict intensive care unit transfer or death in the next 24 hours. The model was externally validated retrospectively in 21 hospitals from 2009-2023 and prospectively in 10 hospitals from February to May 2023. eCARTv5 was compared to the Modified Early Warning Score (MEWS) and the National Early Warning Score (NEWS) using the area under the receiver operating characteristic curve (AUROC). Measurements and Main Results: The development cohort included 901,491 admissions, the retrospective validation cohort included 1,769,461 admissions, and the prospective validation cohort included 46,330 admissions. In retrospective validation, eCART had the highest AUROC (0.835; 95%CI 0.834, 0.835), followed by NEWS (0.766 (95%CI 0.766, 0.767)), and MEWS (0.704 (95%CI 0.703, 0.704)). eCART′s performance remained high (AUROC ≥ 0.80) across a range of patient demographics, clinical conditions, and during prospective validation.\nConclusions: We developed eCARTv5, which accurately identifies early clinical deterioration in hospitalized ward patients. Our model performed better than the NEWS and MEWS retrospectively, prospectively, and across a range of subgroups.","PeriodicalId":501249,"journal":{"name":"medRxiv - Intensive Care and Critical Care Medicine","volume":"100 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Intensive Care and Critical Care Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.03.18.24304462","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Rationale: Early detection of clinical deterioration using early warning scores may improve outcomes. However, most implemented scores were developed using logistic regression, only underwent retrospective internal validation, and were not tested in important patient subgroups. Objectives: To develop a gradient boosted machine model (eCARTv5) for identifying clinical deterioration and then validate externally, test prospectively, and evaluate across patient subgroups. Methods: All adult patients hospitalized on the wards in seven hospitals from 2008-2022 were used to develop eCARTv5, with demographics, vital signs, clinician documentation, and laboratory values utilized to predict intensive care unit transfer or death in the next 24 hours. The model was externally validated retrospectively in 21 hospitals from 2009-2023 and prospectively in 10 hospitals from February to May 2023. eCARTv5 was compared to the Modified Early Warning Score (MEWS) and the National Early Warning Score (NEWS) using the area under the receiver operating characteristic curve (AUROC). Measurements and Main Results: The development cohort included 901,491 admissions, the retrospective validation cohort included 1,769,461 admissions, and the prospective validation cohort included 46,330 admissions. In retrospective validation, eCART had the highest AUROC (0.835; 95%CI 0.834, 0.835), followed by NEWS (0.766 (95%CI 0.766, 0.767)), and MEWS (0.704 (95%CI 0.703, 0.704)). eCART′s performance remained high (AUROC ≥ 0.80) across a range of patient demographics, clinical conditions, and during prospective validation. Conclusions: We developed eCARTv5, which accurately identifies early clinical deterioration in hospitalized ward patients. Our model performed better than the NEWS and MEWS retrospectively, prospectively, and across a range of subgroups.

查看原文本刊更多论文

eCARTv5 的多中心开发和前瞻性验证：梯度提升的机器学习预警分数

理由：使用早期预警评分及早发现临床病情恶化可改善预后。然而，大多数已实施的评分都是使用逻辑回归法开发的，只进行了回顾性内部验证，而且没有在重要的患者亚群中进行测试：目的：开发梯度提升机器模型（eCARTv5），用于识别临床恶化，然后进行外部验证、前瞻性测试，并在患者亚群中进行评估。方法2008-2022 年间在七家医院病房住院的所有成人患者都被用来开发 eCARTv5，利用人口统计学、生命体征、临床医生记录和实验室值来预测未来 24 小时内转入重症监护病房或死亡的情况。eCARTv5 采用接收者操作特征曲线下面积 (AUROC) 与改良早期预警评分 (MEWS) 和国家早期预警评分 (NEWS) 进行比较。测量和主要结果：开发队列包括 901,491 例入院患者，回顾性验证队列包括 1,769,461 例入院患者，前瞻性验证队列包括 46,330 例入院患者。在回顾性验证中，eCART的AUROC最高（0.835；95%CI 0.834，0.835），其次是NEWS（0.766（95%CI 0.766，0.767））和MEWS（0.704（95%CI 0.703，0.704））。在一系列患者人口统计学、临床条件和前瞻性验证中，eCART的性能仍然很高（AUROC≥0.80）：我们开发的 eCARTv5 能准确识别住院病房患者的早期临床恶化。我们的模型在回顾性、前瞻性和各种亚组中的表现均优于 NEWS 和 MEWS。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

medRxiv - Intensive Care and Critical Care Medicine

自引率

0.00%

发文量