On-scene machine learning prediction model for massive transfusion in trauma and its association with in-hospital mortality.

IF 4.5 3区 医学 Q1 SURGERY
BJS Open Pub Date : 2025-12-29 DOI:10.1093/bjsopen/zraf167
Byungchul Yu, Jaehyeong Cho, Hyunjee Kim, Seung Ha Hwang, Jiyoung Hwang, Soeun Kim, Jiyeon Oh, Sooji Lee, Do Wan Kim, Junepill Seok, Kyounghwan Kim, Jinseok Lee, Dong Keon Yon, Wu Seong Kang
{"title":"On-scene machine learning prediction model for massive transfusion in trauma and its association with in-hospital mortality.","authors":"Byungchul Yu, Jaehyeong Cho, Hyunjee Kim, Seung Ha Hwang, Jiyoung Hwang, Soeun Kim, Jiyeon Oh, Sooji Lee, Do Wan Kim, Junepill Seok, Kyounghwan Kim, Jinseok Lee, Dong Keon Yon, Wu Seong Kang","doi":"10.1093/bjsopen/zraf167","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Early triage for massive transfusion (MT) is essential in trauma care but most existing scoring systems rely on in-hospital data. To address this limitation, a machine learning model using only prehospital variables to predict MT and stratify mortality risk was developed and externally validated.</p><p><strong>Methods: </strong>Data from the Korean Trauma Data Bank from 19 trauma centres (2017-22) was used for model development and internal validation, with 2023 data for patients from four additional centres used for external validation. Trauma cases were identified using S or T codes from the Korean Classification of Diseases, 7th edition. MT was defined as ≥ 5 units packed red blood cells within 4 hours or ≥ 10 units within 24 hours. Machine learning models were trained using 21 prehospital variables, with a final ensemble model constructed from the top-performing algorithms. Model interpretability was assessed using Shapley additive explanations (SHAP), and the association between predicted probability tertiles (T1-T3) and in-hospital mortality was evaluated using logistic regression.</p><p><strong>Results: </strong>In all, 227 567 patients were included in the development cohort and internal validation cohort, with 8867 patients in the external validation cohort. The soft-voting ensemble model, combining random forest and AdaBoost, showed high predictive performance, with area under the receiver operating characteristic curve values of 0.837 (internal validation) and 0.837 (external validation). SHAP analysis identified accident type as the most influential predictor, followed by consciousness level, and circulatory assistance. Higher model probability was associated with increased in-hospital mortality (adjusted odds ratios (95% confidence intervals) 2.34 (2.16 to 2.55), 2.70 (2.49 to 2.92), and 3.53 (3.25 to 3.83) for T1, T2, and T3, respectively).</p><p><strong>Conclusion: </strong>A prehospital ensemble learning model to predict MT was developed and validated, and its predictions were significantly associated with in-hospital mortality. However, this study is limited by the inclusion of a single ethnicity, and future research needs to integrate data from multiple populations to enhance generalizability.</p>","PeriodicalId":9028,"journal":{"name":"BJS Open","volume":"10 1","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12914466/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BJS Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/bjsopen/zraf167","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Early triage for massive transfusion (MT) is essential in trauma care but most existing scoring systems rely on in-hospital data. To address this limitation, a machine learning model using only prehospital variables to predict MT and stratify mortality risk was developed and externally validated.

Methods: Data from the Korean Trauma Data Bank from 19 trauma centres (2017-22) was used for model development and internal validation, with 2023 data for patients from four additional centres used for external validation. Trauma cases were identified using S or T codes from the Korean Classification of Diseases, 7th edition. MT was defined as ≥ 5 units packed red blood cells within 4 hours or ≥ 10 units within 24 hours. Machine learning models were trained using 21 prehospital variables, with a final ensemble model constructed from the top-performing algorithms. Model interpretability was assessed using Shapley additive explanations (SHAP), and the association between predicted probability tertiles (T1-T3) and in-hospital mortality was evaluated using logistic regression.

Results: In all, 227 567 patients were included in the development cohort and internal validation cohort, with 8867 patients in the external validation cohort. The soft-voting ensemble model, combining random forest and AdaBoost, showed high predictive performance, with area under the receiver operating characteristic curve values of 0.837 (internal validation) and 0.837 (external validation). SHAP analysis identified accident type as the most influential predictor, followed by consciousness level, and circulatory assistance. Higher model probability was associated with increased in-hospital mortality (adjusted odds ratios (95% confidence intervals) 2.34 (2.16 to 2.55), 2.70 (2.49 to 2.92), and 3.53 (3.25 to 3.83) for T1, T2, and T3, respectively).

Conclusion: A prehospital ensemble learning model to predict MT was developed and validated, and its predictions were significantly associated with in-hospital mortality. However, this study is limited by the inclusion of a single ethnicity, and future research needs to integrate data from multiple populations to enhance generalizability.

创伤中大量输血的现场机器学习预测模型及其与住院死亡率的关系。
背景:大量输血(MT)的早期分诊在创伤护理中至关重要,但大多数现有的评分系统依赖于医院内的数据。为了解决这一局限性,我们开发了一个仅使用院前变量来预测MT和分层死亡风险的机器学习模型,并进行了外部验证。方法:来自韩国创伤数据库的19个创伤中心(2017-22年)的数据用于模型开发和内部验证,另外4个中心的2023例患者数据用于外部验证。使用韩国疾病分类第7版中的S或T代码识别创伤病例。MT定义为4小时内≥5个单位或24小时内≥10个单位的红细胞。使用21个院前变量训练机器学习模型,并使用性能最好的算法构建最终的集成模型。采用Shapley加性解释(SHAP)评估模型可解释性,采用logistic回归评估预测概率三位数(T1-T3)与住院死亡率之间的关系。结果:共有227567例患者被纳入开发队列和内部验证队列,8867例患者被纳入外部验证队列。结合随机森林和AdaBoost的软投票集成模型具有较高的预测性能,其接收者工作特征曲线下面积分别为0.837(内部验证)和0.837(外部验证)。SHAP分析发现事故类型是影响最大的预测因素,其次是意识水平和循环辅助。较高的模型概率与住院死亡率增加相关(T1、T2和T3的校正比值比(95%置信区间)分别为2.34(2.16 ~ 2.55)、2.70(2.49 ~ 2.92)和3.53(3.25 ~ 3.83))。结论:建立并验证了院前集成学习模型预测MT,其预测结果与院内死亡率显著相关。然而,这项研究受到单一种族的限制,未来的研究需要整合来自多个人群的数据,以提高普遍性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BJS Open
BJS Open SURGERY-
CiteScore
6.00
自引率
3.20%
发文量
144
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书