An Explainable Two-Stage Machine Learning Model for Predicting the Post-Thrombolysis Complications in Stroke Patients: A Multi-Center Study.

IF 10.7 1区 综合性期刊 Q1 Multidisciplinary
Research Pub Date : 2025-08-19 eCollection Date: 2025-01-01 DOI:10.34133/research.0817
Hongling Zhu, Qing Ye, Shurui Wang, Hongsen Cai, Mairihaba Maimaiti, Jinsheng Lai, Chuan Qin, Ping Zhang, Yanyan Chen, Qiushi Luo, Hong Wu, Danyang Chen, Shiling Chen, Shudan Zhu, Yuting Lv, Yanxiang Xu, Jian Zhang, Benshan Hu, Yuanxiang Yin, Yan Xie, Dongmei Zhu, Xiaoxing Ming, Zhouping Tang, Hesong Zeng
{"title":"An Explainable Two-Stage Machine Learning Model for Predicting the Post-Thrombolysis Complications in Stroke Patients: A Multi-Center Study.","authors":"Hongling Zhu, Qing Ye, Shurui Wang, Hongsen Cai, Mairihaba Maimaiti, Jinsheng Lai, Chuan Qin, Ping Zhang, Yanyan Chen, Qiushi Luo, Hong Wu, Danyang Chen, Shiling Chen, Shudan Zhu, Yuting Lv, Yanxiang Xu, Jian Zhang, Benshan Hu, Yuanxiang Yin, Yan Xie, Dongmei Zhu, Xiaoxing Ming, Zhouping Tang, Hesong Zeng","doi":"10.34133/research.0817","DOIUrl":null,"url":null,"abstract":"<p><p>Current tools for predicting the thrombolysis risk in patients after stroke exhibit limited event prediction in early post-thrombolysis hemorrhagic events. This highlights an unmet medical need to improve the tools for stroke management. We developed an explainable 2-stage machine learning model for stroke risk stratification to predict the risk of bleeding, composite complications, and all-cause death in patients before and after thrombolysis therapy. The model integrated LightGBM, XGBoost, random forest model (RF), decision tree model (DT), and logistic regression model (LR), and was trained on data from 5,333 patients from Tongji Hospital, achieving improved predictive accuracy in the post-thrombolysis stage compared to the pre-thrombolysis stage. The model exhibited increased area under the curve (AUC) of 0.7581 [95% confidence interval (CI), 0.6955 to 0.8177] and 0.7234 (0.6527 to 0.7909) (bleeding), 0.7625 (0.7324 to 0.7936) and 0.7035 (0.6685 to 0.7392) (composite complications), and 0.9264 (0.8736 to 0.9660) and 0.845 (0.7454 to 0.9375) (death) in post-thrombolysis stage than in pre-thrombolysis stage. External validation using data of 526 patients across 2 different hospitals confirmed the robustness of the model. Key predictors such as temperature, vital signs, and demographic factors were identified. A prototype embedding the best-performing model was constructed. This model enhances thrombolysis risk prediction and supports personalized patient care management, demonstrating its potential for clinical decision support system integration into stroke management strategies.</p>","PeriodicalId":21120,"journal":{"name":"Research","volume":"8 ","pages":"0817"},"PeriodicalIF":10.7000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12364525/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.34133/research.0817","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0

Abstract

Current tools for predicting the thrombolysis risk in patients after stroke exhibit limited event prediction in early post-thrombolysis hemorrhagic events. This highlights an unmet medical need to improve the tools for stroke management. We developed an explainable 2-stage machine learning model for stroke risk stratification to predict the risk of bleeding, composite complications, and all-cause death in patients before and after thrombolysis therapy. The model integrated LightGBM, XGBoost, random forest model (RF), decision tree model (DT), and logistic regression model (LR), and was trained on data from 5,333 patients from Tongji Hospital, achieving improved predictive accuracy in the post-thrombolysis stage compared to the pre-thrombolysis stage. The model exhibited increased area under the curve (AUC) of 0.7581 [95% confidence interval (CI), 0.6955 to 0.8177] and 0.7234 (0.6527 to 0.7909) (bleeding), 0.7625 (0.7324 to 0.7936) and 0.7035 (0.6685 to 0.7392) (composite complications), and 0.9264 (0.8736 to 0.9660) and 0.845 (0.7454 to 0.9375) (death) in post-thrombolysis stage than in pre-thrombolysis stage. External validation using data of 526 patients across 2 different hospitals confirmed the robustness of the model. Key predictors such as temperature, vital signs, and demographic factors were identified. A prototype embedding the best-performing model was constructed. This model enhances thrombolysis risk prediction and supports personalized patient care management, demonstrating its potential for clinical decision support system integration into stroke management strategies.

预测脑卒中患者溶栓后并发症的可解释的两阶段机器学习模型:一项多中心研究
目前预测脑卒中患者溶栓风险的工具在早期溶栓后出血事件的预测中表现出有限的事件预测。这凸显了改进卒中管理工具的医疗需求尚未得到满足。我们开发了一个可解释的两阶段机器学习模型,用于卒中风险分层,以预测溶栓治疗前后患者出血、复合并发症和全因死亡的风险。该模型集成了LightGBM、XGBoost、随机森林模型(RF)、决策树模型(DT)和逻辑回归模型(LR),并对同济医院5333例患者的数据进行了训练,在溶栓后阶段的预测准确率高于溶栓前阶段。该模型的曲线下面积(AUC)分别为0.7581(95%可信区间(CI) 0.6955 ~ 0.8177)、0.7234(0.6527 ~ 0.7909)(出血)、0.7625(0.7324 ~ 0.7936)、0.7035(0.6685 ~ 0.7392)(复合并发症)和0.9264(0.8736 ~ 0.9660)、0.845(0.7454 ~ 0.9375)(死亡),高于溶栓前。使用来自两家不同医院的526名患者的数据进行外部验证,证实了模型的稳健性。确定了关键预测因素,如温度、生命体征和人口因素。构造了嵌入性能最佳模型的原型。该模型增强了溶栓风险预测,并支持个性化的患者护理管理,显示了将临床决策支持系统集成到卒中管理策略中的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Research
Research Multidisciplinary-Multidisciplinary
CiteScore
13.40
自引率
3.60%
发文量
0
审稿时长
14 weeks
期刊介绍: Research serves as a global platform for academic exchange, collaboration, and technological advancements. This journal welcomes high-quality research contributions from any domain, with open arms to authors from around the globe. Comprising fundamental research in the life and physical sciences, Research also highlights significant findings and issues in engineering and applied science. The journal proudly features original research articles, reviews, perspectives, and editorials, fostering a diverse and dynamic scholarly environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信