Enhancing prediction and stratifying risk: machine learning and bayesian-learning models for catheter-related thrombosis in chemotherapy patients.

IF 3.4 2区 医学 Q2 ONCOLOGY
Tao An, Han Han, Junying Xie, Yifan Wang, Yiqi Zhao, Hao Jia, Yanfeng Wang
{"title":"Enhancing prediction and stratifying risk: machine learning and bayesian-learning models for catheter-related thrombosis in chemotherapy patients.","authors":"Tao An, Han Han, Junying Xie, Yifan Wang, Yiqi Zhao, Hao Jia, Yanfeng Wang","doi":"10.1186/s12885-025-13946-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Catheter-related thrombosis (CRT) is a serious complication in cancer patients undergoing chemotherapy, yet existing risk prediction models demonstrate limited accuracy. This study aimed to evaluate the clinical utility of machine learning (ML) and Bayesian-learning models for CRT prediction in a large cohort of breast cancer patients undergoing catheterization.</p><p><strong>Methods: </strong>A total of 3337 breast cancer patients with central venous catheters (Cohort 1) were included to develop and test ML models. Given the suboptimal clinical feasibility of ML models, the Bayesian-learning model was constructed using odds ratio analysis and Gaussian distribution. The hazard ratio for the high-risk and low-risk groups was calculated using Cox proportional hazards regression analysis, and the model was validated in an independent cohort of 1274 patients (Cohort 2).</p><p><strong>Results: </strong>In Cohort 1, 246 patients (7.37%) developed CRT. Among the eight ML algorithms tested, WeightedEnsemble model exhibited relatively stable performance, achieving area under the receiver operating characteristic curves of 0.89 in the training set and 0.69 in the test set. WeightedEnsemble improved generalization by integrating multiple base models. The odds ratio analysis and Bayesian-learning modeling identified 4 independent risk factors: hemoglobin (threshold point [TP]: 134.63 g/L), activated partial thromboplastin time (TP: 31.71 s), total cholesterol (TP: 11.19 mmol/L), and catheterization approach (TP: peripherally inserted central catheters). A simplified risk stratification system was developed, categorizing patients into low-risk (0-1 factors) and high-risk (2-4 factors) groups. This system exhibited strong CRT risk discriminative ability, as confirmed through survival analysis (P < 0.001 in both cohorts). In Cohort 1, cox regression analysis showed that the high-risk group had hazard ratio (HR) of 1.60 (95% confidence interval [CI], 1.15-2.22) for both catheter indwelling time and catheter use duration. In Cohort 2, the system maintained stable discriminative ability, with an HR of 5.63 (95% CI, 3.46-9.21) for catheter indwelling time and 5.62 (95% CI, 3.46-9.12) for catheter use duration.</p><p><strong>Conclusions: </strong>While ML models demonstrated high predictive performance, their clinical applicability was limited due to complexity. The Bayesian-learning-based risk stratification model provided a simplified yet robust alternative, effectively predicting CRT risk and offering a clinically feasible tool for risk assessment in breast cancer patients with chemotherapy. Further validation in diverse cancer populations is warranted to refine its generalizability.</p>","PeriodicalId":9131,"journal":{"name":"BMC Cancer","volume":"25 1","pages":"552"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12885-025-13946-y","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Catheter-related thrombosis (CRT) is a serious complication in cancer patients undergoing chemotherapy, yet existing risk prediction models demonstrate limited accuracy. This study aimed to evaluate the clinical utility of machine learning (ML) and Bayesian-learning models for CRT prediction in a large cohort of breast cancer patients undergoing catheterization.

Methods: A total of 3337 breast cancer patients with central venous catheters (Cohort 1) were included to develop and test ML models. Given the suboptimal clinical feasibility of ML models, the Bayesian-learning model was constructed using odds ratio analysis and Gaussian distribution. The hazard ratio for the high-risk and low-risk groups was calculated using Cox proportional hazards regression analysis, and the model was validated in an independent cohort of 1274 patients (Cohort 2).

Results: In Cohort 1, 246 patients (7.37%) developed CRT. Among the eight ML algorithms tested, WeightedEnsemble model exhibited relatively stable performance, achieving area under the receiver operating characteristic curves of 0.89 in the training set and 0.69 in the test set. WeightedEnsemble improved generalization by integrating multiple base models. The odds ratio analysis and Bayesian-learning modeling identified 4 independent risk factors: hemoglobin (threshold point [TP]: 134.63 g/L), activated partial thromboplastin time (TP: 31.71 s), total cholesterol (TP: 11.19 mmol/L), and catheterization approach (TP: peripherally inserted central catheters). A simplified risk stratification system was developed, categorizing patients into low-risk (0-1 factors) and high-risk (2-4 factors) groups. This system exhibited strong CRT risk discriminative ability, as confirmed through survival analysis (P < 0.001 in both cohorts). In Cohort 1, cox regression analysis showed that the high-risk group had hazard ratio (HR) of 1.60 (95% confidence interval [CI], 1.15-2.22) for both catheter indwelling time and catheter use duration. In Cohort 2, the system maintained stable discriminative ability, with an HR of 5.63 (95% CI, 3.46-9.21) for catheter indwelling time and 5.62 (95% CI, 3.46-9.12) for catheter use duration.

Conclusions: While ML models demonstrated high predictive performance, their clinical applicability was limited due to complexity. The Bayesian-learning-based risk stratification model provided a simplified yet robust alternative, effectively predicting CRT risk and offering a clinically feasible tool for risk assessment in breast cancer patients with chemotherapy. Further validation in diverse cancer populations is warranted to refine its generalizability.

背景:导管相关血栓形成(CRT)是接受化疗的癌症患者的一种严重并发症,但现有的风险预测模型显示出有限的准确性。本研究旨在评估机器学习(ML)和贝叶斯学习模型在接受导管治疗的大批乳腺癌患者中预测 CRT 的临床实用性:共纳入了 3337 名使用中心静脉导管的乳腺癌患者(队列 1),以开发和测试 ML 模型。鉴于 ML 模型的临床可行性不理想,我们使用几率比分析和高斯分布构建了贝叶斯学习模型。使用 Cox 比例危险回归分析法计算了高风险组和低风险组的危险比,并在一个由 1274 名患者组成的独立队列(队列 2)中对该模型进行了验证:在队列 1 中,有 246 名患者(7.37%)发展为 CRT。在测试的八种 ML 算法中,WeightedEnsemble 模型表现出相对稳定的性能,在训练集和测试集的接收者操作特征曲线下面积分别为 0.89 和 0.69。加权集合模型通过整合多个基础模型提高了泛化能力。几率比分析和贝叶斯学习模型确定了 4 个独立的风险因素:血红蛋白(阈值点 [TP]:134.63 g/L)、活化部分凝血活酶时间(TP:31.71 s)、总胆固醇(TP:11.19 mmol/L)和导管插入方式(TP:外周插入中心导管)。我们开发了一套简化的风险分层系统,将患者分为低风险组(0-1 个因素)和高风险组(2-4 个因素)。通过生存分析证实,该系统具有很强的 CRT 风险判别能力(P 结论):虽然 ML 模型具有很高的预测性能,但由于其复杂性,其临床适用性受到了限制。基于贝叶斯学习的风险分层模型提供了一种简化但稳健的替代方法,它能有效预测 CRT 风险,并为乳腺癌化疗患者的风险评估提供了一种临床可行的工具。该模型需要在不同的癌症人群中进一步验证,以完善其通用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Cancer
BMC Cancer 医学-肿瘤学
CiteScore
6.00
自引率
2.60%
发文量
1204
审稿时长
6.8 months
期刊介绍: BMC Cancer is an open access, peer-reviewed journal that considers articles on all aspects of cancer research, including the pathophysiology, prevention, diagnosis and treatment of cancers. The journal welcomes submissions concerning molecular and cellular biology, genetics, epidemiology, and clinical trials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信