Shuai Jin , Dan Qin , Chong Wang , Baosheng Liang , Lichuan Zhang , Weiyin Gao , Xiao Wang , Bo Jiang , Benqiang Rao , Hanping Shi , Lihui Liu , Qian Lu
{"title":"癌症相关静脉血栓栓塞风险预测模型的发展、验证和临床应用:一项回顾性和前瞻性队列研究","authors":"Shuai Jin , Dan Qin , Chong Wang , Baosheng Liang , Lichuan Zhang , Weiyin Gao , Xiao Wang , Bo Jiang , Benqiang Rao , Hanping Shi , Lihui Liu , Qian Lu","doi":"10.1016/j.apjon.2025.100691","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>This study aims to develop cancer-associated venous thromboembolism (CA-VTE) risk prediction models using survival machine learning (ML) algorithms.</div></div><div><h3>Methods</h3><div>This study employed a double-cohort study design (retrospective and prospective). The retrospective cohort (<em>n</em> = 1036) was used as training set (70.0%, <em>n</em> = 725) and internal validation set (30.0%, <em>n</em> = 311); while the prospective cohort (<em>n</em> = 321) was used as external validation set. Seven survival ML algorithms, including COX regression, classification, regression and survival tree, random survival forest, gradient boosting survival machine tree, extreme gradient boosting survival tree, survival support vector analysis, and survival artificial neural network, were applied to train CA-VTE models.</div></div><div><h3>Results</h3><div>Univariate analysis and LASSO-COX regression both selected five predictors: age, previous VTE history, ICU/CCU, CCI, and D-dimer. The seven survival ML models (C-index: 0.709–0.760; Brier Score: 0.212–0.243) all outperformed Khorana Score (C-index: 0.632; Brier Score: 0.260) in external validation set. Among all models, the COX_DD model (COX regression + D-dimer) performed best. However, ML models and Khorana Score predicted CA-VTE risk on <span><math><mrow><mo>≥</mo></mrow></math></span> 7 days of hospitalization with an increase in Brier Score <span><math><mrow><mo>≥</mo></mrow></math></span> 0.25, showing poor calibration.</div></div><div><h3>Conclusions</h3><div>In this study, the CA-VTE risk prediction models developed in seven survival ML algorithms outperformed Khorana Score. Combining with D-dimer can improve model performance. Applying the nomogram based on the optimal COX_DD model allows oncology nurse to reassess CA-VTE risk once a week. The prediction models developed using survival ML algorithms in this study may contribute to the dynamic and accurate risk assessment of CA-VTE for cancer survivors.</div></div>","PeriodicalId":8569,"journal":{"name":"Asia-Pacific Journal of Oncology Nursing","volume":"12 ","pages":"Article 100691"},"PeriodicalIF":2.4000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development, validation, and clinical utility of risk prediction models for cancer-associated venous thromboembolism: A retrospective and prospective cohort study\",\"authors\":\"Shuai Jin , Dan Qin , Chong Wang , Baosheng Liang , Lichuan Zhang , Weiyin Gao , Xiao Wang , Bo Jiang , Benqiang Rao , Hanping Shi , Lihui Liu , Qian Lu\",\"doi\":\"10.1016/j.apjon.2025.100691\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objectives</h3><div>This study aims to develop cancer-associated venous thromboembolism (CA-VTE) risk prediction models using survival machine learning (ML) algorithms.</div></div><div><h3>Methods</h3><div>This study employed a double-cohort study design (retrospective and prospective). The retrospective cohort (<em>n</em> = 1036) was used as training set (70.0%, <em>n</em> = 725) and internal validation set (30.0%, <em>n</em> = 311); while the prospective cohort (<em>n</em> = 321) was used as external validation set. Seven survival ML algorithms, including COX regression, classification, regression and survival tree, random survival forest, gradient boosting survival machine tree, extreme gradient boosting survival tree, survival support vector analysis, and survival artificial neural network, were applied to train CA-VTE models.</div></div><div><h3>Results</h3><div>Univariate analysis and LASSO-COX regression both selected five predictors: age, previous VTE history, ICU/CCU, CCI, and D-dimer. The seven survival ML models (C-index: 0.709–0.760; Brier Score: 0.212–0.243) all outperformed Khorana Score (C-index: 0.632; Brier Score: 0.260) in external validation set. Among all models, the COX_DD model (COX regression + D-dimer) performed best. However, ML models and Khorana Score predicted CA-VTE risk on <span><math><mrow><mo>≥</mo></mrow></math></span> 7 days of hospitalization with an increase in Brier Score <span><math><mrow><mo>≥</mo></mrow></math></span> 0.25, showing poor calibration.</div></div><div><h3>Conclusions</h3><div>In this study, the CA-VTE risk prediction models developed in seven survival ML algorithms outperformed Khorana Score. Combining with D-dimer can improve model performance. Applying the nomogram based on the optimal COX_DD model allows oncology nurse to reassess CA-VTE risk once a week. The prediction models developed using survival ML algorithms in this study may contribute to the dynamic and accurate risk assessment of CA-VTE for cancer survivors.</div></div>\",\"PeriodicalId\":8569,\"journal\":{\"name\":\"Asia-Pacific Journal of Oncology Nursing\",\"volume\":\"12 \",\"pages\":\"Article 100691\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-03-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asia-Pacific Journal of Oncology Nursing\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2347562525000393\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NURSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asia-Pacific Journal of Oncology Nursing","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2347562525000393","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NURSING","Score":null,"Total":0}
Development, validation, and clinical utility of risk prediction models for cancer-associated venous thromboembolism: A retrospective and prospective cohort study
Objectives
This study aims to develop cancer-associated venous thromboembolism (CA-VTE) risk prediction models using survival machine learning (ML) algorithms.
Methods
This study employed a double-cohort study design (retrospective and prospective). The retrospective cohort (n = 1036) was used as training set (70.0%, n = 725) and internal validation set (30.0%, n = 311); while the prospective cohort (n = 321) was used as external validation set. Seven survival ML algorithms, including COX regression, classification, regression and survival tree, random survival forest, gradient boosting survival machine tree, extreme gradient boosting survival tree, survival support vector analysis, and survival artificial neural network, were applied to train CA-VTE models.
Results
Univariate analysis and LASSO-COX regression both selected five predictors: age, previous VTE history, ICU/CCU, CCI, and D-dimer. The seven survival ML models (C-index: 0.709–0.760; Brier Score: 0.212–0.243) all outperformed Khorana Score (C-index: 0.632; Brier Score: 0.260) in external validation set. Among all models, the COX_DD model (COX regression + D-dimer) performed best. However, ML models and Khorana Score predicted CA-VTE risk on 7 days of hospitalization with an increase in Brier Score 0.25, showing poor calibration.
Conclusions
In this study, the CA-VTE risk prediction models developed in seven survival ML algorithms outperformed Khorana Score. Combining with D-dimer can improve model performance. Applying the nomogram based on the optimal COX_DD model allows oncology nurse to reassess CA-VTE risk once a week. The prediction models developed using survival ML algorithms in this study may contribute to the dynamic and accurate risk assessment of CA-VTE for cancer survivors.