Ruoyu Zhang , Zengshuai Wang , Min Yang , Bo Chen , Mei Liu , Minhua Zheng , Peter Xiaoping Liu , Liming Wang
{"title":"Combining traditional analysis and machine learning to predict early, middle, and long-term recurrence of intrahepatic cholangiocarcinoma","authors":"Ruoyu Zhang , Zengshuai Wang , Min Yang , Bo Chen , Mei Liu , Minhua Zheng , Peter Xiaoping Liu , Liming Wang","doi":"10.1016/j.ejso.2025.110141","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Intrahepatic cholangiocarcinoma (ICC) is a rare and highly aggressive cancer. Few patients are eligible for radical surgery, and most face the high risk of recurrence.</div></div><div><h3>Methods</h3><div>We developed early-, middle- and long-term (1-, 2-, and 3-year) ICC disease-free survival (DFS) prediction models using traditional Logistic analysis combined with machine learning (ML) and systematically compared the performance of traditional analysis and MLs.</div></div><div><h3>Results</h3><div>275, 256, and 238 ICC patients under radical surgery were included in the 1-, 2-, and 3-year DFS groups respectively. Five-fold cross-validation results demonstrated that both traditional Logistics and ML models exhibited remarkable robustness. MLs outperformed traditional Logistic models for DFS prediction across the AUC, accuracy and F1-scores. Specifically, the average AUC of training cohorts for the ML models were 0.878, 0.897 and 0.917 in 3 groups, compared to 0.657 (P < 0.001), 0.817 (P = 0.05), and 0.798 (P = 0.005) in traditional models. The average AUCs of testing cohorts for ML models were 0.831, 0.768, 0.803 in ML models in 3 groups, compared to 0.619 (P < 0.001), 0.719 (P = 0.008), 0.698 (P < 0.001) in traditional models. SHAP analysis identified lymph node metastasis played significant role in all-round recurrence, T stage and neural invasion had strong correction with middle and long-term recurrence in ICC patients.</div></div><div><h3>Conclusion</h3><div>Models with high predictive efficiency across early, middle, and long-term recurrence have been successfully built. ML models outperformed Logistic models for DFS prediction in ICC patients. This study suggests new possibilities for advancing statistical analysis software, such as SPSS and Stata, through ML integration.</div></div>","PeriodicalId":11522,"journal":{"name":"Ejso","volume":"51 9","pages":"Article 110141"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ejso","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0748798325005694","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Intrahepatic cholangiocarcinoma (ICC) is a rare and highly aggressive cancer. Few patients are eligible for radical surgery, and most face the high risk of recurrence.
Methods
We developed early-, middle- and long-term (1-, 2-, and 3-year) ICC disease-free survival (DFS) prediction models using traditional Logistic analysis combined with machine learning (ML) and systematically compared the performance of traditional analysis and MLs.
Results
275, 256, and 238 ICC patients under radical surgery were included in the 1-, 2-, and 3-year DFS groups respectively. Five-fold cross-validation results demonstrated that both traditional Logistics and ML models exhibited remarkable robustness. MLs outperformed traditional Logistic models for DFS prediction across the AUC, accuracy and F1-scores. Specifically, the average AUC of training cohorts for the ML models were 0.878, 0.897 and 0.917 in 3 groups, compared to 0.657 (P < 0.001), 0.817 (P = 0.05), and 0.798 (P = 0.005) in traditional models. The average AUCs of testing cohorts for ML models were 0.831, 0.768, 0.803 in ML models in 3 groups, compared to 0.619 (P < 0.001), 0.719 (P = 0.008), 0.698 (P < 0.001) in traditional models. SHAP analysis identified lymph node metastasis played significant role in all-round recurrence, T stage and neural invasion had strong correction with middle and long-term recurrence in ICC patients.
Conclusion
Models with high predictive efficiency across early, middle, and long-term recurrence have been successfully built. ML models outperformed Logistic models for DFS prediction in ICC patients. This study suggests new possibilities for advancing statistical analysis software, such as SPSS and Stata, through ML integration.
期刊介绍:
JSO - European Journal of Surgical Oncology ("the Journal of Cancer Surgery") is the Official Journal of the European Society of Surgical Oncology and BASO ~ the Association for Cancer Surgery.
The EJSO aims to advance surgical oncology research and practice through the publication of original research articles, review articles, editorials, debates and correspondence.