Development and validation of machine-learning model based on dynamic tumor markers in predicting pathological complete response after neoadjuvant chemoradiotherapy in patients with locally advanced rectal cancer: a multicenter cohort study.
{"title":"Development and validation of machine-learning model based on dynamic tumor markers in predicting pathological complete response after neoadjuvant chemoradiotherapy in patients with locally advanced rectal cancer: a multicenter cohort study.","authors":"Bin Chen, Tengyi Peng, Zhen Pan, Shoufeng Li, Ye Wang, Shaoqing Zheng, Jinfu Zhuang, Xing Liu, Xingrong Lu, Changqing Zeng, Guoxian Guan","doi":"10.1007/s00384-025-04993-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>In this study, we constructed a new pCR predictor based on dynamic tumor marker changes before and after NCRT, the dynamic tumor marker score (DTMS), and combined it with other clinicopathological features to build a machine-learning model.</p><p><strong>Methods: </strong>In this retrospective study of patients with LARC between September 2010 and October 2017 at The First Affiliated Hospital of Fujian Medical University (FJMUFAH), Fujian Medical University Union Hospital (FJMUUH), and Fujian Provincial Hospital (FJPH), the DTMS predictor was constructed using logistic regression. Factors associated with pCR were screened using single-factor and multifactorial logistic regression, and 10 machine-learning algorithms were used to construct a pCR prediction model. Additionally, various metrics, including the area under the receiver operating characteristic curve (AUC), area under the precision-recall curve (AUPRC), decision curve analysis, and calibration curves, were obtained to validate the model performance and verified using an external validation set. Finally, SHapley Additive exPlanations (SHAP) values were used to interpret the predictive model. Moreover, we developed a website to facilitate the use of prediction modeling.</p><p><strong>Results: </strong>After analyzing the data of 892 patients with LARC from FJMUFAH, DTMS, tumor size, N stage, and tumor distance from the anal verge were identified as independent predictive factors for pCR using univariate and multivariate regression analyses. The \"extreme gradient boosting\" (XGB) model displayed the best performance in the training set, with a mean AUC value of 0.86, an AUPRC value of 0.732, and SHAP values utilized in the analysis. In the two external validation sets, the model yielded AUC values of 0.80 and 0.82, along with corresponding AUPRC values of 0.519 and 0.593, respectively, which were the highest among all ten evaluated models, incorporating the use of SHAP values in the analysis. The model maintained superior predictive efficacy in the external validation cohorts (FJMUUH and FJPH).</p><p><strong>Conclusions: </strong>As a novel marker based on dynamic changes in CEA and CA19-9 levels, DTMS effectively predicted pCR within the XGB model, providing clinicians with a practical tool for treatment decision-making regarding LARC.</p>","PeriodicalId":13789,"journal":{"name":"International Journal of Colorectal Disease","volume":"40 1","pages":"204"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12474689/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Colorectal Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00384-025-04993-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: In this study, we constructed a new pCR predictor based on dynamic tumor marker changes before and after NCRT, the dynamic tumor marker score (DTMS), and combined it with other clinicopathological features to build a machine-learning model.
Methods: In this retrospective study of patients with LARC between September 2010 and October 2017 at The First Affiliated Hospital of Fujian Medical University (FJMUFAH), Fujian Medical University Union Hospital (FJMUUH), and Fujian Provincial Hospital (FJPH), the DTMS predictor was constructed using logistic regression. Factors associated with pCR were screened using single-factor and multifactorial logistic regression, and 10 machine-learning algorithms were used to construct a pCR prediction model. Additionally, various metrics, including the area under the receiver operating characteristic curve (AUC), area under the precision-recall curve (AUPRC), decision curve analysis, and calibration curves, were obtained to validate the model performance and verified using an external validation set. Finally, SHapley Additive exPlanations (SHAP) values were used to interpret the predictive model. Moreover, we developed a website to facilitate the use of prediction modeling.
Results: After analyzing the data of 892 patients with LARC from FJMUFAH, DTMS, tumor size, N stage, and tumor distance from the anal verge were identified as independent predictive factors for pCR using univariate and multivariate regression analyses. The "extreme gradient boosting" (XGB) model displayed the best performance in the training set, with a mean AUC value of 0.86, an AUPRC value of 0.732, and SHAP values utilized in the analysis. In the two external validation sets, the model yielded AUC values of 0.80 and 0.82, along with corresponding AUPRC values of 0.519 and 0.593, respectively, which were the highest among all ten evaluated models, incorporating the use of SHAP values in the analysis. The model maintained superior predictive efficacy in the external validation cohorts (FJMUUH and FJPH).
Conclusions: As a novel marker based on dynamic changes in CEA and CA19-9 levels, DTMS effectively predicted pCR within the XGB model, providing clinicians with a practical tool for treatment decision-making regarding LARC.
期刊介绍:
The International Journal of Colorectal Disease, Clinical and Molecular Gastroenterology and Surgery aims to publish novel and state-of-the-art papers which deal with the physiology and pathophysiology of diseases involving the entire gastrointestinal tract. In addition to original research articles, the following categories will be included: reviews (usually commissioned but may also be submitted), case reports, letters to the editor, and protocols on clinical studies.
The journal offers its readers an interdisciplinary forum for clinical science and molecular research related to gastrointestinal disease.