{"title":"老年膀胱癌根治性膀胱切除术后癌症特异性死亡率的增强预后预测:一项XGBoost模型研究","authors":"Gaowei Li, Kang Xia","doi":"10.21037/tcr-24-2023","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Tumor stage, surgery and age are positively correlated with cancer-specific mortality (CSM) in patients diagnosed with bladder cancer (BCa). In light of the successful application of machine learning to process big data in many fields outside of medicine, we aimed to establish and validate whether machine learning models could improve our ability to predict the development of CSM in elderly BCa patients after radical cystectomy (RC).</p><p><strong>Methods: </strong>Data on eligible patients diagnosed with BCa were obtained from the Surveillance, Epidemiology, and End Results database (2000-2021) and divided into training and validation cohorts in a ratio of 7:3. First, risk factors for the development of CSM in patients were identified by Cox regression analysis. Then, iterative testing and tuning through automated hyperparameter optimization and ten-fold cross-validation were performed to generate stable extreme gradient boosting (XGBoost) models with optimal performance. Receiver operating characteristic (ROC) curve, area under the curve (AUC), calibration curve and confusion matrix were used to evaluate the performance of XGBoost model.</p><p><strong>Results: </strong>There were 11,763 patients included, of which 5,788 died from BCa. By the comparison of different machine learning models, the final XGBoost model we constructed showed high accuracy and precision in predicting the development of CSM in BCa patients (6-month CSM: AUC =0.799, 12-month CSM: AUC =0.756, 36-month CSM: AUC =0.746, and 60-month CSM: AUC =0.745). The results of accuracy, precision, recall and F1 score confirmed the superior performance of the XGBoost model. The important scores for clinical characteristics and the Shapley Additive Explanations plots highlighted the importance of key factors: chemotherapy, tumor stage, marital status, and tumor size were the top four factors in all models.</p><p><strong>Conclusions: </strong>Our study validated and confirmed the feasibility and high performance of the XGBoost model in predicting CSM in elderly BCa patients after RC. The potential of machine learning contributes to accurately predict the prognosis of cancer.</p>","PeriodicalId":23216,"journal":{"name":"Translational cancer research","volume":"14 3","pages":"1902-1914"},"PeriodicalIF":1.5000,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11985172/pdf/","citationCount":"0","resultStr":"{\"title\":\"Enhanced prognostic prediction of cancer-specific mortality in elderly bladder cancer patients post-radical cystectomy: an XGBoost model study.\",\"authors\":\"Gaowei Li, Kang Xia\",\"doi\":\"10.21037/tcr-24-2023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Tumor stage, surgery and age are positively correlated with cancer-specific mortality (CSM) in patients diagnosed with bladder cancer (BCa). In light of the successful application of machine learning to process big data in many fields outside of medicine, we aimed to establish and validate whether machine learning models could improve our ability to predict the development of CSM in elderly BCa patients after radical cystectomy (RC).</p><p><strong>Methods: </strong>Data on eligible patients diagnosed with BCa were obtained from the Surveillance, Epidemiology, and End Results database (2000-2021) and divided into training and validation cohorts in a ratio of 7:3. First, risk factors for the development of CSM in patients were identified by Cox regression analysis. Then, iterative testing and tuning through automated hyperparameter optimization and ten-fold cross-validation were performed to generate stable extreme gradient boosting (XGBoost) models with optimal performance. Receiver operating characteristic (ROC) curve, area under the curve (AUC), calibration curve and confusion matrix were used to evaluate the performance of XGBoost model.</p><p><strong>Results: </strong>There were 11,763 patients included, of which 5,788 died from BCa. By the comparison of different machine learning models, the final XGBoost model we constructed showed high accuracy and precision in predicting the development of CSM in BCa patients (6-month CSM: AUC =0.799, 12-month CSM: AUC =0.756, 36-month CSM: AUC =0.746, and 60-month CSM: AUC =0.745). The results of accuracy, precision, recall and F1 score confirmed the superior performance of the XGBoost model. The important scores for clinical characteristics and the Shapley Additive Explanations plots highlighted the importance of key factors: chemotherapy, tumor stage, marital status, and tumor size were the top four factors in all models.</p><p><strong>Conclusions: </strong>Our study validated and confirmed the feasibility and high performance of the XGBoost model in predicting CSM in elderly BCa patients after RC. The potential of machine learning contributes to accurately predict the prognosis of cancer.</p>\",\"PeriodicalId\":23216,\"journal\":{\"name\":\"Translational cancer research\",\"volume\":\"14 3\",\"pages\":\"1902-1914\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11985172/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational cancer research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/tcr-24-2023\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/27 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tcr-24-2023","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/27 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ONCOLOGY","Score":null,"Total":0}
Enhanced prognostic prediction of cancer-specific mortality in elderly bladder cancer patients post-radical cystectomy: an XGBoost model study.
Background: Tumor stage, surgery and age are positively correlated with cancer-specific mortality (CSM) in patients diagnosed with bladder cancer (BCa). In light of the successful application of machine learning to process big data in many fields outside of medicine, we aimed to establish and validate whether machine learning models could improve our ability to predict the development of CSM in elderly BCa patients after radical cystectomy (RC).
Methods: Data on eligible patients diagnosed with BCa were obtained from the Surveillance, Epidemiology, and End Results database (2000-2021) and divided into training and validation cohorts in a ratio of 7:3. First, risk factors for the development of CSM in patients were identified by Cox regression analysis. Then, iterative testing and tuning through automated hyperparameter optimization and ten-fold cross-validation were performed to generate stable extreme gradient boosting (XGBoost) models with optimal performance. Receiver operating characteristic (ROC) curve, area under the curve (AUC), calibration curve and confusion matrix were used to evaluate the performance of XGBoost model.
Results: There were 11,763 patients included, of which 5,788 died from BCa. By the comparison of different machine learning models, the final XGBoost model we constructed showed high accuracy and precision in predicting the development of CSM in BCa patients (6-month CSM: AUC =0.799, 12-month CSM: AUC =0.756, 36-month CSM: AUC =0.746, and 60-month CSM: AUC =0.745). The results of accuracy, precision, recall and F1 score confirmed the superior performance of the XGBoost model. The important scores for clinical characteristics and the Shapley Additive Explanations plots highlighted the importance of key factors: chemotherapy, tumor stage, marital status, and tumor size were the top four factors in all models.
Conclusions: Our study validated and confirmed the feasibility and high performance of the XGBoost model in predicting CSM in elderly BCa patients after RC. The potential of machine learning contributes to accurately predict the prognosis of cancer.
期刊介绍:
Translational Cancer Research (Transl Cancer Res TCR; Print ISSN: 2218-676X; Online ISSN 2219-6803; http://tcr.amegroups.com/) is an Open Access, peer-reviewed journal, indexed in Science Citation Index Expanded (SCIE). TCR publishes laboratory studies of novel therapeutic interventions as well as clinical trials which evaluate new treatment paradigms for cancer; results of novel research investigations which bridge the laboratory and clinical settings including risk assessment, cellular and molecular characterization, prevention, detection, diagnosis and treatment of human cancers with the overall goal of improving the clinical care of cancer patients. The focus of TCR is original, peer-reviewed, science-based research that successfully advances clinical medicine toward the goal of improving patients'' quality of life. The editors and an international advisory group of scientists and clinician-scientists as well as other experts will hold TCR articles to the high-quality standards. We accept Original Articles as well as Review Articles, Editorials and Brief Articles.