Enhanced prognostic prediction of cancer-specific mortality in elderly bladder cancer patients post-radical cystectomy: an XGBoost model study.

IF 1.5 4区 医学 Q4 ONCOLOGY
Translational cancer research Pub Date : 2025-03-30 Epub Date: 2025-03-27 DOI:10.21037/tcr-24-2023
Gaowei Li, Kang Xia
{"title":"Enhanced prognostic prediction of cancer-specific mortality in elderly bladder cancer patients post-radical cystectomy: an XGBoost model study.","authors":"Gaowei Li, Kang Xia","doi":"10.21037/tcr-24-2023","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Tumor stage, surgery and age are positively correlated with cancer-specific mortality (CSM) in patients diagnosed with bladder cancer (BCa). In light of the successful application of machine learning to process big data in many fields outside of medicine, we aimed to establish and validate whether machine learning models could improve our ability to predict the development of CSM in elderly BCa patients after radical cystectomy (RC).</p><p><strong>Methods: </strong>Data on eligible patients diagnosed with BCa were obtained from the Surveillance, Epidemiology, and End Results database (2000-2021) and divided into training and validation cohorts in a ratio of 7:3. First, risk factors for the development of CSM in patients were identified by Cox regression analysis. Then, iterative testing and tuning through automated hyperparameter optimization and ten-fold cross-validation were performed to generate stable extreme gradient boosting (XGBoost) models with optimal performance. Receiver operating characteristic (ROC) curve, area under the curve (AUC), calibration curve and confusion matrix were used to evaluate the performance of XGBoost model.</p><p><strong>Results: </strong>There were 11,763 patients included, of which 5,788 died from BCa. By the comparison of different machine learning models, the final XGBoost model we constructed showed high accuracy and precision in predicting the development of CSM in BCa patients (6-month CSM: AUC =0.799, 12-month CSM: AUC =0.756, 36-month CSM: AUC =0.746, and 60-month CSM: AUC =0.745). The results of accuracy, precision, recall and F1 score confirmed the superior performance of the XGBoost model. The important scores for clinical characteristics and the Shapley Additive Explanations plots highlighted the importance of key factors: chemotherapy, tumor stage, marital status, and tumor size were the top four factors in all models.</p><p><strong>Conclusions: </strong>Our study validated and confirmed the feasibility and high performance of the XGBoost model in predicting CSM in elderly BCa patients after RC. The potential of machine learning contributes to accurately predict the prognosis of cancer.</p>","PeriodicalId":23216,"journal":{"name":"Translational cancer research","volume":"14 3","pages":"1902-1914"},"PeriodicalIF":1.5000,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11985172/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tcr-24-2023","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/27 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Tumor stage, surgery and age are positively correlated with cancer-specific mortality (CSM) in patients diagnosed with bladder cancer (BCa). In light of the successful application of machine learning to process big data in many fields outside of medicine, we aimed to establish and validate whether machine learning models could improve our ability to predict the development of CSM in elderly BCa patients after radical cystectomy (RC).

Methods: Data on eligible patients diagnosed with BCa were obtained from the Surveillance, Epidemiology, and End Results database (2000-2021) and divided into training and validation cohorts in a ratio of 7:3. First, risk factors for the development of CSM in patients were identified by Cox regression analysis. Then, iterative testing and tuning through automated hyperparameter optimization and ten-fold cross-validation were performed to generate stable extreme gradient boosting (XGBoost) models with optimal performance. Receiver operating characteristic (ROC) curve, area under the curve (AUC), calibration curve and confusion matrix were used to evaluate the performance of XGBoost model.

Results: There were 11,763 patients included, of which 5,788 died from BCa. By the comparison of different machine learning models, the final XGBoost model we constructed showed high accuracy and precision in predicting the development of CSM in BCa patients (6-month CSM: AUC =0.799, 12-month CSM: AUC =0.756, 36-month CSM: AUC =0.746, and 60-month CSM: AUC =0.745). The results of accuracy, precision, recall and F1 score confirmed the superior performance of the XGBoost model. The important scores for clinical characteristics and the Shapley Additive Explanations plots highlighted the importance of key factors: chemotherapy, tumor stage, marital status, and tumor size were the top four factors in all models.

Conclusions: Our study validated and confirmed the feasibility and high performance of the XGBoost model in predicting CSM in elderly BCa patients after RC. The potential of machine learning contributes to accurately predict the prognosis of cancer.

老年膀胱癌根治性膀胱切除术后癌症特异性死亡率的增强预后预测:一项XGBoost模型研究
背景:膀胱癌(BCa)患者的肿瘤分期、手术和年龄与肿瘤特异性死亡率(CSM)呈正相关。鉴于机器学习在医学以外的许多领域成功应用于处理大数据,我们旨在建立并验证机器学习模型是否可以提高我们预测根治性膀胱切除术(RC)后老年BCa患者CSM发展的能力。方法:从监测、流行病学和最终结果数据库(2000-2021)中获得诊断为BCa的合格患者的数据,并按7:3的比例分为培训和验证队列。首先,通过Cox回归分析确定CSM患者发生的危险因素。然后,通过自动超参数优化和十倍交叉验证进行迭代测试和调优,以生成性能最优的稳定的极限梯度提升(XGBoost)模型。采用受试者工作特征(ROC)曲线、曲线下面积(AUC)、校准曲线和混淆矩阵对XGBoost模型的性能进行评价。结果:共纳入11763例患者,其中5788例患者死于BCa。通过对不同机器学习模型的比较,我们最终构建的XGBoost模型在预测BCa患者CSM的发展方面具有较高的准确度和精度(6个月CSM: AUC =0.799, 12个月CSM: AUC =0.756, 36个月CSM: AUC =0.746, 60个月CSM: AUC =0.745)。正确率、精密度、召回率和F1分数的结果证实了XGBoost模型的优越性能。临床特征和Shapley加法解释图的重要得分突出了关键因素的重要性:化疗、肿瘤分期、婚姻状况和肿瘤大小是所有模型中排名前四的因素。结论:我们的研究验证并证实了XGBoost模型预测老年BCa患者RC后CSM的可行性和高性能。机器学习的潜力有助于准确预测癌症的预后。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.10
自引率
0.00%
发文量
252
期刊介绍: Translational Cancer Research (Transl Cancer Res TCR; Print ISSN: 2218-676X; Online ISSN 2219-6803; http://tcr.amegroups.com/) is an Open Access, peer-reviewed journal, indexed in Science Citation Index Expanded (SCIE). TCR publishes laboratory studies of novel therapeutic interventions as well as clinical trials which evaluate new treatment paradigms for cancer; results of novel research investigations which bridge the laboratory and clinical settings including risk assessment, cellular and molecular characterization, prevention, detection, diagnosis and treatment of human cancers with the overall goal of improving the clinical care of cancer patients. The focus of TCR is original, peer-reviewed, science-based research that successfully advances clinical medicine toward the goal of improving patients'' quality of life. The editors and an international advisory group of scientists and clinician-scientists as well as other experts will hold TCR articles to the high-quality standards. We accept Original Articles as well as Review Articles, Editorials and Brief Articles.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信