{"title":"用于预测前列腺癌Gleason评分升级的可解释机器学习模型的开发和验证。","authors":"Shu-Feng Li, Jin-Ge Zhao, Chen-Yi Jiang, Shi-Yuan Wang, Si-Yu Liu, Yi-Jun Zhang, Hao Zeng, Fu-Jun Zhao","doi":"10.21037/tau-2025-178","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The high incidence of Gleason score upgrade (GSU) can lead urologists to underestimate tumor aggressiveness, resulting in suboptimal treatment decisions. This study aimed to develop an interpretable machine learning model to predict the risk of GSU in individuals with prostate cancer (PCa) based on readily available clinical parameters.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on patients who underwent radical prostatectomy (RP) at Shanghai General Hospital and West China Hospital. Data from Shanghai General Hospital were categorized into a training set (80%) and a test set (20%), while data from West China Hospital were used for external validation. Preoperative clinical and pathological data were collected. Nine machine learning models [including random forest (RF) and light gradient boosting machine (LightGBM)], were developed, and the model demonstrating the best predictive performance was selected as the final model. Model performance was evaluated using receiver operating characteristic (ROC) curves, calibration curves, decision curves, and SHapley Additive exPlanations (SHAP) interpretation.</p><p><strong>Results: </strong>The LightGBM model demonstrated strong predictive performance, achieving an area under the ROC curve of 84.53% in the test set and 76.61% in external validation. Significant factors associated with GSU included the International Society of Urological Pathology (ISUP) grade, age, clinical tumor stage (T stage), body mass index, prostate-specific antigen (PSA), free-to-total PSA ratio (f/t PSA), platelet-to-lymphocyte ratio (PLR), and bilateral tumor involvement. An online prediction tool was developed based on this model.</p><p><strong>Conclusions: </strong>A machine learning model and an online prediction tool were developed to accurately predict GSU and identify factors associated with this process. This approach may assist clinicians in identifying individuals at high-risk for GSU and facilitating evidence-based treatment decisions.</p>","PeriodicalId":23270,"journal":{"name":"Translational andrology and urology","volume":"14 6","pages":"1631-1644"},"PeriodicalIF":1.7000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12271951/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of an interpretable machine learning model for predicting Gleason score upgrade in prostate cancer.\",\"authors\":\"Shu-Feng Li, Jin-Ge Zhao, Chen-Yi Jiang, Shi-Yuan Wang, Si-Yu Liu, Yi-Jun Zhang, Hao Zeng, Fu-Jun Zhao\",\"doi\":\"10.21037/tau-2025-178\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The high incidence of Gleason score upgrade (GSU) can lead urologists to underestimate tumor aggressiveness, resulting in suboptimal treatment decisions. This study aimed to develop an interpretable machine learning model to predict the risk of GSU in individuals with prostate cancer (PCa) based on readily available clinical parameters.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on patients who underwent radical prostatectomy (RP) at Shanghai General Hospital and West China Hospital. Data from Shanghai General Hospital were categorized into a training set (80%) and a test set (20%), while data from West China Hospital were used for external validation. Preoperative clinical and pathological data were collected. Nine machine learning models [including random forest (RF) and light gradient boosting machine (LightGBM)], were developed, and the model demonstrating the best predictive performance was selected as the final model. Model performance was evaluated using receiver operating characteristic (ROC) curves, calibration curves, decision curves, and SHapley Additive exPlanations (SHAP) interpretation.</p><p><strong>Results: </strong>The LightGBM model demonstrated strong predictive performance, achieving an area under the ROC curve of 84.53% in the test set and 76.61% in external validation. Significant factors associated with GSU included the International Society of Urological Pathology (ISUP) grade, age, clinical tumor stage (T stage), body mass index, prostate-specific antigen (PSA), free-to-total PSA ratio (f/t PSA), platelet-to-lymphocyte ratio (PLR), and bilateral tumor involvement. An online prediction tool was developed based on this model.</p><p><strong>Conclusions: </strong>A machine learning model and an online prediction tool were developed to accurately predict GSU and identify factors associated with this process. This approach may assist clinicians in identifying individuals at high-risk for GSU and facilitating evidence-based treatment decisions.</p>\",\"PeriodicalId\":23270,\"journal\":{\"name\":\"Translational andrology and urology\",\"volume\":\"14 6\",\"pages\":\"1631-1644\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12271951/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational andrology and urology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/tau-2025-178\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/26 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"ANDROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational andrology and urology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tau-2025-178","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/26 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ANDROLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景:高发生率的Gleason评分升级(GSU)可能导致泌尿科医生低估肿瘤的侵袭性,导致次优治疗决策。本研究旨在开发一种可解释的机器学习模型,根据现成的临床参数预测前列腺癌(PCa)患者GSU的风险。方法:回顾性分析在上海总医院和华西医院行根治性前列腺切除术(RP)的患者。来自上海总医院的数据分为训练集(80%)和测试集(20%),华西医院的数据用于外部验证。收集术前临床及病理资料。我们开发了9个机器学习模型[包括随机森林(random forest, RF)和光梯度增强机(light gradient boosting machine, LightGBM)],最终选择预测性能最好的模型作为最终模型。采用受试者工作特征(ROC)曲线、校准曲线、决策曲线和SHapley加性解释(SHAP)解释来评估模型的性能。结果:LightGBM模型具有较强的预测性能,在测试集的ROC曲线下面积为84.53%,在外部验证中为76.61%。与GSU相关的重要因素包括国际泌尿病理学会(ISUP)分级、年龄、临床肿瘤分期(T期)、体重指数、前列腺特异性抗原(PSA)、游离-总PSA比(f/ T PSA)、血小板-淋巴细胞比(PLR)和双侧肿瘤累及情况。基于该模型开发了在线预测工具。结论:开发了机器学习模型和在线预测工具,以准确预测GSU并识别与此过程相关的因素。这种方法可以帮助临床医生识别GSU高危人群,并促进循证治疗决策。
Development and validation of an interpretable machine learning model for predicting Gleason score upgrade in prostate cancer.
Background: The high incidence of Gleason score upgrade (GSU) can lead urologists to underestimate tumor aggressiveness, resulting in suboptimal treatment decisions. This study aimed to develop an interpretable machine learning model to predict the risk of GSU in individuals with prostate cancer (PCa) based on readily available clinical parameters.
Methods: A retrospective analysis was conducted on patients who underwent radical prostatectomy (RP) at Shanghai General Hospital and West China Hospital. Data from Shanghai General Hospital were categorized into a training set (80%) and a test set (20%), while data from West China Hospital were used for external validation. Preoperative clinical and pathological data were collected. Nine machine learning models [including random forest (RF) and light gradient boosting machine (LightGBM)], were developed, and the model demonstrating the best predictive performance was selected as the final model. Model performance was evaluated using receiver operating characteristic (ROC) curves, calibration curves, decision curves, and SHapley Additive exPlanations (SHAP) interpretation.
Results: The LightGBM model demonstrated strong predictive performance, achieving an area under the ROC curve of 84.53% in the test set and 76.61% in external validation. Significant factors associated with GSU included the International Society of Urological Pathology (ISUP) grade, age, clinical tumor stage (T stage), body mass index, prostate-specific antigen (PSA), free-to-total PSA ratio (f/t PSA), platelet-to-lymphocyte ratio (PLR), and bilateral tumor involvement. An online prediction tool was developed based on this model.
Conclusions: A machine learning model and an online prediction tool were developed to accurately predict GSU and identify factors associated with this process. This approach may assist clinicians in identifying individuals at high-risk for GSU and facilitating evidence-based treatment decisions.
期刊介绍:
ranslational Andrology and Urology (Print ISSN 2223-4683; Online ISSN 2223-4691; Transl Androl Urol; TAU) is an open access, peer-reviewed, bi-monthly journal (quarterly published from Mar.2012 - Dec. 2014). The main focus of the journal is to describe new findings in the field of translational research of Andrology and Urology, provides current and practical information on basic research and clinical investigations of Andrology and Urology. Specific areas of interest include, but not limited to, molecular study, pathology, biology and technical advances related to andrology and urology. Topics cover range from evaluation, prevention, diagnosis, therapy, prognosis, rehabilitation and future challenges to urology and andrology. Contributions pertinent to urology and andrology are also included from related fields such as public health, basic sciences, education, sociology, and nursing.