Zhenyu Liu, Hang Ma, Ziqi Guo, Shuai Su, Xiangbiao He
{"title":"基于机器学习的美国白人肾盂移行细胞癌预测模型的开发:一项基于seer的研究。","authors":"Zhenyu Liu, Hang Ma, Ziqi Guo, Shuai Su, Xiangbiao He","doi":"10.21037/tau-24-385","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Transitional cell carcinoma (TCC) of the renal pelvis is a rare cancer within the urinary system. However, the prognosis is not entirely satisfactory. This study aims to develop a clinical model for predicting cancer-specific survival (CSS) at 1-, 3-, and 5-year for White Americans with renal pelvic TCC.</p><p><strong>Methods: </strong>Data of all White American patients diagnosed with TCC of the renal pelvis from 2010 to 2015 were extracted and analyzed from the Surveillance, Epidemiology, and End Results (SEER) database in this retrospective study. Subsequently, after excluding the metastatic group, a subgroup analysis was performed on the data of 1,715 White Americans with non-metastatic renal pelvic TCC. Patients included in this study were randomly divided into the training and validation sets in a ratio of 7:3. In addition, the features in the training set were extracted by the Boruta algorithm. The importance of these features was visualized using the eXtreme Gradient Boosting (XGBoost)-based SHapley Additive exPlanation (SHAP) tool. To improve predictive accuracy, a nomogram model with these identified independent prognostic variables was developed.</p><p><strong>Results: </strong>A total of 1,887 White American patients with renal pelvic TCC were included in this study. In the training set, the area under the curve (AUC) for CSS nomograms at 1-, 3-, and 5-year were 0.813 [95% confidence interval (CI): 0.774-0.852], 0.738 (95% CI: 0.702-0.774), and 0.733 (95% CI: 0.698-0.768), respectively. Correspondingly, the AUCs for CSS nomograms at the above time points were 0.781 (95% CI: 0.732-0.830), 0.785 (95% CI: 0.741-0.829), and 0.775 (95% CI: 0.729-0.820) in the validation set, respectively. The subgroup analysis results revealed that the AUCs for CSS nomograms at 1-, 3-, and 5-year were 0.788, 0.725, and 0.726 in the training set, respectively, while the AUCs for CSS nomograms at 1-, 3-, and 5-year were 0.831, 0.786, and 0.754 in the training set, respectively.</p><p><strong>Conclusions: </strong>In this study, a nomogram that predicts CSS in White American patients diagnosed with renal pelvic TCC was efficiently constructed. The application of the nomogram may enhance patient care and assist clinicians in choosing the optimal treatment strategies.</p>","PeriodicalId":23270,"journal":{"name":"Translational andrology and urology","volume":"13 12","pages":"2681-2693"},"PeriodicalIF":1.9000,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11732296/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development of a machine learning-based predictive model for transitional cell carcinoma of the renal pelvis in White Americans: a SEER-based study.\",\"authors\":\"Zhenyu Liu, Hang Ma, Ziqi Guo, Shuai Su, Xiangbiao He\",\"doi\":\"10.21037/tau-24-385\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Transitional cell carcinoma (TCC) of the renal pelvis is a rare cancer within the urinary system. However, the prognosis is not entirely satisfactory. This study aims to develop a clinical model for predicting cancer-specific survival (CSS) at 1-, 3-, and 5-year for White Americans with renal pelvic TCC.</p><p><strong>Methods: </strong>Data of all White American patients diagnosed with TCC of the renal pelvis from 2010 to 2015 were extracted and analyzed from the Surveillance, Epidemiology, and End Results (SEER) database in this retrospective study. Subsequently, after excluding the metastatic group, a subgroup analysis was performed on the data of 1,715 White Americans with non-metastatic renal pelvic TCC. Patients included in this study were randomly divided into the training and validation sets in a ratio of 7:3. In addition, the features in the training set were extracted by the Boruta algorithm. The importance of these features was visualized using the eXtreme Gradient Boosting (XGBoost)-based SHapley Additive exPlanation (SHAP) tool. To improve predictive accuracy, a nomogram model with these identified independent prognostic variables was developed.</p><p><strong>Results: </strong>A total of 1,887 White American patients with renal pelvic TCC were included in this study. In the training set, the area under the curve (AUC) for CSS nomograms at 1-, 3-, and 5-year were 0.813 [95% confidence interval (CI): 0.774-0.852], 0.738 (95% CI: 0.702-0.774), and 0.733 (95% CI: 0.698-0.768), respectively. Correspondingly, the AUCs for CSS nomograms at the above time points were 0.781 (95% CI: 0.732-0.830), 0.785 (95% CI: 0.741-0.829), and 0.775 (95% CI: 0.729-0.820) in the validation set, respectively. The subgroup analysis results revealed that the AUCs for CSS nomograms at 1-, 3-, and 5-year were 0.788, 0.725, and 0.726 in the training set, respectively, while the AUCs for CSS nomograms at 1-, 3-, and 5-year were 0.831, 0.786, and 0.754 in the training set, respectively.</p><p><strong>Conclusions: </strong>In this study, a nomogram that predicts CSS in White American patients diagnosed with renal pelvic TCC was efficiently constructed. The application of the nomogram may enhance patient care and assist clinicians in choosing the optimal treatment strategies.</p>\",\"PeriodicalId\":23270,\"journal\":{\"name\":\"Translational andrology and urology\",\"volume\":\"13 12\",\"pages\":\"2681-2693\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11732296/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational andrology and urology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/tau-24-385\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/28 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"ANDROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational andrology and urology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tau-24-385","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/28 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ANDROLOGY","Score":null,"Total":0}
Development of a machine learning-based predictive model for transitional cell carcinoma of the renal pelvis in White Americans: a SEER-based study.
Background: Transitional cell carcinoma (TCC) of the renal pelvis is a rare cancer within the urinary system. However, the prognosis is not entirely satisfactory. This study aims to develop a clinical model for predicting cancer-specific survival (CSS) at 1-, 3-, and 5-year for White Americans with renal pelvic TCC.
Methods: Data of all White American patients diagnosed with TCC of the renal pelvis from 2010 to 2015 were extracted and analyzed from the Surveillance, Epidemiology, and End Results (SEER) database in this retrospective study. Subsequently, after excluding the metastatic group, a subgroup analysis was performed on the data of 1,715 White Americans with non-metastatic renal pelvic TCC. Patients included in this study were randomly divided into the training and validation sets in a ratio of 7:3. In addition, the features in the training set were extracted by the Boruta algorithm. The importance of these features was visualized using the eXtreme Gradient Boosting (XGBoost)-based SHapley Additive exPlanation (SHAP) tool. To improve predictive accuracy, a nomogram model with these identified independent prognostic variables was developed.
Results: A total of 1,887 White American patients with renal pelvic TCC were included in this study. In the training set, the area under the curve (AUC) for CSS nomograms at 1-, 3-, and 5-year were 0.813 [95% confidence interval (CI): 0.774-0.852], 0.738 (95% CI: 0.702-0.774), and 0.733 (95% CI: 0.698-0.768), respectively. Correspondingly, the AUCs for CSS nomograms at the above time points were 0.781 (95% CI: 0.732-0.830), 0.785 (95% CI: 0.741-0.829), and 0.775 (95% CI: 0.729-0.820) in the validation set, respectively. The subgroup analysis results revealed that the AUCs for CSS nomograms at 1-, 3-, and 5-year were 0.788, 0.725, and 0.726 in the training set, respectively, while the AUCs for CSS nomograms at 1-, 3-, and 5-year were 0.831, 0.786, and 0.754 in the training set, respectively.
Conclusions: In this study, a nomogram that predicts CSS in White American patients diagnosed with renal pelvic TCC was efficiently constructed. The application of the nomogram may enhance patient care and assist clinicians in choosing the optimal treatment strategies.
期刊介绍:
ranslational Andrology and Urology (Print ISSN 2223-4683; Online ISSN 2223-4691; Transl Androl Urol; TAU) is an open access, peer-reviewed, bi-monthly journal (quarterly published from Mar.2012 - Dec. 2014). The main focus of the journal is to describe new findings in the field of translational research of Andrology and Urology, provides current and practical information on basic research and clinical investigations of Andrology and Urology. Specific areas of interest include, but not limited to, molecular study, pathology, biology and technical advances related to andrology and urology. Topics cover range from evaluation, prevention, diagnosis, therapy, prognosis, rehabilitation and future challenges to urology and andrology. Contributions pertinent to urology and andrology are also included from related fields such as public health, basic sciences, education, sociology, and nursing.