Hao Wu , Aierpati Maimaiti , Jinlong Huang , Jing Xue , Qiang Fu , Zening Wang , Mamutijiang Muertizha , Yang Li , Di Li , Qingjiu Zhou , Yongxin Wang
{"title":"The establishment of machine learning prognostic prediction models for pineal region tumors based on SEER-A multicenter real-world study","authors":"Hao Wu , Aierpati Maimaiti , Jinlong Huang , Jing Xue , Qiang Fu , Zening Wang , Mamutijiang Muertizha , Yang Li , Di Li , Qingjiu Zhou , Yongxin Wang","doi":"10.1016/j.ejso.2025.110058","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Pineal region tumors (PRT) are rare intracranial neoplasms with diverse pathological types and growth characteristics, leading to varied clinical manifestations. This study aims to develop machine learning (ML) models for survival prediction, offering valuable insights for medical practice in the management of PRTs.</div></div><div><h3>Methods</h3><div>Clinical information on PRTs was extracted from the Surveillance, Epidemiology, and End Results (SEER) database. The Kaplan-Meier (K-M) analysis was used to analyze the survival of PRT patients. Univariate and multivariate Cox regression analyses were conducted to identify risk factors for the survival of PRT patients. Then, nomograms were constructed. Seven ML models including Decision Tree, Logistic Regression, LightGBM, Random Forest, XGBoost, K-Nearest Neighbor Algorithm (KNN), and Support Vector Machine (SVM), were developed to predict the prognosis of PRT patients. The predictive value of ML models was evaluated by the area under the receiver's operating characteristic curve (AUC-ROC), tenfold cross verification, calibration curve, and decision curve analysis (DCA).</div></div><div><h3>Results</h3><div>Univariate and multivariate Cox regression revealed that age, histopathology, radiotherapy, and tumor size were independent risk factors for overall survival (OS). Histopathology, surgery, radiotherapy, and tumor size were risk factors for cancer-specific survival (CSS). K-M survival analysis revealed that age, histopathology, marital status, radiotherapy, sex, and surgery significantly impacted OS, while age, histopathology, marital status, race, radiotherapy, sex, and surgery significantly influenced CSS. In the prediction of OS, the ML models with the best clinical utility were RF, Logistic Regression, and XGBoost. For CSS, the most effective models were RF, LightGBM, and RF.</div></div><div><h3>Conclusion</h3><div>ML models demonstrate significant potential and high predictive efficacy in forecasting long-term postoperative survival in PRT patients, providing substantial clinical value.</div></div>","PeriodicalId":11522,"journal":{"name":"Ejso","volume":"51 8","pages":"Article 110058"},"PeriodicalIF":3.5000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ejso","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S074879832500486X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Pineal region tumors (PRT) are rare intracranial neoplasms with diverse pathological types and growth characteristics, leading to varied clinical manifestations. This study aims to develop machine learning (ML) models for survival prediction, offering valuable insights for medical practice in the management of PRTs.
Methods
Clinical information on PRTs was extracted from the Surveillance, Epidemiology, and End Results (SEER) database. The Kaplan-Meier (K-M) analysis was used to analyze the survival of PRT patients. Univariate and multivariate Cox regression analyses were conducted to identify risk factors for the survival of PRT patients. Then, nomograms were constructed. Seven ML models including Decision Tree, Logistic Regression, LightGBM, Random Forest, XGBoost, K-Nearest Neighbor Algorithm (KNN), and Support Vector Machine (SVM), were developed to predict the prognosis of PRT patients. The predictive value of ML models was evaluated by the area under the receiver's operating characteristic curve (AUC-ROC), tenfold cross verification, calibration curve, and decision curve analysis (DCA).
Results
Univariate and multivariate Cox regression revealed that age, histopathology, radiotherapy, and tumor size were independent risk factors for overall survival (OS). Histopathology, surgery, radiotherapy, and tumor size were risk factors for cancer-specific survival (CSS). K-M survival analysis revealed that age, histopathology, marital status, radiotherapy, sex, and surgery significantly impacted OS, while age, histopathology, marital status, race, radiotherapy, sex, and surgery significantly influenced CSS. In the prediction of OS, the ML models with the best clinical utility were RF, Logistic Regression, and XGBoost. For CSS, the most effective models were RF, LightGBM, and RF.
Conclusion
ML models demonstrate significant potential and high predictive efficacy in forecasting long-term postoperative survival in PRT patients, providing substantial clinical value.
期刊介绍:
JSO - European Journal of Surgical Oncology ("the Journal of Cancer Surgery") is the Official Journal of the European Society of Surgical Oncology and BASO ~ the Association for Cancer Surgery.
The EJSO aims to advance surgical oncology research and practice through the publication of original research articles, review articles, editorials, debates and correspondence.