Individualized survival prediction and risk stratification using machine learning for patients with malignant struma ovarii: a population-based study with external validation.
{"title":"Individualized survival prediction and risk stratification using machine learning for patients with malignant struma ovarii: a population-based study with external validation.","authors":"Shangcheng Yan, Zhen Cao, Qiyao Zhang, Bingrong Chen, Hao Wu, Hongtao Cao, Xiaobin Li, Yaqi Wang, Yalei Wang, Yonghui Chen, Ziwen Liu","doi":"10.21037/gs-2025-35","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Malignant struma ovarii (MSO) is a rare thyroid-type cancer originating in ovarian teratoma. Prognosis of MSO is less studied without unanimous staging or stratification system. This study aimed to developed and validated a machine learning (ML)-based model to predict overall survival (OS) for patients with MSO and to risk-stratify them.</p><p><strong>Methods: </strong>Patients with histologically confirmed MSO diagnosed in 1975-2021 from the Surveillance, Epidemiology, and End Results (SEER) program were identified as the training cohort. Patients in a systematic literature review were collected as the testing cohort. OS was selected as the outcome, while demographic, clinicopathological and therapeutic information were used as features. Following data encoding, imputing and scaling, univariate feature selection was performed. Cox proportional hazard (CoxPH), Cox with elastic net penalty (CoxNet), random survival forest (RSF), gradient boosting machine (GBM), and survival tree (ST) models were trained and tuned. Each model was evaluated on its c-index, time-dependent area under the curve (AUC), time-dependent Brier score (BS) and stratification ability in the training and the testing cohort respectively. The algorithm that performed the best in the testing cohort was finally chosen for SHapley Additive exPlanations (SHAP) interpretation and Streamlit web application deployment.</p><p><strong>Results: </strong>The study included 120 and 194 patients in the training and testing cohort respectively. At the end of follow-up (median time 115.5 and 32.5 months respectively), 101 (84.2%) and 181 patients (93.3%) survived respectively. RSF had the best performance in the testing cohort, possessing the highest c-index (0.841, 95% confidence interval: 0.732-0.916), the highest mean AUC (0.852), the lowest integrated BS (0.042), and the smallest P value (<0.001) on log-rank test comparing the stratified groups. According to SHAP, older age, hysterectomy, larger tumor size and more advanced American Joint Committee on Cancer stage had the strongest predictive power for worse OS among all 13 features. An interactive application (https://mso-surv.streamlit.app/) was then implemented which can display the predicted Kaplan-Meier curve, survival probability, risk stratification and the contributions of features for the output.</p><p><strong>Conclusions: </strong>We reported the first externally tested time-to-event prognostic prediction model for MSO. ML algorithms enabled precise individual-patient prediction and stratification, and can potentially assist patient counselling and decision-making for treatment and surveillance.</p>","PeriodicalId":12760,"journal":{"name":"Gland surgery","volume":"14 6","pages":"1052-1065"},"PeriodicalIF":1.6000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261249/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gland surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/gs-2025-35","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/26 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Malignant struma ovarii (MSO) is a rare thyroid-type cancer originating in ovarian teratoma. Prognosis of MSO is less studied without unanimous staging or stratification system. This study aimed to developed and validated a machine learning (ML)-based model to predict overall survival (OS) for patients with MSO and to risk-stratify them.
Methods: Patients with histologically confirmed MSO diagnosed in 1975-2021 from the Surveillance, Epidemiology, and End Results (SEER) program were identified as the training cohort. Patients in a systematic literature review were collected as the testing cohort. OS was selected as the outcome, while demographic, clinicopathological and therapeutic information were used as features. Following data encoding, imputing and scaling, univariate feature selection was performed. Cox proportional hazard (CoxPH), Cox with elastic net penalty (CoxNet), random survival forest (RSF), gradient boosting machine (GBM), and survival tree (ST) models were trained and tuned. Each model was evaluated on its c-index, time-dependent area under the curve (AUC), time-dependent Brier score (BS) and stratification ability in the training and the testing cohort respectively. The algorithm that performed the best in the testing cohort was finally chosen for SHapley Additive exPlanations (SHAP) interpretation and Streamlit web application deployment.
Results: The study included 120 and 194 patients in the training and testing cohort respectively. At the end of follow-up (median time 115.5 and 32.5 months respectively), 101 (84.2%) and 181 patients (93.3%) survived respectively. RSF had the best performance in the testing cohort, possessing the highest c-index (0.841, 95% confidence interval: 0.732-0.916), the highest mean AUC (0.852), the lowest integrated BS (0.042), and the smallest P value (<0.001) on log-rank test comparing the stratified groups. According to SHAP, older age, hysterectomy, larger tumor size and more advanced American Joint Committee on Cancer stage had the strongest predictive power for worse OS among all 13 features. An interactive application (https://mso-surv.streamlit.app/) was then implemented which can display the predicted Kaplan-Meier curve, survival probability, risk stratification and the contributions of features for the output.
Conclusions: We reported the first externally tested time-to-event prognostic prediction model for MSO. ML algorithms enabled precise individual-patient prediction and stratification, and can potentially assist patient counselling and decision-making for treatment and surveillance.
期刊介绍:
Gland Surgery (Gland Surg; GS, Print ISSN 2227-684X; Online ISSN 2227-8575) being indexed by PubMed/PubMed Central, is an open access, peer-review journal launched at May of 2012, published bio-monthly since February 2015.