Individualized survival prediction and risk stratification using machine learning for patients with malignant struma ovarii: a population-based study with external validation.

IF 1.6 3区 医学 Q3 SURGERY
Gland surgery Pub Date : 2025-06-30 Epub Date: 2025-06-26 DOI:10.21037/gs-2025-35
Shangcheng Yan, Zhen Cao, Qiyao Zhang, Bingrong Chen, Hao Wu, Hongtao Cao, Xiaobin Li, Yaqi Wang, Yalei Wang, Yonghui Chen, Ziwen Liu
{"title":"Individualized survival prediction and risk stratification using machine learning for patients with malignant struma ovarii: a population-based study with external validation.","authors":"Shangcheng Yan, Zhen Cao, Qiyao Zhang, Bingrong Chen, Hao Wu, Hongtao Cao, Xiaobin Li, Yaqi Wang, Yalei Wang, Yonghui Chen, Ziwen Liu","doi":"10.21037/gs-2025-35","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Malignant struma ovarii (MSO) is a rare thyroid-type cancer originating in ovarian teratoma. Prognosis of MSO is less studied without unanimous staging or stratification system. This study aimed to developed and validated a machine learning (ML)-based model to predict overall survival (OS) for patients with MSO and to risk-stratify them.</p><p><strong>Methods: </strong>Patients with histologically confirmed MSO diagnosed in 1975-2021 from the Surveillance, Epidemiology, and End Results (SEER) program were identified as the training cohort. Patients in a systematic literature review were collected as the testing cohort. OS was selected as the outcome, while demographic, clinicopathological and therapeutic information were used as features. Following data encoding, imputing and scaling, univariate feature selection was performed. Cox proportional hazard (CoxPH), Cox with elastic net penalty (CoxNet), random survival forest (RSF), gradient boosting machine (GBM), and survival tree (ST) models were trained and tuned. Each model was evaluated on its c-index, time-dependent area under the curve (AUC), time-dependent Brier score (BS) and stratification ability in the training and the testing cohort respectively. The algorithm that performed the best in the testing cohort was finally chosen for SHapley Additive exPlanations (SHAP) interpretation and Streamlit web application deployment.</p><p><strong>Results: </strong>The study included 120 and 194 patients in the training and testing cohort respectively. At the end of follow-up (median time 115.5 and 32.5 months respectively), 101 (84.2%) and 181 patients (93.3%) survived respectively. RSF had the best performance in the testing cohort, possessing the highest c-index (0.841, 95% confidence interval: 0.732-0.916), the highest mean AUC (0.852), the lowest integrated BS (0.042), and the smallest P value (<0.001) on log-rank test comparing the stratified groups. According to SHAP, older age, hysterectomy, larger tumor size and more advanced American Joint Committee on Cancer stage had the strongest predictive power for worse OS among all 13 features. An interactive application (https://mso-surv.streamlit.app/) was then implemented which can display the predicted Kaplan-Meier curve, survival probability, risk stratification and the contributions of features for the output.</p><p><strong>Conclusions: </strong>We reported the first externally tested time-to-event prognostic prediction model for MSO. ML algorithms enabled precise individual-patient prediction and stratification, and can potentially assist patient counselling and decision-making for treatment and surveillance.</p>","PeriodicalId":12760,"journal":{"name":"Gland surgery","volume":"14 6","pages":"1052-1065"},"PeriodicalIF":1.6000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261249/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gland surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/gs-2025-35","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/26 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Malignant struma ovarii (MSO) is a rare thyroid-type cancer originating in ovarian teratoma. Prognosis of MSO is less studied without unanimous staging or stratification system. This study aimed to developed and validated a machine learning (ML)-based model to predict overall survival (OS) for patients with MSO and to risk-stratify them.

Methods: Patients with histologically confirmed MSO diagnosed in 1975-2021 from the Surveillance, Epidemiology, and End Results (SEER) program were identified as the training cohort. Patients in a systematic literature review were collected as the testing cohort. OS was selected as the outcome, while demographic, clinicopathological and therapeutic information were used as features. Following data encoding, imputing and scaling, univariate feature selection was performed. Cox proportional hazard (CoxPH), Cox with elastic net penalty (CoxNet), random survival forest (RSF), gradient boosting machine (GBM), and survival tree (ST) models were trained and tuned. Each model was evaluated on its c-index, time-dependent area under the curve (AUC), time-dependent Brier score (BS) and stratification ability in the training and the testing cohort respectively. The algorithm that performed the best in the testing cohort was finally chosen for SHapley Additive exPlanations (SHAP) interpretation and Streamlit web application deployment.

Results: The study included 120 and 194 patients in the training and testing cohort respectively. At the end of follow-up (median time 115.5 and 32.5 months respectively), 101 (84.2%) and 181 patients (93.3%) survived respectively. RSF had the best performance in the testing cohort, possessing the highest c-index (0.841, 95% confidence interval: 0.732-0.916), the highest mean AUC (0.852), the lowest integrated BS (0.042), and the smallest P value (<0.001) on log-rank test comparing the stratified groups. According to SHAP, older age, hysterectomy, larger tumor size and more advanced American Joint Committee on Cancer stage had the strongest predictive power for worse OS among all 13 features. An interactive application (https://mso-surv.streamlit.app/) was then implemented which can display the predicted Kaplan-Meier curve, survival probability, risk stratification and the contributions of features for the output.

Conclusions: We reported the first externally tested time-to-event prognostic prediction model for MSO. ML algorithms enabled precise individual-patient prediction and stratification, and can potentially assist patient counselling and decision-making for treatment and surveillance.

使用机器学习对卵巢恶性肿瘤患者进行个体化生存预测和风险分层:一项具有外部验证的基于人群的研究。
背景:卵巢恶性瘤(MSO)是一种罕见的甲状腺型肿瘤,起源于卵巢畸胎瘤。由于没有统一的分期或分层系统,对MSO预后的研究较少。本研究旨在开发并验证基于机器学习(ML)的模型,以预测MSO患者的总生存期(OS)并对其进行风险分层。方法:在监测、流行病学和最终结果(SEER)项目中诊断的1975-2021年组织学证实的MSO患者被确定为培训队列。收集系统文献综述中的患者作为测试队列。OS作为结局,人口学、临床病理和治疗信息作为特征。在进行数据编码、输入和缩放后,进行单变量特征选择。对Cox比例风险(Cox proportional hazard, Cox with elastic net penalty, Cox)、随机生存森林(random survival forest, RSF)、梯度增强机(gradient boosting machine, GBM)和生存树(survival tree, ST)模型进行了训练和调整。在训练组和测试组中,分别对各模型的c指数、随时间变化的曲线下面积(AUC)、随时间变化的Brier评分(BS)和分层能力进行评价。在测试队列中表现最好的算法最终被选择用于SHapley加性解释(SHAP)解释和Streamlit web应用程序部署。结果:本研究纳入训练组120例,测试组194例。随访结束时(中位时间分别为115.5个月和32.5个月),101例(84.2%)和181例(93.3%)患者存活。RSF在测试队列中表现最好,c指数最高(0.841,95%可信区间:0.732-0.916),平均AUC最高(0.852),综合BS最低(0.042),P值最小(结论:我们报道了第一个外部测试的MSO时间到事件的预后预测模型。机器学习算法实现了精确的个体患者预测和分层,并可以潜在地帮助患者咨询和决策治疗和监测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Gland surgery
Gland surgery Medicine-Surgery
CiteScore
3.60
自引率
0.00%
发文量
113
期刊介绍: Gland Surgery (Gland Surg; GS, Print ISSN 2227-684X; Online ISSN 2227-8575) being indexed by PubMed/PubMed Central, is an open access, peer-review journal launched at May of 2012, published bio-monthly since February 2015.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信