儿童肝母细胞瘤的预后预测：基于seer的机器学习模型的开发和验证。

IF 2.2 3区医学 Q2 SURGERY

Updates in Surgery Pub Date : 2025-10-09 DOI:10.1007/s13304-025-02395-8

Jianfeng Luo, Kaikun Huang, Le Zheng, Long Li

{"title":"儿童肝母细胞瘤的预后预测：基于seer的机器学习模型的开发和验证。","authors":"Jianfeng Luo, Kaikun Huang, Le Zheng, Long Li","doi":"10.1007/s13304-025-02395-8","DOIUrl":null,"url":null,"abstract":"Hepatoblastoma (HB) is the most common primary malignant liver tumor in children. Although the incidence is low, it is a serious threat to children's health. Traditional methods have limitations in prognosis prediction. In the era of precision medicine, machine learning (ML) shows great potential in medical fields, especially in prognosis prediction. This study aims to develop ML-based predictive models for the overall survival (OS) of children with HB, to improve prognosis assessment and quality of life. Data from the Surveillance, Epidemiology, and End Results (SEER) database from 2000 to 2021 were used. A total of 525 pediatric HB patients meeting the inclusion criteria were included. The data were randomly divided into a training cohort (n = 420) and a validation cohort (n = 105) in an 8:2 ratio. Four ML algorithms, including Decision Tree, Random Survival Forest (RSF), Gradient Boosting Survival Analysis (GBSA), and Support Vector Machine (SVM), were used to evaluate the OS of HB patients. The performance of the models was assessed using the area under the receiver-operating characteristic curve (AUC) and the consistency index (C-Index). Shapley Additive Explanations (SHAP) plots were used to interpret the contribution of each variable to the model prediction. The basic characteristics of the patients were analyzed. Through feature variable selection, six variables (age, tumor size, lymph-node invasion, metastatic status, surgical therapy, and chemotherapy) were identified as significant prognostic factors. Among the four ML models, the RSF model showed the best predictive performance. In the training cohort, the AUC for predicting 1-year, 3-year, and 5-year OS was 0.822, 0.810, and 0.809, respectively, and the C-index was 0.791 (95% CI 0.667-0.813). In the validation cohort, the AUC was 0.740, 0.765, and 0.765, and the C-index was 0.764 (95% CI 0.571-0.909). The SHAP summary plot showed that surgical therapy, metastatic status, and tumor size were the most important variables. This study successfully developed and validated four ML-based predictive models for the prognosis of HB patients. The RSF model had superior predictive performance and has broad application prospects in assisting clinicians in making individualized treatment decisions, improving survival prediction accuracy, and optimizing the prognosis of HB patients.","PeriodicalId":23391,"journal":{"name":"Updates in Surgery","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prognostic prediction of hepatoblastoma in children: development and validation of machine learning models-an SEER-based study.\",\"authors\":\"Jianfeng Luo, Kaikun Huang, Le Zheng, Long Li\",\"doi\":\"10.1007/s13304-025-02395-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hepatoblastoma (HB) is the most common primary malignant liver tumor in children. Although the incidence is low, it is a serious threat to children's health. Traditional methods have limitations in prognosis prediction. In the era of precision medicine, machine learning (ML) shows great potential in medical fields, especially in prognosis prediction. This study aims to develop ML-based predictive models for the overall survival (OS) of children with HB, to improve prognosis assessment and quality of life. Data from the Surveillance, Epidemiology, and End Results (SEER) database from 2000 to 2021 were used. A total of 525 pediatric HB patients meeting the inclusion criteria were included. The data were randomly divided into a training cohort (n = 420) and a validation cohort (n = 105) in an 8:2 ratio. Four ML algorithms, including Decision Tree, Random Survival Forest (RSF), Gradient Boosting Survival Analysis (GBSA), and Support Vector Machine (SVM), were used to evaluate the OS of HB patients. The performance of the models was assessed using the area under the receiver-operating characteristic curve (AUC) and the consistency index (C-Index). Shapley Additive Explanations (SHAP) plots were used to interpret the contribution of each variable to the model prediction. The basic characteristics of the patients were analyzed. Through feature variable selection, six variables (age, tumor size, lymph-node invasion, metastatic status, surgical therapy, and chemotherapy) were identified as significant prognostic factors. Among the four ML models, the RSF model showed the best predictive performance. In the training cohort, the AUC for predicting 1-year, 3-year, and 5-year OS was 0.822, 0.810, and 0.809, respectively, and the C-index was 0.791 (95% CI 0.667-0.813). In the validation cohort, the AUC was 0.740, 0.765, and 0.765, and the C-index was 0.764 (95% CI 0.571-0.909). The SHAP summary plot showed that surgical therapy, metastatic status, and tumor size were the most important variables. This study successfully developed and validated four ML-based predictive models for the prognosis of HB patients. The RSF model had superior predictive performance and has broad application prospects in assisting clinicians in making individualized treatment decisions, improving survival prediction accuracy, and optimizing the prognosis of HB patients.\",\"PeriodicalId\":23391,\"journal\":{\"name\":\"Updates in Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Updates in Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13304-025-02395-8\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Updates in Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13304-025-02395-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SURGERY","Score":null,"Total":0}

引用次数: 0

摘要

肝母细胞瘤（HB）是儿童最常见的原发性肝恶性肿瘤。虽然发病率较低，但对儿童健康构成严重威胁。传统方法在预测预后方面存在局限性。在精准医疗时代，机器学习（ML）在医疗领域显示出巨大的潜力，尤其是在预后预测方面。本研究旨在建立基于ml的HB患儿总生存期（OS）预测模型，以改善预后评估和生活质量。使用2000年至2021年监测、流行病学和最终结果（SEER）数据库的数据。共纳入525例符合纳入标准的儿童HB患者。数据按8:2的比例随机分为训练组（n = 420）和验证组（n = 105）。采用决策树（Decision Tree）、随机生存森林（Random Survival Forest， RSF）、梯度增强生存分析（Gradient Boosting Survival Analysis， GBSA）和支持向量机（Support Vector Machine， SVM）四种ML算法对HB患者的OS进行评估。采用受者-工作特征曲线下面积（AUC）和一致性指数（C-Index）对模型的性能进行评估。Shapley加性解释（SHAP）图用于解释每个变量对模型预测的贡献。分析患者的基本特征。通过特征变量选择，6个变量（年龄、肿瘤大小、淋巴结侵袭、转移状态、手术治疗和化疗）被确定为重要的预后因素。在4种ML模型中，RSF模型的预测性能最好。在训练队列中，预测1年、3年和5年OS的AUC分别为0.822、0.810和0.809，C-index为0.791 （95% CI 0.667-0.813）。在验证队列中，AUC分别为0.740、0.765和0.765，c指数为0.764 （95% CI 0.571-0.909）。SHAP总结图显示手术治疗、转移状态和肿瘤大小是最重要的变量。本研究成功开发并验证了四种基于ml的HB患者预后预测模型。RSF模型具有优越的预测性能，在协助临床医生制定个体化治疗决策、提高生存预测准确率、优化HB患者预后方面具有广阔的应用前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Prognostic prediction of hepatoblastoma in children: development and validation of machine learning models-an SEER-based study.

Hepatoblastoma (HB) is the most common primary malignant liver tumor in children. Although the incidence is low, it is a serious threat to children's health. Traditional methods have limitations in prognosis prediction. In the era of precision medicine, machine learning (ML) shows great potential in medical fields, especially in prognosis prediction. This study aims to develop ML-based predictive models for the overall survival (OS) of children with HB, to improve prognosis assessment and quality of life. Data from the Surveillance, Epidemiology, and End Results (SEER) database from 2000 to 2021 were used. A total of 525 pediatric HB patients meeting the inclusion criteria were included. The data were randomly divided into a training cohort (n = 420) and a validation cohort (n = 105) in an 8:2 ratio. Four ML algorithms, including Decision Tree, Random Survival Forest (RSF), Gradient Boosting Survival Analysis (GBSA), and Support Vector Machine (SVM), were used to evaluate the OS of HB patients. The performance of the models was assessed using the area under the receiver-operating characteristic curve (AUC) and the consistency index (C-Index). Shapley Additive Explanations (SHAP) plots were used to interpret the contribution of each variable to the model prediction. The basic characteristics of the patients were analyzed. Through feature variable selection, six variables (age, tumor size, lymph-node invasion, metastatic status, surgical therapy, and chemotherapy) were identified as significant prognostic factors. Among the four ML models, the RSF model showed the best predictive performance. In the training cohort, the AUC for predicting 1-year, 3-year, and 5-year OS was 0.822, 0.810, and 0.809, respectively, and the C-index was 0.791 (95% CI 0.667-0.813). In the validation cohort, the AUC was 0.740, 0.765, and 0.765, and the C-index was 0.764 (95% CI 0.571-0.909). The SHAP summary plot showed that surgical therapy, metastatic status, and tumor size were the most important variables. This study successfully developed and validated four ML-based predictive models for the prognosis of HB patients. The RSF model had superior predictive performance and has broad application prospects in assisting clinicians in making individualized treatment decisions, improving survival prediction accuracy, and optimizing the prognosis of HB patients.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Updates in Surgery Medicine-Surgery

CiteScore

4.50

自引率

7.70%

发文量

208

期刊介绍： Updates in Surgery (UPIS) has been founded in 2010 as the official journal of the Italian Society of Surgery. It’s an international, English-language, peer-reviewed journal dedicated to the surgical sciences. Its main goal is to offer a valuable update on the most recent developments of those surgical techniques that are rapidly evolving, forcing the community of surgeons to a rigorous debate and a continuous refinement of standards of care. In this respect position papers on the mostly debated surgical approaches and accreditation criteria have been published and are welcome for the future. Beside its focus on general surgery, the journal draws particular attention to cutting edge topics and emerging surgical fields that are publishing in monothematic issues guest edited by well-known experts. Updates in Surgery has been considering various types of papers: editorials, comprehensive reviews, original studies and technical notes related to specific surgical procedures and techniques on liver, colorectal, gastric, pancreatic, robotic and bariatric surgery.