{"title":"儿童肝母细胞瘤的预后预测:基于seer的机器学习模型的开发和验证。","authors":"Jianfeng Luo, Kaikun Huang, Le Zheng, Long Li","doi":"10.1007/s13304-025-02395-8","DOIUrl":null,"url":null,"abstract":"<p><p>Hepatoblastoma (HB) is the most common primary malignant liver tumor in children. Although the incidence is low, it is a serious threat to children's health. Traditional methods have limitations in prognosis prediction. In the era of precision medicine, machine learning (ML) shows great potential in medical fields, especially in prognosis prediction. This study aims to develop ML-based predictive models for the overall survival (OS) of children with HB, to improve prognosis assessment and quality of life. Data from the Surveillance, Epidemiology, and End Results (SEER) database from 2000 to 2021 were used. A total of 525 pediatric HB patients meeting the inclusion criteria were included. The data were randomly divided into a training cohort (n = 420) and a validation cohort (n = 105) in an 8:2 ratio. Four ML algorithms, including Decision Tree, Random Survival Forest (RSF), Gradient Boosting Survival Analysis (GBSA), and Support Vector Machine (SVM), were used to evaluate the OS of HB patients. The performance of the models was assessed using the area under the receiver-operating characteristic curve (AUC) and the consistency index (C-Index). Shapley Additive Explanations (SHAP) plots were used to interpret the contribution of each variable to the model prediction. The basic characteristics of the patients were analyzed. Through feature variable selection, six variables (age, tumor size, lymph-node invasion, metastatic status, surgical therapy, and chemotherapy) were identified as significant prognostic factors. Among the four ML models, the RSF model showed the best predictive performance. In the training cohort, the AUC for predicting 1-year, 3-year, and 5-year OS was 0.822, 0.810, and 0.809, respectively, and the C-index was 0.791 (95% CI 0.667-0.813). In the validation cohort, the AUC was 0.740, 0.765, and 0.765, and the C-index was 0.764 (95% CI 0.571-0.909). The SHAP summary plot showed that surgical therapy, metastatic status, and tumor size were the most important variables. This study successfully developed and validated four ML-based predictive models for the prognosis of HB patients. The RSF model had superior predictive performance and has broad application prospects in assisting clinicians in making individualized treatment decisions, improving survival prediction accuracy, and optimizing the prognosis of HB patients.</p>","PeriodicalId":23391,"journal":{"name":"Updates in Surgery","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prognostic prediction of hepatoblastoma in children: development and validation of machine learning models-an SEER-based study.\",\"authors\":\"Jianfeng Luo, Kaikun Huang, Le Zheng, Long Li\",\"doi\":\"10.1007/s13304-025-02395-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Hepatoblastoma (HB) is the most common primary malignant liver tumor in children. Although the incidence is low, it is a serious threat to children's health. Traditional methods have limitations in prognosis prediction. In the era of precision medicine, machine learning (ML) shows great potential in medical fields, especially in prognosis prediction. This study aims to develop ML-based predictive models for the overall survival (OS) of children with HB, to improve prognosis assessment and quality of life. Data from the Surveillance, Epidemiology, and End Results (SEER) database from 2000 to 2021 were used. A total of 525 pediatric HB patients meeting the inclusion criteria were included. The data were randomly divided into a training cohort (n = 420) and a validation cohort (n = 105) in an 8:2 ratio. Four ML algorithms, including Decision Tree, Random Survival Forest (RSF), Gradient Boosting Survival Analysis (GBSA), and Support Vector Machine (SVM), were used to evaluate the OS of HB patients. The performance of the models was assessed using the area under the receiver-operating characteristic curve (AUC) and the consistency index (C-Index). Shapley Additive Explanations (SHAP) plots were used to interpret the contribution of each variable to the model prediction. The basic characteristics of the patients were analyzed. Through feature variable selection, six variables (age, tumor size, lymph-node invasion, metastatic status, surgical therapy, and chemotherapy) were identified as significant prognostic factors. Among the four ML models, the RSF model showed the best predictive performance. In the training cohort, the AUC for predicting 1-year, 3-year, and 5-year OS was 0.822, 0.810, and 0.809, respectively, and the C-index was 0.791 (95% CI 0.667-0.813). In the validation cohort, the AUC was 0.740, 0.765, and 0.765, and the C-index was 0.764 (95% CI 0.571-0.909). The SHAP summary plot showed that surgical therapy, metastatic status, and tumor size were the most important variables. This study successfully developed and validated four ML-based predictive models for the prognosis of HB patients. The RSF model had superior predictive performance and has broad application prospects in assisting clinicians in making individualized treatment decisions, improving survival prediction accuracy, and optimizing the prognosis of HB patients.</p>\",\"PeriodicalId\":23391,\"journal\":{\"name\":\"Updates in Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Updates in Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13304-025-02395-8\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Updates in Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13304-025-02395-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SURGERY","Score":null,"Total":0}
Prognostic prediction of hepatoblastoma in children: development and validation of machine learning models-an SEER-based study.
Hepatoblastoma (HB) is the most common primary malignant liver tumor in children. Although the incidence is low, it is a serious threat to children's health. Traditional methods have limitations in prognosis prediction. In the era of precision medicine, machine learning (ML) shows great potential in medical fields, especially in prognosis prediction. This study aims to develop ML-based predictive models for the overall survival (OS) of children with HB, to improve prognosis assessment and quality of life. Data from the Surveillance, Epidemiology, and End Results (SEER) database from 2000 to 2021 were used. A total of 525 pediatric HB patients meeting the inclusion criteria were included. The data were randomly divided into a training cohort (n = 420) and a validation cohort (n = 105) in an 8:2 ratio. Four ML algorithms, including Decision Tree, Random Survival Forest (RSF), Gradient Boosting Survival Analysis (GBSA), and Support Vector Machine (SVM), were used to evaluate the OS of HB patients. The performance of the models was assessed using the area under the receiver-operating characteristic curve (AUC) and the consistency index (C-Index). Shapley Additive Explanations (SHAP) plots were used to interpret the contribution of each variable to the model prediction. The basic characteristics of the patients were analyzed. Through feature variable selection, six variables (age, tumor size, lymph-node invasion, metastatic status, surgical therapy, and chemotherapy) were identified as significant prognostic factors. Among the four ML models, the RSF model showed the best predictive performance. In the training cohort, the AUC for predicting 1-year, 3-year, and 5-year OS was 0.822, 0.810, and 0.809, respectively, and the C-index was 0.791 (95% CI 0.667-0.813). In the validation cohort, the AUC was 0.740, 0.765, and 0.765, and the C-index was 0.764 (95% CI 0.571-0.909). The SHAP summary plot showed that surgical therapy, metastatic status, and tumor size were the most important variables. This study successfully developed and validated four ML-based predictive models for the prognosis of HB patients. The RSF model had superior predictive performance and has broad application prospects in assisting clinicians in making individualized treatment decisions, improving survival prediction accuracy, and optimizing the prognosis of HB patients.
期刊介绍:
Updates in Surgery (UPIS) has been founded in 2010 as the official journal of the Italian Society of Surgery. It’s an international, English-language, peer-reviewed journal dedicated to the surgical sciences. Its main goal is to offer a valuable update on the most recent developments of those surgical techniques that are rapidly evolving, forcing the community of surgeons to a rigorous debate and a continuous refinement of standards of care. In this respect position papers on the mostly debated surgical approaches and accreditation criteria have been published and are welcome for the future.
Beside its focus on general surgery, the journal draws particular attention to cutting edge topics and emerging surgical fields that are publishing in monothematic issues guest edited by well-known experts.
Updates in Surgery has been considering various types of papers: editorials, comprehensive reviews, original studies and technical notes related to specific surgical procedures and techniques on liver, colorectal, gastric, pancreatic, robotic and bariatric surgery.