Predictive models of lymph node metastasis in patients with gastrointestinal stromal tumors based on machine learning algorithms: a SEER-based retrospective study.
{"title":"Predictive models of lymph node metastasis in patients with gastrointestinal stromal tumors based on machine learning algorithms: a SEER-based retrospective study.","authors":"Zheng Lin, Jianping Xiong, Caixin Feng, Shaochun Pang, Liangxue Lin, Feiran Zhang, Xun Chen, Yuandong Yuan, Peijie Xie, Ting Wu, Yanchong Li, Peihong Zheng, Haijie Xu, Ziqun Liao","doi":"10.21037/jgo-24-777","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Gastrointestinal stromal tumor (GIST) is one of the most prevalent tumors in the digestive system. Due to the rarity of lymph node metastasis in patients with GISTs, there is a scarcity of related studies, which leads to ongoing debates regarding its impact on patient prognosis. This study aimed to analyze the impact of lymph node metastasis on the overall survival of GISTs patients and further building predictive models with machine learning algorithms.</p><p><strong>Methods: </strong>This is a retrospective study based on the Surveillance, Epidemiology and End Results (SEER) database. The demographic and clinicopathological characteristics of GISTs patients who underwent surgical therapy were collected from database. Five different types of machine learning algorithms were used to build the models. The ensemble models were based on the three algorithms with the highest sensitivity in the validating cohort. At last, receiver operator characteristic (ROC) curve, precision-recall curve (PRC), calibration curve, decision curve analysis (DCA) and Kaplan-Meier survival curve (KMC) were used to evaluate our models.</p><p><strong>Results: </strong>A total of 1,404 patients with GISTs were included in our study after data cleaning. Artificial Neural Network model [area under the ROC curve (AUC): 0.951, 95% confidence interval (CI): 0.901-0.992, sensitivity: 0.752] achieved the best performance in the validating cohort and was chosen to be the final model. Calibration plots showed good consistency between prediction and actual observations. Although the AUC between the final model and the baseline model showed no significant difference, the area under the PRC (PRAUC) of the final model (PRAUC =0.765) was significantly higher than that of the baseline model (PRAUC =0.455). DCA showed that the final model had high net benefit. Survival analysis indicated that the final model could distinguish the prognosis of patients significantly (all P<0.001).</p><p><strong>Conclusions: </strong>We used machine learning algorithms to build models that can accurately predict lymph node metastasis in GISTs patients.</p>","PeriodicalId":15841,"journal":{"name":"Journal of gastrointestinal oncology","volume":"16 1","pages":"53-66"},"PeriodicalIF":2.0000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11921308/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of gastrointestinal oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/jgo-24-777","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/26 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Gastrointestinal stromal tumor (GIST) is one of the most prevalent tumors in the digestive system. Due to the rarity of lymph node metastasis in patients with GISTs, there is a scarcity of related studies, which leads to ongoing debates regarding its impact on patient prognosis. This study aimed to analyze the impact of lymph node metastasis on the overall survival of GISTs patients and further building predictive models with machine learning algorithms.
Methods: This is a retrospective study based on the Surveillance, Epidemiology and End Results (SEER) database. The demographic and clinicopathological characteristics of GISTs patients who underwent surgical therapy were collected from database. Five different types of machine learning algorithms were used to build the models. The ensemble models were based on the three algorithms with the highest sensitivity in the validating cohort. At last, receiver operator characteristic (ROC) curve, precision-recall curve (PRC), calibration curve, decision curve analysis (DCA) and Kaplan-Meier survival curve (KMC) were used to evaluate our models.
Results: A total of 1,404 patients with GISTs were included in our study after data cleaning. Artificial Neural Network model [area under the ROC curve (AUC): 0.951, 95% confidence interval (CI): 0.901-0.992, sensitivity: 0.752] achieved the best performance in the validating cohort and was chosen to be the final model. Calibration plots showed good consistency between prediction and actual observations. Although the AUC between the final model and the baseline model showed no significant difference, the area under the PRC (PRAUC) of the final model (PRAUC =0.765) was significantly higher than that of the baseline model (PRAUC =0.455). DCA showed that the final model had high net benefit. Survival analysis indicated that the final model could distinguish the prognosis of patients significantly (all P<0.001).
Conclusions: We used machine learning algorithms to build models that can accurately predict lymph node metastasis in GISTs patients.
期刊介绍:
ournal of Gastrointestinal Oncology (Print ISSN 2078-6891; Online ISSN 2219-679X; J Gastrointest Oncol; JGO), the official journal of Society for Gastrointestinal Oncology (SGO), is an open-access, international peer-reviewed journal. It is published quarterly (Sep. 2010- Dec. 2013), bimonthly (Feb. 2014 -) and openly distributed worldwide.
JGO publishes manuscripts that focus on updated and practical information about diagnosis, prevention and clinical investigations of gastrointestinal cancer treatment. Specific areas of interest include, but not limited to, multimodality therapy, markers, imaging and tumor biology.