{"title":"利用机器学习预测食管癌的肺转移:一项基于人群的研究","authors":"Ying Fang, Jun Wan, Yukai Zeng","doi":"10.1007/s00432-024-05937-6","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Background</h3><p>This study aims to establish a predictive model for assessing the risk of esophageal cancer lung metastasis using machine learning techniques.</p><h3 data-test=\"abstract-sub-heading\">Methods</h3><p>Data on esophageal cancer patients from 2010 to 2020 were extracted from the surveillance, epidemiology, and end results (SEER) database. Through univariate and multivariate logistic regression analyses, eight indicators related to the risk of lung metastasis were selected. These indicators were incorporated into six machine learning classifiers to develop corresponding predictive models. The performance of these models was evaluated and compared using metrics such as The area under curve (AUC), accuracy, sensitivity, specificity, and F1 score.</p><h3 data-test=\"abstract-sub-heading\">Results</h3><p>A total of 20,249 confirmed cases of esophageal cancer were included in this study. Among them, 14,174 cases (70%) were assigned to the training set while 6075 cases (30%) constituted the internal test set. Primary site location, tumor histology, tumor grade classification system T staging criteria N staging criteria brain metastasis bone metastasis liver metastasis emerged as independent risk factors for esophageal cancer with lung metastasis. Amongst the six constructed models, the GBM algorithm-based machine learning model demonstrated superior performance during internal dataset validation. AUC, accuracy, sensitivity, and specificity values achieved by this model stood at respectively at 0.803, 0.849, 0.604, and 0.867.</p><h3 data-test=\"abstract-sub-heading\">Conclusion</h3><p>We have developed an online calculator based on the GBM model (https://lvgrkyxcgdvo7ugoyxyywe.streamlit.app/)to aid clinical decision-making and treatment planning.</p>","PeriodicalId":15118,"journal":{"name":"Journal of Cancer Research and Clinical Oncology","volume":"210 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Use machine learning to predict pulmonary metastasis of esophageal cancer: a population-based study\",\"authors\":\"Ying Fang, Jun Wan, Yukai Zeng\",\"doi\":\"10.1007/s00432-024-05937-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Background</h3><p>This study aims to establish a predictive model for assessing the risk of esophageal cancer lung metastasis using machine learning techniques.</p><h3 data-test=\\\"abstract-sub-heading\\\">Methods</h3><p>Data on esophageal cancer patients from 2010 to 2020 were extracted from the surveillance, epidemiology, and end results (SEER) database. Through univariate and multivariate logistic regression analyses, eight indicators related to the risk of lung metastasis were selected. These indicators were incorporated into six machine learning classifiers to develop corresponding predictive models. The performance of these models was evaluated and compared using metrics such as The area under curve (AUC), accuracy, sensitivity, specificity, and F1 score.</p><h3 data-test=\\\"abstract-sub-heading\\\">Results</h3><p>A total of 20,249 confirmed cases of esophageal cancer were included in this study. Among them, 14,174 cases (70%) were assigned to the training set while 6075 cases (30%) constituted the internal test set. Primary site location, tumor histology, tumor grade classification system T staging criteria N staging criteria brain metastasis bone metastasis liver metastasis emerged as independent risk factors for esophageal cancer with lung metastasis. Amongst the six constructed models, the GBM algorithm-based machine learning model demonstrated superior performance during internal dataset validation. AUC, accuracy, sensitivity, and specificity values achieved by this model stood at respectively at 0.803, 0.849, 0.604, and 0.867.</p><h3 data-test=\\\"abstract-sub-heading\\\">Conclusion</h3><p>We have developed an online calculator based on the GBM model (https://lvgrkyxcgdvo7ugoyxyywe.streamlit.app/)to aid clinical decision-making and treatment planning.</p>\",\"PeriodicalId\":15118,\"journal\":{\"name\":\"Journal of Cancer Research and Clinical Oncology\",\"volume\":\"210 1\",\"pages\":\"\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cancer Research and Clinical Oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00432-024-05937-6\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cancer Research and Clinical Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00432-024-05937-6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景本研究旨在利用机器学习技术建立一个评估食管癌肺转移风险的预测模型。方法从监测、流行病学和最终结果(SEER)数据库中提取了2010年至2020年食管癌患者的数据。通过单变量和多变量逻辑回归分析,选出了与肺转移风险相关的八个指标。这些指标被纳入六个机器学习分类器,以建立相应的预测模型。研究使用曲线下面积(AUC)、准确性、灵敏度、特异性和 F1 分数等指标对这些模型的性能进行了评估和比较。其中,14174 个病例(70%)被分配到训练集,6075 个病例(30%)构成内部测试集。原发部位、肿瘤组织学、肿瘤分级分类系统 T 分期标准 N 分期标准 脑转移 骨转移 肝转移成为食管癌肺转移的独立危险因素。在所构建的六个模型中,基于 GBM 算法的机器学习模型在内部数据集验证中表现出卓越的性能。该模型的AUC、准确性、灵敏度和特异性值分别为0.803、0.849、0.604和0.867。结论我们开发了一种基于GBM模型的在线计算器(https://lvgrkyxcgdvo7ugoyxyywe.streamlit.app/),以帮助临床决策和治疗规划。
Use machine learning to predict pulmonary metastasis of esophageal cancer: a population-based study
Background
This study aims to establish a predictive model for assessing the risk of esophageal cancer lung metastasis using machine learning techniques.
Methods
Data on esophageal cancer patients from 2010 to 2020 were extracted from the surveillance, epidemiology, and end results (SEER) database. Through univariate and multivariate logistic regression analyses, eight indicators related to the risk of lung metastasis were selected. These indicators were incorporated into six machine learning classifiers to develop corresponding predictive models. The performance of these models was evaluated and compared using metrics such as The area under curve (AUC), accuracy, sensitivity, specificity, and F1 score.
Results
A total of 20,249 confirmed cases of esophageal cancer were included in this study. Among them, 14,174 cases (70%) were assigned to the training set while 6075 cases (30%) constituted the internal test set. Primary site location, tumor histology, tumor grade classification system T staging criteria N staging criteria brain metastasis bone metastasis liver metastasis emerged as independent risk factors for esophageal cancer with lung metastasis. Amongst the six constructed models, the GBM algorithm-based machine learning model demonstrated superior performance during internal dataset validation. AUC, accuracy, sensitivity, and specificity values achieved by this model stood at respectively at 0.803, 0.849, 0.604, and 0.867.
Conclusion
We have developed an online calculator based on the GBM model (https://lvgrkyxcgdvo7ugoyxyywe.streamlit.app/)to aid clinical decision-making and treatment planning.
期刊介绍:
The "Journal of Cancer Research and Clinical Oncology" publishes significant and up-to-date articles within the fields of experimental and clinical oncology. The journal, which is chiefly devoted to Original papers, also includes Reviews as well as Editorials and Guest editorials on current, controversial topics. The section Letters to the editors provides a forum for a rapid exchange of comments and information concerning previously published papers and topics of current interest. Meeting reports provide current information on the latest results presented at important congresses.
The following fields are covered: carcinogenesis - etiology, mechanisms; molecular biology; recent developments in tumor therapy; general diagnosis; laboratory diagnosis; diagnostic and experimental pathology; oncologic surgery; and epidemiology.