Development and validation of machine learning models for predicting lung metastasis risk in differentiated thyroid cancer based on two databases.

IF 1.5 3区医学 Q3 SURGERY

Gland surgery Pub Date : 2024-11-30 Epub Date: 2024-11-26 DOI:10.21037/gs-24-481

Haolin Shen, Caiyun Yang, Yuegui Wang, Jianmei Liao, Xianbo Zuo, Bo Zhang, Xiao Yang

{"title":"Development and validation of machine learning models for predicting lung metastasis risk in differentiated thyroid cancer based on two databases.","authors":"Haolin Shen, Caiyun Yang, Yuegui Wang, Jianmei Liao, Xianbo Zuo, Bo Zhang, Xiao Yang","doi":"10.21037/gs-24-481","DOIUrl":null,"url":null,"abstract":"Background: Differentiated thyroid cancer (DTC) progresses slowly, but patients with lung metastasis (LM) have a poor prognosis. The aim of this study was to develop and evaluate the predictive ability of machine learning (ML) models in estimating the risk of LM in patients with DTC and to identify the independent risk factors specific to different age and gender subgroups.Methods: The demographic and clinicopathological data of patients with DTC were obtained from two databases: firstly, the National Institutes of Health Surveillance, Epidemiology, and End Results (SEER) database [2010-2015], which provides extensive epidemiological and clinical information on cancer patients; secondly, the Zhangzhou Municipal Hospital Affiliated to Fujian Medical University [2014-2017], which focuses more on patients' specific clinicopathological characteristics and treatment outcomes. Common variables from both databases were extracted. The data were then split into training, testing and validation sets. The training set was used to build and train ML models, while the testing and validation set were employed to assess the performance of these models. In terms of model development, we established five different ML models: logistic regression (LR), random forest (RF), decision tree (DT), extreme gradient boosting (XGBoost), and gradient boosting machine (GBM). For model validation, we utilized various evaluation metrics, including accuracy, precision, recall, F1 score, Brier score, area under the receiver operating characteristic (ROC) curve (AUROC), area under the precision-recall (PR) curve (PR-AUC), calibration curve, and decision curve analysis (DCA). The importance of various features was ranked and visualized for the top-performing models.Results: The analysis identified age, gender, tumor size, T stage, N stage, and histologic type as significant independent risk factors for LM. The effects of gender, T stage, and histological type on the risk of LM varied across the different age subgroups. In the female population, tumor size was an independent risk factor for LM, while it was not in the male population. GBM achieved an AUROC of 0.982, a Brier score of 0.047, an accuracy of 0.818, and an F1 score of 0.818 in the validation set, outperforming the other models.Conclusions: The GBM model emerged as an effective tool for identifying high-risk LM populations in DTC, with the potential to guide clinical practice and facilitate the development of individualized treatment plans. Further research to validate these findings across more diverse patient populations and clinical settings is recommended.","PeriodicalId":12760,"journal":{"name":"Gland surgery","volume":"13 11","pages":"2174-2188"},"PeriodicalIF":1.5000,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11635582/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gland surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/gs-24-481","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/26 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Differentiated thyroid cancer (DTC) progresses slowly, but patients with lung metastasis (LM) have a poor prognosis. The aim of this study was to develop and evaluate the predictive ability of machine learning (ML) models in estimating the risk of LM in patients with DTC and to identify the independent risk factors specific to different age and gender subgroups.

Methods: The demographic and clinicopathological data of patients with DTC were obtained from two databases: firstly, the National Institutes of Health Surveillance, Epidemiology, and End Results (SEER) database [2010-2015], which provides extensive epidemiological and clinical information on cancer patients; secondly, the Zhangzhou Municipal Hospital Affiliated to Fujian Medical University [2014-2017], which focuses more on patients' specific clinicopathological characteristics and treatment outcomes. Common variables from both databases were extracted. The data were then split into training, testing and validation sets. The training set was used to build and train ML models, while the testing and validation set were employed to assess the performance of these models. In terms of model development, we established five different ML models: logistic regression (LR), random forest (RF), decision tree (DT), extreme gradient boosting (XGBoost), and gradient boosting machine (GBM). For model validation, we utilized various evaluation metrics, including accuracy, precision, recall, F1 score, Brier score, area under the receiver operating characteristic (ROC) curve (AUROC), area under the precision-recall (PR) curve (PR-AUC), calibration curve, and decision curve analysis (DCA). The importance of various features was ranked and visualized for the top-performing models.

Results: The analysis identified age, gender, tumor size, T stage, N stage, and histologic type as significant independent risk factors for LM. The effects of gender, T stage, and histological type on the risk of LM varied across the different age subgroups. In the female population, tumor size was an independent risk factor for LM, while it was not in the male population. GBM achieved an AUROC of 0.982, a Brier score of 0.047, an accuracy of 0.818, and an F1 score of 0.818 in the validation set, outperforming the other models.

Conclusions: The GBM model emerged as an effective tool for identifying high-risk LM populations in DTC, with the potential to guide clinical practice and facilitate the development of individualized treatment plans. Further research to validate these findings across more diverse patient populations and clinical settings is recommended.

查看原文本刊更多论文

基于两个数据库开发和验证用于预测分化型甲状腺癌肺转移风险的机器学习模型

背景：分化型甲状腺癌（DTC）进展缓慢，但肺转移（LM）患者预后较差。本研究的目的是开发和评估机器学习（ML）模型在估计DTC患者LM风险方面的预测能力，并确定不同年龄和性别亚组特有的独立危险因素。方法：从两个数据库获取DTC患者的人口学和临床病理数据：一是美国国立卫生监测、流行病学和最终结果（SEER）数据库[2010-2015]，该数据库提供了广泛的癌症患者流行病学和临床信息；其次是福建医科大学附属漳州市医院[2014-2017]，更注重患者的具体临床病理特征和治疗结果。从两个数据库中提取公共变量。然后将数据分成训练集、测试集和验证集。训练集用于构建和训练机器学习模型，而测试和验证集用于评估这些模型的性能。在模型开发方面，我们建立了五种不同的机器学习模型：逻辑回归（LR）、随机森林（RF）、决策树（DT）、极端梯度增强（XGBoost）和梯度增强机（GBM）。为了验证模型，我们使用了各种评价指标，包括准确度、精密度、召回率、F1评分、Brier评分、受试者工作特征（ROC）曲线下面积（AUROC）、精密度-召回率（PR）曲线下面积（PR- auc）、校准曲线和决策曲线分析（DCA）。对于表现最好的模型，各种特征的重要性进行了排名和可视化。结果：分析发现年龄、性别、肿瘤大小、T分期、N分期和组织学类型是LM的重要独立危险因素。性别、T分期和组织学类型对LM风险的影响在不同的年龄亚组中有所不同。在女性人群中，肿瘤大小是LM的独立危险因素，而在男性人群中则不是。在验证集中，GBM的AUROC为0.982，Brier评分为0.047，准确率为0.818，F1评分为0.818，优于其他模型。结论：GBM模型是识别DTC中LM高危人群的有效工具，具有指导临床实践和促进个体化治疗计划制定的潜力。建议进行进一步的研究，在更多不同的患者群体和临床环境中验证这些发现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Gland surgery Medicine-Surgery

CiteScore

3.60

自引率

0.00%

发文量

113

期刊介绍： Gland Surgery (Gland Surg; GS, Print ISSN 2227-684X; Online ISSN 2227-8575) being indexed by PubMed/PubMed Central, is an open access, peer-review journal launched at May of 2012, published bio-monthly since February 2015.