Predicting central lymph node metastasis in papillary thyroid microcarcinoma: a breakthrough with interpretable machine learning.

IF 3.9 2区 医学 Q2 ENDOCRINOLOGY & METABOLISM
Frontiers in Endocrinology Pub Date : 2025-05-12 eCollection Date: 2025-01-01 DOI:10.3389/fendo.2025.1537386
Weijun Zhou, Lijuan Li, Xiaowen Hao, Lanying Wu, Lifu Liu, Binyu Zheng, Yangzheng Xia, Yong Liu
{"title":"Predicting central lymph node metastasis in papillary thyroid microcarcinoma: a breakthrough with interpretable machine learning.","authors":"Weijun Zhou, Lijuan Li, Xiaowen Hao, Lanying Wu, Lifu Liu, Binyu Zheng, Yangzheng Xia, Yong Liu","doi":"10.3389/fendo.2025.1537386","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop and validate an interpretable machine learning (ML) model for the preoperative prediction of central lymph node metastasis (CLNM) in papillary thyroid microcarcinoma (PTMC).</p><p><strong>Methods: </strong>From December 2016 to December 2023, we retrospectively analyzed 710 PTMC patients who underwent thyroidectomies. Feature selection was conducted using the least absolute shrinkage and selection operator (LASSO) regression method, alongside the Support Vector Machine-Recursive Feature Elimination (SVM-RFE) algorithm in conjunction with multivariate logistic regression. Eight ML algorithms, namely Decision Tree, Random Forest (RF), K-nearest neighbors, Support vector machine, Extreme Gradient Boosting, Naive Bayes, Logistic regression, and Light Gradient Boosting machine, were developed for the prediction of CLNM. The performance of these models was evaluated using area under the receiver operating characteristic curve (AUC), decision curve analysis (DCA), sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), and F1 scores. Additionally, the Shapley Additive Explanation (SHAP) algorithm was utilized to clarify the results of the optimal ML model.</p><p><strong>Results: </strong>The results indicated that 32.95% of the patients (234/710) presented with CLNM. Tumor diameter, multifocality, lymph nodes identified via ultrasound (US-LN), and extrathyroidal extension (ETE) were identified as independent predictors of CLNM. The RF model achieved the highest performance in the validation set with an AUC of 0.893(95%CI: 0.846-0.940), accuracy of 0.832, sensitivity of 0.764, specificity of 0.866, PPV of 0.743, NPV of 0.879, and F1-score of 0.753. Furthermore, the DCA demonstrated that the RF model exhibited a superior clinical net benefit.</p><p><strong>Conclusion: </strong>Our model predicted the risk of CLNM in PTMC patients with high accuracy preoperatively.</p>","PeriodicalId":12447,"journal":{"name":"Frontiers in Endocrinology","volume":"16 ","pages":"1537386"},"PeriodicalIF":3.9000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12104047/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Endocrinology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fendo.2025.1537386","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: To develop and validate an interpretable machine learning (ML) model for the preoperative prediction of central lymph node metastasis (CLNM) in papillary thyroid microcarcinoma (PTMC).

Methods: From December 2016 to December 2023, we retrospectively analyzed 710 PTMC patients who underwent thyroidectomies. Feature selection was conducted using the least absolute shrinkage and selection operator (LASSO) regression method, alongside the Support Vector Machine-Recursive Feature Elimination (SVM-RFE) algorithm in conjunction with multivariate logistic regression. Eight ML algorithms, namely Decision Tree, Random Forest (RF), K-nearest neighbors, Support vector machine, Extreme Gradient Boosting, Naive Bayes, Logistic regression, and Light Gradient Boosting machine, were developed for the prediction of CLNM. The performance of these models was evaluated using area under the receiver operating characteristic curve (AUC), decision curve analysis (DCA), sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), and F1 scores. Additionally, the Shapley Additive Explanation (SHAP) algorithm was utilized to clarify the results of the optimal ML model.

Results: The results indicated that 32.95% of the patients (234/710) presented with CLNM. Tumor diameter, multifocality, lymph nodes identified via ultrasound (US-LN), and extrathyroidal extension (ETE) were identified as independent predictors of CLNM. The RF model achieved the highest performance in the validation set with an AUC of 0.893(95%CI: 0.846-0.940), accuracy of 0.832, sensitivity of 0.764, specificity of 0.866, PPV of 0.743, NPV of 0.879, and F1-score of 0.753. Furthermore, the DCA demonstrated that the RF model exhibited a superior clinical net benefit.

Conclusion: Our model predicted the risk of CLNM in PTMC patients with high accuracy preoperatively.

预测甲状腺乳头状微癌的中央淋巴结转移:可解释机器学习的突破。
目的:建立并验证可解释的机器学习(ML)模型,用于甲状腺乳头状微癌(PTMC)中央淋巴结转移(CLNM)的术前预测。方法:2016年12月至2023年12月,回顾性分析710例行甲状腺切除术的PTMC患者。特征选择采用最小绝对收缩和选择算子(LASSO)回归方法,支持向量机递归特征消除(SVM-RFE)算法结合多元逻辑回归进行。针对CLNM的预测,开发了决策树、随机森林(RF)、k近邻、支持向量机、极端梯度增强、朴素贝叶斯、逻辑回归、轻梯度增强等8种ML算法。通过受试者工作特征曲线下面积(AUC)、决策曲线分析(DCA)、敏感性、特异性、准确性、阳性预测值(PPV)、阴性预测值(NPV)和F1评分来评价这些模型的性能。此外,利用Shapley加性解释(SHAP)算法对最优ML模型的结果进行澄清。结果:32.95%(234/710)的患者表现为CLNM。肿瘤直径、多灶性、超声发现的淋巴结(US-LN)和甲状腺外延伸(ETE)被确定为CLNM的独立预测因子。该模型的AUC为0.893(95%CI: 0.846 ~ 0.940),准确度为0.832,灵敏度为0.764,特异性为0.866,PPV为0.743,NPV为0.879,f1评分为0.753,在验证集中表现最佳。此外,DCA表明RF模型表现出优越的临床净收益。结论:该模型预测PTMC患者术前发生CLNM的风险具有较高的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Endocrinology
Frontiers in Endocrinology Medicine-Endocrinology, Diabetes and Metabolism
CiteScore
5.70
自引率
9.60%
发文量
3023
审稿时长
14 weeks
期刊介绍: Frontiers in Endocrinology is a field journal of the "Frontiers in" journal series. In today’s world, endocrinology is becoming increasingly important as it underlies many of the challenges societies face - from obesity and diabetes to reproduction, population control and aging. Endocrinology covers a broad field from basic molecular and cellular communication through to clinical care and some of the most crucial public health issues. The journal, thus, welcomes outstanding contributions in any domain of endocrinology. Frontiers in Endocrinology publishes articles on the most outstanding discoveries across a wide research spectrum of Endocrinology. The mission of Frontiers in Endocrinology is to bring all relevant Endocrinology areas together on a single platform.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信