一个可解释的机器学习模型的开发和验证，以预测乳头状甲状腺癌的德尔菲淋巴结转移：一项大型队列研究。

IF 3.3 3区医学 Q2 ONCOLOGY

Journal of Cancer Pub Date : 2025-03-03 eCollection Date: 2025-01-01 DOI:10.7150/jca.110141

Jie Cui, Genglong Liu, Kai Yue, Yansheng Wu, Yuansheng Duan, Minghui Wei, Xudong Wang

{"title":"一个可解释的机器学习模型的开发和验证，以预测乳头状甲状腺癌的德尔菲淋巴结转移：一项大型队列研究。","authors":"Jie Cui, Genglong Liu, Kai Yue, Yansheng Wu, Yuansheng Duan, Minghui Wei, Xudong Wang","doi":"10.7150/jca.110141","DOIUrl":null,"url":null,"abstract":"Background: The occurrence of papillary thyroid cancer (PTC) has risen substantially and tends to exhibit early-stage lymph node metastasis (LNM), increasing the risk of postoperative recurrence and decreasing survival. There is a lack of a machine learning (ML) model to predict delphian LNM (DLNM) in PTC. This investigation seeks to comprehensively assess the significance of standard clinical indicators for DLNM prediction, while constructing a dependable and widely applicable ensemble ML framework to support surgical planning and therapeutic decision-making. Methods: This investigation incorporated 1993 sequential PTC patients who underwent curative surgical procedures from 2020 to 2023. Based on the time to surgery, we divided the cohort into the training cohort (n=1395) and the validation cohort (n=598). The Boruta algorithm was applied to select feature variables, succeeded by the development of an innovative ML structure combining 12 ML techniques across 113 permutations to create a unified prediction model (DLNM index). ROC analysis, calibration curve, Bootstrapping, 10-fold cross validation, restricted cubic spline (RCS) regression, multivariable logistic regression, and subgroup analysis were utilised to evaluate the predictive accuracy and discriminative ability of the DLNM index. Model interpretation and feature impact visualisation were accomplished through the Shapley Additive Explanations (SHAP) methodology. Results: Based on 14 features via the Boruta algorithm selection, we integrated them into 12 ML approaches, yielding 113 permutations, from which we identified the superior algorithm to establish a consensus ML-derived diagnostic model (DLNM index). The DLNM index exhibited excellent diagnostic values with a mean AUC of 0.763 in two cohorts and discriminative ability, serving as an independent risk factor (P < 0.001). It performed better in predicting performance and yielded a larger net benefit than the published model (P < 0.05). Bootstrapping and 10-fold cross validation, and subgroup analysis showed that the DLNM index was generally robust and generalisable. SHAP explains the importance of ranking features (tumour size, right 4 region LN, FT4, TG, and T3) and visualises global and individual risk prediction. RCS regression suggested a nonlinear link between the DLNM index, TG, tumour size, FT3, and DLNM risk. Conclusion: An optimised explainable model (DLNM index) comprising 12 clinical features based on multiple ML algorithms was constructed and validated to provide an economical, readily available, and precise diagnostic instrument for DLNM in PTC, which has potential implications for clinical practice. The SHAP explanation and RCS regression quantify and visualise tumour size and FT4 as the most important variables that increase DLNM risk.","PeriodicalId":15183,"journal":{"name":"Journal of Cancer","volume":"16 6","pages":"2041-2061"},"PeriodicalIF":3.3000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905415/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of an explainable machine learning model to predict Delphian lymph node metastasis in papillary thyroid cancer: a large cohort study.\",\"authors\":\"Jie Cui, Genglong Liu, Kai Yue, Yansheng Wu, Yuansheng Duan, Minghui Wei, Xudong Wang\",\"doi\":\"10.7150/jca.110141\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: The occurrence of papillary thyroid cancer (PTC) has risen substantially and tends to exhibit early-stage lymph node metastasis (LNM), increasing the risk of postoperative recurrence and decreasing survival. There is a lack of a machine learning (ML) model to predict delphian LNM (DLNM) in PTC. This investigation seeks to comprehensively assess the significance of standard clinical indicators for DLNM prediction, while constructing a dependable and widely applicable ensemble ML framework to support surgical planning and therapeutic decision-making. Methods: This investigation incorporated 1993 sequential PTC patients who underwent curative surgical procedures from 2020 to 2023. Based on the time to surgery, we divided the cohort into the training cohort (n=1395) and the validation cohort (n=598). The Boruta algorithm was applied to select feature variables, succeeded by the development of an innovative ML structure combining 12 ML techniques across 113 permutations to create a unified prediction model (DLNM index). ROC analysis, calibration curve, Bootstrapping, 10-fold cross validation, restricted cubic spline (RCS) regression, multivariable logistic regression, and subgroup analysis were utilised to evaluate the predictive accuracy and discriminative ability of the DLNM index. Model interpretation and feature impact visualisation were accomplished through the Shapley Additive Explanations (SHAP) methodology. Results: Based on 14 features via the Boruta algorithm selection, we integrated them into 12 ML approaches, yielding 113 permutations, from which we identified the superior algorithm to establish a consensus ML-derived diagnostic model (DLNM index). The DLNM index exhibited excellent diagnostic values with a mean AUC of 0.763 in two cohorts and discriminative ability, serving as an independent risk factor (P < 0.001). It performed better in predicting performance and yielded a larger net benefit than the published model (P < 0.05). Bootstrapping and 10-fold cross validation, and subgroup analysis showed that the DLNM index was generally robust and generalisable. SHAP explains the importance of ranking features (tumour size, right 4 region LN, FT4, TG, and T3) and visualises global and individual risk prediction. RCS regression suggested a nonlinear link between the DLNM index, TG, tumour size, FT3, and DLNM risk. Conclusion: An optimised explainable model (DLNM index) comprising 12 clinical features based on multiple ML algorithms was constructed and validated to provide an economical, readily available, and precise diagnostic instrument for DLNM in PTC, which has potential implications for clinical practice. The SHAP explanation and RCS regression quantify and visualise tumour size and FT4 as the most important variables that increase DLNM risk.\",\"PeriodicalId\":15183,\"journal\":{\"name\":\"Journal of Cancer\",\"volume\":\"16 6\",\"pages\":\"2041-2061\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905415/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cancer\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.7150/jca.110141\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.7150/jca.110141","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

背景：甲状腺乳头状癌（PTC）的发病率大幅上升，并倾向于出现早期淋巴结转移（LNM），增加了术后复发的风险，降低了生存率。目前还缺乏预测PTC中delphian LNM （DLNM）的机器学习（ML）模型。本研究旨在全面评估标准临床指标对DLNM预测的意义，同时构建一个可靠且广泛适用的集成ML框架，以支持手术计划和治疗决策。方法：本研究纳入了1993例序贯PTC患者，这些患者于2020年至2023年接受了根治性手术。根据手术时间，我们将队列分为训练队列（n=1395）和验证队列（n=598）。Boruta算法用于选择特征变量，随后开发了一种创新的ML结构，结合113种排列的12种ML技术来创建统一的预测模型（DLNM指数）。采用ROC分析、校正曲线、Bootstrapping、10重交叉验证、限制性三次样条（RCS）回归、多变量logistic回归和亚组分析等方法评价DLNM指数的预测准确度和判别能力。模型解释和特征影响可视化是通过Shapley加性解释（SHAP）方法完成的。结果：基于Boruta算法选择的14个特征，我们将它们整合到12种ML方法中，产生113种排列，从中我们确定了更优的算法来建立共识ML衍生诊断模型（DLNM指数）。DLNM指数具有较好的诊断价值，两组的平均AUC为0.763，判别能力强，是独立的危险因素（P < 0.001）。与已发表的模型相比，该模型具有更好的预测性能和更大的净效益（P < 0.05）。Bootstrapping和10倍交叉验证以及亚组分析表明，DLNM指数总体上具有鲁棒性和通用性。SHAP解释了排序特征（肿瘤大小、右4区LN、FT4、TG和T3）的重要性，并将全球和个体风险预测可视化。RCS回归显示DLNM指数、TG、肿瘤大小、FT3和DLNM风险之间存在非线性联系。结论：基于多种ML算法构建了包含12个临床特征的优化可解释模型（DLNM指数），并对其进行了验证，为PTC DLNM提供了一种经济、方便、精确的诊断工具，具有潜在的临床应用价值。SHAP解释和RCS回归量化和可视化肿瘤大小和FT4作为增加DLNM风险的最重要变量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Development and validation of an explainable machine learning model to predict Delphian lymph node metastasis in papillary thyroid cancer: a large cohort study.

Background: The occurrence of papillary thyroid cancer (PTC) has risen substantially and tends to exhibit early-stage lymph node metastasis (LNM), increasing the risk of postoperative recurrence and decreasing survival. There is a lack of a machine learning (ML) model to predict delphian LNM (DLNM) in PTC. This investigation seeks to comprehensively assess the significance of standard clinical indicators for DLNM prediction, while constructing a dependable and widely applicable ensemble ML framework to support surgical planning and therapeutic decision-making. Methods: This investigation incorporated 1993 sequential PTC patients who underwent curative surgical procedures from 2020 to 2023. Based on the time to surgery, we divided the cohort into the training cohort (n=1395) and the validation cohort (n=598). The Boruta algorithm was applied to select feature variables, succeeded by the development of an innovative ML structure combining 12 ML techniques across 113 permutations to create a unified prediction model (DLNM index). ROC analysis, calibration curve, Bootstrapping, 10-fold cross validation, restricted cubic spline (RCS) regression, multivariable logistic regression, and subgroup analysis were utilised to evaluate the predictive accuracy and discriminative ability of the DLNM index. Model interpretation and feature impact visualisation were accomplished through the Shapley Additive Explanations (SHAP) methodology. Results: Based on 14 features via the Boruta algorithm selection, we integrated them into 12 ML approaches, yielding 113 permutations, from which we identified the superior algorithm to establish a consensus ML-derived diagnostic model (DLNM index). The DLNM index exhibited excellent diagnostic values with a mean AUC of 0.763 in two cohorts and discriminative ability, serving as an independent risk factor (P < 0.001). It performed better in predicting performance and yielded a larger net benefit than the published model (P < 0.05). Bootstrapping and 10-fold cross validation, and subgroup analysis showed that the DLNM index was generally robust and generalisable. SHAP explains the importance of ranking features (tumour size, right 4 region LN, FT4, TG, and T3) and visualises global and individual risk prediction. RCS regression suggested a nonlinear link between the DLNM index, TG, tumour size, FT3, and DLNM risk. Conclusion: An optimised explainable model (DLNM index) comprising 12 clinical features based on multiple ML algorithms was constructed and validated to provide an economical, readily available, and precise diagnostic instrument for DLNM in PTC, which has potential implications for clinical practice. The SHAP explanation and RCS regression quantify and visualise tumour size and FT4 as the most important variables that increase DLNM risk.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Cancer ONCOLOGY-

CiteScore

8.10

自引率

2.60%

发文量

333

审稿时长

12 weeks

期刊介绍： Journal of Cancer is an open access, peer-reviewed journal with broad scope covering all areas of cancer research, especially novel concepts, new methods, new regimens, new therapeutic agents, and alternative approaches for early detection and intervention of cancer. The Journal is supported by an international editorial board consisting of a distinguished team of cancer researchers. Journal of Cancer aims at rapid publication of high quality results in cancer research while maintaining rigorous peer-review process.