Prediction of lateral lymph node metastasis with short diameter less than 8 mm in papillary thyroid carcinoma based on radiomics.

IF 3.5 2区 医学 Q2 ONCOLOGY
Yan Wang, Shuangqingyue Zhang, Minghui Zhang, Gaosen Zhang, Zhiguang Chen, Xuemei Wang, Ziyi Yang, Zijun Yu, He Ma, Zhihong Wang, Liang Sang
{"title":"Prediction of lateral lymph node metastasis with short diameter less than 8 mm in papillary thyroid carcinoma based on radiomics.","authors":"Yan Wang, Shuangqingyue Zhang, Minghui Zhang, Gaosen Zhang, Zhiguang Chen, Xuemei Wang, Ziyi Yang, Zijun Yu, He Ma, Zhihong Wang, Liang Sang","doi":"10.1186/s40644-024-00803-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The aim of this study was to establish an ensemble learning model based on clinicopathological parameter and ultrasound radomics for assessing the risk of lateral cervical lymph node with short diameter less than 8 mm (small lymph nodes were used instead) metastasis in patients with papillary thyroid cancer (PTC), thereby guiding the selection of surgical methods.</p><p><strong>Methods: </strong>This retrospective analysis was conducted on 454 patients diagnosed with papillary thyroid carcinoma who underwent total thyroidectomy and lateral neck lymph node dissection or lymph node intraoperative frozen section biopsy at the First Hospital of China Medical University between January 2015 and April 2022. In a ratio of 8:2, 362(80%) patients were assigned to the training set and 92(20%) patients were assigned to the test set. Clinical pathological features and radomics features related to ultrasound imaging were extracted, followed by feature selection using recursive feature elimination (RFE). Based on distinct feature sets, we constructed ensemble learning models comprising random forest (RF), extreme gradient boosting (XGBoost), categorical boosting (CatBoost), gradient boosting decision tree (GBDT), and light gradient boosting machine (Lightgbm) to develop clinical models, radiomics models, and clinical-radiomic models. Through the comparison of performance metrics such as area under curve (AUC), accuracy (ACC), specificity (SPE), precision (PRE), recall rate, F1 score, mean squared error (MSE) etc., we identified the optimal model and visualized its results using shapley additive exPlanations (SHAP).</p><p><strong>Results: </strong>In this study, a total of 454 patients were included, among whom 342 PTC patients had small lymph node metastasis in the lateral neck region, while 112 did not have any metastasis. A total of 1035 features were initially considered for inclusion in this study, which were then narrowed down to 10 clinical features, 8 radiomics features, and 17 combined clinical-omics features. Based on these three feature sets, a total of fifteen ensemble learning models were established. In the test set, RF model in the clinical model is outperforms other models (AUC = 0.72, F1 = 0.75, Jaccard = 0.60 and Recall = 0.84), while CatBoost model in the radiomics model is superior to other models (AUC = 0.91, BA = 0.83 and SPE = 0.76). Among the clinical-radiomic models, Catboost exhibits optimal performance (AUC = 0.93, ACC = 0.88, BA = 0.87, F1 = 0.91, SPE = 0.83, PRE = 0.88, Jaccard = 0.83 and Recall = 0.92). Using the SHAP algorithm to visualize the operation process of the clinical-omics CatBoost model, we found that clinical omics features such as central lymph node metastasis (CLNM), Origin_Shape_Sphericity (o_shap_sphericity), LoG-sigma3_first order_ Skewness (log-3_fo_skewness), wavelet-HH_first order_Skewness (w-HH_fo_skewness) and wavelet-HH_first order_Skewness (sqr_gldm_DNUN) had the greatest impact on predicting the presence of lateral cervical small lymph node metastasis in PTC patients.</p><p><strong>Conclusions: </strong>(1) In this study, among the ensemble learning models established based on clinicopathological features and radiomics features for predicting PTC lateral small lymph node metastasis, the clinical-radiomic CatBoost model has the best performance. (2) SHAP can visualize how the clinical and radiomics features affect the results and realize the interpretation of the model. (3) The combined CatBoost model can improve the diagnostic accuracy of suspicious lymph nodes with short diameter < 8 mm that are difficult to obtain accurate puncture results. The combined application of radiomics features is more accurate and reasonable than the prediction of clinical data alone, which helps to accurately evaluate the surgical scope and provide support for individual clinical decision making.</p>","PeriodicalId":9548,"journal":{"name":"Cancer Imaging","volume":"24 1","pages":"155"},"PeriodicalIF":3.5000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11566407/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40644-024-00803-7","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: The aim of this study was to establish an ensemble learning model based on clinicopathological parameter and ultrasound radomics for assessing the risk of lateral cervical lymph node with short diameter less than 8 mm (small lymph nodes were used instead) metastasis in patients with papillary thyroid cancer (PTC), thereby guiding the selection of surgical methods.

Methods: This retrospective analysis was conducted on 454 patients diagnosed with papillary thyroid carcinoma who underwent total thyroidectomy and lateral neck lymph node dissection or lymph node intraoperative frozen section biopsy at the First Hospital of China Medical University between January 2015 and April 2022. In a ratio of 8:2, 362(80%) patients were assigned to the training set and 92(20%) patients were assigned to the test set. Clinical pathological features and radomics features related to ultrasound imaging were extracted, followed by feature selection using recursive feature elimination (RFE). Based on distinct feature sets, we constructed ensemble learning models comprising random forest (RF), extreme gradient boosting (XGBoost), categorical boosting (CatBoost), gradient boosting decision tree (GBDT), and light gradient boosting machine (Lightgbm) to develop clinical models, radiomics models, and clinical-radiomic models. Through the comparison of performance metrics such as area under curve (AUC), accuracy (ACC), specificity (SPE), precision (PRE), recall rate, F1 score, mean squared error (MSE) etc., we identified the optimal model and visualized its results using shapley additive exPlanations (SHAP).

Results: In this study, a total of 454 patients were included, among whom 342 PTC patients had small lymph node metastasis in the lateral neck region, while 112 did not have any metastasis. A total of 1035 features were initially considered for inclusion in this study, which were then narrowed down to 10 clinical features, 8 radiomics features, and 17 combined clinical-omics features. Based on these three feature sets, a total of fifteen ensemble learning models were established. In the test set, RF model in the clinical model is outperforms other models (AUC = 0.72, F1 = 0.75, Jaccard = 0.60 and Recall = 0.84), while CatBoost model in the radiomics model is superior to other models (AUC = 0.91, BA = 0.83 and SPE = 0.76). Among the clinical-radiomic models, Catboost exhibits optimal performance (AUC = 0.93, ACC = 0.88, BA = 0.87, F1 = 0.91, SPE = 0.83, PRE = 0.88, Jaccard = 0.83 and Recall = 0.92). Using the SHAP algorithm to visualize the operation process of the clinical-omics CatBoost model, we found that clinical omics features such as central lymph node metastasis (CLNM), Origin_Shape_Sphericity (o_shap_sphericity), LoG-sigma3_first order_ Skewness (log-3_fo_skewness), wavelet-HH_first order_Skewness (w-HH_fo_skewness) and wavelet-HH_first order_Skewness (sqr_gldm_DNUN) had the greatest impact on predicting the presence of lateral cervical small lymph node metastasis in PTC patients.

Conclusions: (1) In this study, among the ensemble learning models established based on clinicopathological features and radiomics features for predicting PTC lateral small lymph node metastasis, the clinical-radiomic CatBoost model has the best performance. (2) SHAP can visualize how the clinical and radiomics features affect the results and realize the interpretation of the model. (3) The combined CatBoost model can improve the diagnostic accuracy of suspicious lymph nodes with short diameter < 8 mm that are difficult to obtain accurate puncture results. The combined application of radiomics features is more accurate and reasonable than the prediction of clinical data alone, which helps to accurately evaluate the surgical scope and provide support for individual clinical decision making.

基于放射组学预测甲状腺乳头状癌短径小于8毫米的侧淋巴结转移
研究目的本研究旨在建立一个基于临床病理参数和超声放射组学的集合学习模型,用于评估甲状腺乳头状癌(PTC)患者颈侧淋巴结短径小于8 mm(以小淋巴结代替)转移的风险,从而指导手术方式的选择:本研究对2015年1月至2022年4月期间在中国医科大学附属第一医院接受甲状腺全切除术和颈侧淋巴结清扫术或淋巴结术中冰冻切片活检的454例甲状腺乳头状癌患者进行了回顾性分析。按照 8:2 的比例,362 例(80%)患者被分配到训练集,92 例(20%)患者被分配到测试集。提取与超声成像相关的临床病理特征和放射组学特征,然后使用递归特征消除(RFE)进行特征选择。根据不同的特征集,我们构建了包括随机森林(RF)、极梯度提升(XGBoost)、分类提升(CatBoost)、梯度提升决策树(GBDT)和轻梯度提升机(Lightgbm)在内的集合学习模型,以开发临床模型、放射组学模型和临床-放射组学模型。通过比较曲线下面积(AUC)、准确性(ACC)、特异性(SPE)、精确性(PRE)、召回率、F1得分、均方误差(MSE)等性能指标,我们确定了最佳模型,并使用夏普利加性外计划(SHAP)将其结果可视化:本研究共纳入 454 例患者,其中 342 例 PTC 患者有颈侧小淋巴结转移,112 例无任何转移。本研究最初共考虑纳入 1035 个特征,然后将其缩小到 10 个临床特征、8 个放射组学特征和 17 个临床-组学组合特征。基于这三个特征集,共建立了 15 个集合学习模型。在测试集中,临床模型中的 RF 模型优于其他模型(AUC = 0.72、F1 = 0.75、Jaccard = 0.60 和 Recall = 0.84),而放射组学模型中的 CatBoost 模型优于其他模型(AUC = 0.91、BA = 0.83 和 SPE = 0.76)。在临床放射组学模型中,Catboost 表现出最佳性能(AUC = 0.93、ACC = 0.88、BA = 0.87、F1 = 0.91、SPE = 0.83、PRE = 0.88、Jaccard = 0.83 和 Recall = 0.92)。利用 SHAP 算法可视化临床组学 CatBoost 模型的运算过程,我们发现中心淋巴结转移(CLNM)、原点_形状_球形度(o_shap_sphericity)等临床组学特征在临床组学 CatBoost 模型的运算过程中发挥了重要作用、LoG-sigma3_first order_Skewness (log-3_fo_skewness)、wavelet-HH_first order_Skewness (w-HH_fo_skewness) 和 wavelet-HH_first order_Skewness (sqr_gldm_DNUN) 对预测 PTC 患者颈侧小淋巴结转移的影响最大。结论:(1)在本研究中,基于临床病理特征和放射组学特征建立的用于预测 PTC 侧方小淋巴结转移的集合学习模型中,临床-放射组学 CatBoost 模型的性能最佳。(2)SHAP 可以直观地显示临床特征和放射组学特征对结果的影响,实现对模型的解释。(3) 联合 CatBoost 模型可提高短直径可疑淋巴结的诊断准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cancer Imaging
Cancer Imaging ONCOLOGY-RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
CiteScore
7.00
自引率
0.00%
发文量
66
审稿时长
>12 weeks
期刊介绍: Cancer Imaging is an open access, peer-reviewed journal publishing original articles, reviews and editorials written by expert international radiologists working in oncology. The journal encompasses CT, MR, PET, ultrasound, radionuclide and multimodal imaging in all kinds of malignant tumours, plus new developments, techniques and innovations. Topics of interest include: Breast Imaging Chest Complications of treatment Ear, Nose & Throat Gastrointestinal Hepatobiliary & Pancreatic Imaging biomarkers Interventional Lymphoma Measurement of tumour response Molecular functional imaging Musculoskeletal Neuro oncology Nuclear Medicine Paediatric.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信