基于变压器分割和混合特征学习的超声影像甲状腺结节自动多类分类

IF 2.5 4区综合性期刊 Q2 MULTIDISCIPLINARY SCIENCES

Journal of Radiation Research and Applied Sciences Pub Date : 2025-08-18 DOI:10.1016/j.jrras.2025.101782

Mingshuang Fang , Qingfeng Ma , Binxiong Xu

{"title":"基于变压器分割和混合特征学习的超声影像甲状腺结节自动多类分类","authors":"Mingshuang Fang , Qingfeng Ma , Binxiong Xu","doi":"10.1016/j.jrras.2025.101782","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>To develop and validate an end-to-end machine learning pipeline for automated multi-class classification of thyroid nodules in ultrasound imaging, using transformer-based segmentation and hybrid radiomic-deep feature integration to enhance clinical accuracy and reproducibility.</div></div><div><h3>Materials and methods</h3><div>In this multi-center study, 2654 ultrasound cases from five hospitals were used for model development, and 873 additional cases from an independent center were used for external validation. Thyroid nodules were segmented using four architectures: UNETR, nnU-Net, Swin-UNet, and UNet. From the segmented regions, handcrafted radiomic features and deep features extracted via Vision Transformer encoder layers were obtained. Features were filtered using ICC ≥0.75, followed by variance and correlation-based refinement. Three feature selection methods—Lasso, PCA, and Mutual Information—were evaluated. Classification was performed using XGBoost, Random Forest, and TabTransformer across six TI-RADS categories. Five-fold stratified cross-validation and external testing ensured robustness. Segmentation was assessed using Dice, Jaccard, and Hausdorff metrics; classification performance was evaluated via accuracy, AUC, and recall.</div></div><div><h3>Results</h3><div>UNETR achieved the highest segmentation performance and enabled the most accurate classification. The best outcome was observed with radiomic features selected via Lasso and classified with XGBoost (external accuracy: 93.0 %, AUC: 93.6 %, recall: 92.0 %). Deep features showed comparable results (accuracy: 92.8 %). Q-value analysis confirmed statistical superiority of the best-performing models. Differences across segmentation models significantly impacted classification performance, highlighting the importance of boundary quality. All models demonstrated strong generalizability and minimal overfitting.</div></div><div><h3>Conclusions</h3><div>The study demonstrates the feasibility and clinical value of a fully automated pipeline for TI-RADS-based thyroid nodule classification. The proposed framework is generalizable, interpretable, and suitable for integration into real-time diagnostic systems.</div></div>","PeriodicalId":16920,"journal":{"name":"Journal of Radiation Research and Applied Sciences","volume":"18 4","pages":"Article 101782"},"PeriodicalIF":2.5000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated multi-class classification of thyroid nodules in ultrasound imaging using transformer-based segmentation and hybrid feature learning\",\"authors\":\"Mingshuang Fang , Qingfeng Ma , Binxiong Xu\",\"doi\":\"10.1016/j.jrras.2025.101782\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>To develop and validate an end-to-end machine learning pipeline for automated multi-class classification of thyroid nodules in ultrasound imaging, using transformer-based segmentation and hybrid radiomic-deep feature integration to enhance clinical accuracy and reproducibility.</div></div><div><h3>Materials and methods</h3><div>In this multi-center study, 2654 ultrasound cases from five hospitals were used for model development, and 873 additional cases from an independent center were used for external validation. Thyroid nodules were segmented using four architectures: UNETR, nnU-Net, Swin-UNet, and UNet. From the segmented regions, handcrafted radiomic features and deep features extracted via Vision Transformer encoder layers were obtained. Features were filtered using ICC ≥0.75, followed by variance and correlation-based refinement. Three feature selection methods—Lasso, PCA, and Mutual Information—were evaluated. Classification was performed using XGBoost, Random Forest, and TabTransformer across six TI-RADS categories. Five-fold stratified cross-validation and external testing ensured robustness. Segmentation was assessed using Dice, Jaccard, and Hausdorff metrics; classification performance was evaluated via accuracy, AUC, and recall.</div></div><div><h3>Results</h3><div>UNETR achieved the highest segmentation performance and enabled the most accurate classification. The best outcome was observed with radiomic features selected via Lasso and classified with XGBoost (external accuracy: 93.0 %, AUC: 93.6 %, recall: 92.0 %). Deep features showed comparable results (accuracy: 92.8 %). Q-value analysis confirmed statistical superiority of the best-performing models. Differences across segmentation models significantly impacted classification performance, highlighting the importance of boundary quality. All models demonstrated strong generalizability and minimal overfitting.</div></div><div><h3>Conclusions</h3><div>The study demonstrates the feasibility and clinical value of a fully automated pipeline for TI-RADS-based thyroid nodule classification. The proposed framework is generalizable, interpretable, and suitable for integration into real-time diagnostic systems.</div></div>\",\"PeriodicalId\":16920,\"journal\":{\"name\":\"Journal of Radiation Research and Applied Sciences\",\"volume\":\"18 4\",\"pages\":\"Article 101782\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Radiation Research and Applied Sciences\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1687850725004947\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Radiation Research and Applied Sciences","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1687850725004947","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

目的开发并验证一种端到端机器学习流程，用于超声成像中甲状腺结节的自动多类别分类，该流程采用基于变压器的分割和混合放射学-深度特征集成来提高临床准确性和可重复性。材料和方法在这项多中心研究中，来自5家医院的2654例超声病例用于模型开发，另外来自一个独立中心的873例用于外部验证。甲状腺结节的分割采用四种架构：UNETR、nnU-Net、swan -UNet和UNet。从分割的区域中获得手工制作的放射特征和通过Vision Transformer编码器层提取的深度特征。使用ICC≥0.75对特征进行过滤，然后进行方差和基于相关性的细化。对lasso、PCA和Mutual information三种特征选择方法进行了评价。使用XGBoost、Random Forest和TabTransformer对六个TI-RADS类别进行分类。五重分层交叉验证和外部检验确保了稳健性。使用Dice、Jaccard和Hausdorff指标评估分割；分类性能通过准确率、AUC和召回率进行评估。结果tsunetr取得了最高的分割性能和最准确的分类。通过Lasso选择放射学特征并使用XGBoost分类观察到最佳结果（外部准确度：93.0%,AUC: 93.6%，召回率：92.0%）。深层特征显示出相似的结果（准确率：92.8%）。q值分析证实了表现最好的模型的统计优越性。分割模型之间的差异显著影响了分类性能，突出了边界质量的重要性。所有模型均表现出较强的泛化性和最小的过拟合。结论基于ti - rads的全自动甲状腺结节分类管道的可行性及临床应用价值。所提出的框架具有通用性、可解释性，适合集成到实时诊断系统中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automated multi-class classification of thyroid nodules in ultrasound imaging using transformer-based segmentation and hybrid feature learning

Objective

To develop and validate an end-to-end machine learning pipeline for automated multi-class classification of thyroid nodules in ultrasound imaging, using transformer-based segmentation and hybrid radiomic-deep feature integration to enhance clinical accuracy and reproducibility.

Materials and methods

In this multi-center study, 2654 ultrasound cases from five hospitals were used for model development, and 873 additional cases from an independent center were used for external validation. Thyroid nodules were segmented using four architectures: UNETR, nnU-Net, Swin-UNet, and UNet. From the segmented regions, handcrafted radiomic features and deep features extracted via Vision Transformer encoder layers were obtained. Features were filtered using ICC ≥0.75, followed by variance and correlation-based refinement. Three feature selection methods—Lasso, PCA, and Mutual Information—were evaluated. Classification was performed using XGBoost, Random Forest, and TabTransformer across six TI-RADS categories. Five-fold stratified cross-validation and external testing ensured robustness. Segmentation was assessed using Dice, Jaccard, and Hausdorff metrics; classification performance was evaluated via accuracy, AUC, and recall.

Results

UNETR achieved the highest segmentation performance and enabled the most accurate classification. The best outcome was observed with radiomic features selected via Lasso and classified with XGBoost (external accuracy: 93.0 %, AUC: 93.6 %, recall: 92.0 %). Deep features showed comparable results (accuracy: 92.8 %). Q-value analysis confirmed statistical superiority of the best-performing models. Differences across segmentation models significantly impacted classification performance, highlighting the importance of boundary quality. All models demonstrated strong generalizability and minimal overfitting.

Conclusions

The study demonstrates the feasibility and clinical value of a fully automated pipeline for TI-RADS-based thyroid nodule classification. The proposed framework is generalizable, interpretable, and suitable for integration into real-time diagnostic systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Radiation Research and Applied Sciences MULTIDISCIPLINARY SCIENCES-

自引率

5.90%

发文量

130

审稿时长

16 weeks

期刊介绍： Journal of Radiation Research and Applied Sciences provides a high quality medium for the publication of substantial, original and scientific and technological papers on the development and applications of nuclear, radiation and isotopes in biology, medicine, drugs, biochemistry, microbiology, agriculture, entomology, food technology, chemistry, physics, solid states, engineering, environmental and applied sciences.