{"title":"基于变压器分割和混合特征学习的超声影像甲状腺结节自动多类分类","authors":"Mingshuang Fang , Qingfeng Ma , Binxiong Xu","doi":"10.1016/j.jrras.2025.101782","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>To develop and validate an end-to-end machine learning pipeline for automated multi-class classification of thyroid nodules in ultrasound imaging, using transformer-based segmentation and hybrid radiomic-deep feature integration to enhance clinical accuracy and reproducibility.</div></div><div><h3>Materials and methods</h3><div>In this multi-center study, 2654 ultrasound cases from five hospitals were used for model development, and 873 additional cases from an independent center were used for external validation. Thyroid nodules were segmented using four architectures: UNETR, nnU-Net, Swin-UNet, and UNet. From the segmented regions, handcrafted radiomic features and deep features extracted via Vision Transformer encoder layers were obtained. Features were filtered using ICC ≥0.75, followed by variance and correlation-based refinement. Three feature selection methods—Lasso, PCA, and Mutual Information—were evaluated. Classification was performed using XGBoost, Random Forest, and TabTransformer across six TI-RADS categories. Five-fold stratified cross-validation and external testing ensured robustness. Segmentation was assessed using Dice, Jaccard, and Hausdorff metrics; classification performance was evaluated via accuracy, AUC, and recall.</div></div><div><h3>Results</h3><div>UNETR achieved the highest segmentation performance and enabled the most accurate classification. The best outcome was observed with radiomic features selected via Lasso and classified with XGBoost (external accuracy: 93.0 %, AUC: 93.6 %, recall: 92.0 %). Deep features showed comparable results (accuracy: 92.8 %). Q-value analysis confirmed statistical superiority of the best-performing models. Differences across segmentation models significantly impacted classification performance, highlighting the importance of boundary quality. All models demonstrated strong generalizability and minimal overfitting.</div></div><div><h3>Conclusions</h3><div>The study demonstrates the feasibility and clinical value of a fully automated pipeline for TI-RADS-based thyroid nodule classification. The proposed framework is generalizable, interpretable, and suitable for integration into real-time diagnostic systems.</div></div>","PeriodicalId":16920,"journal":{"name":"Journal of Radiation Research and Applied Sciences","volume":"18 4","pages":"Article 101782"},"PeriodicalIF":2.5000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated multi-class classification of thyroid nodules in ultrasound imaging using transformer-based segmentation and hybrid feature learning\",\"authors\":\"Mingshuang Fang , Qingfeng Ma , Binxiong Xu\",\"doi\":\"10.1016/j.jrras.2025.101782\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>To develop and validate an end-to-end machine learning pipeline for automated multi-class classification of thyroid nodules in ultrasound imaging, using transformer-based segmentation and hybrid radiomic-deep feature integration to enhance clinical accuracy and reproducibility.</div></div><div><h3>Materials and methods</h3><div>In this multi-center study, 2654 ultrasound cases from five hospitals were used for model development, and 873 additional cases from an independent center were used for external validation. Thyroid nodules were segmented using four architectures: UNETR, nnU-Net, Swin-UNet, and UNet. From the segmented regions, handcrafted radiomic features and deep features extracted via Vision Transformer encoder layers were obtained. Features were filtered using ICC ≥0.75, followed by variance and correlation-based refinement. Three feature selection methods—Lasso, PCA, and Mutual Information—were evaluated. Classification was performed using XGBoost, Random Forest, and TabTransformer across six TI-RADS categories. Five-fold stratified cross-validation and external testing ensured robustness. Segmentation was assessed using Dice, Jaccard, and Hausdorff metrics; classification performance was evaluated via accuracy, AUC, and recall.</div></div><div><h3>Results</h3><div>UNETR achieved the highest segmentation performance and enabled the most accurate classification. The best outcome was observed with radiomic features selected via Lasso and classified with XGBoost (external accuracy: 93.0 %, AUC: 93.6 %, recall: 92.0 %). Deep features showed comparable results (accuracy: 92.8 %). Q-value analysis confirmed statistical superiority of the best-performing models. Differences across segmentation models significantly impacted classification performance, highlighting the importance of boundary quality. All models demonstrated strong generalizability and minimal overfitting.</div></div><div><h3>Conclusions</h3><div>The study demonstrates the feasibility and clinical value of a fully automated pipeline for TI-RADS-based thyroid nodule classification. The proposed framework is generalizable, interpretable, and suitable for integration into real-time diagnostic systems.</div></div>\",\"PeriodicalId\":16920,\"journal\":{\"name\":\"Journal of Radiation Research and Applied Sciences\",\"volume\":\"18 4\",\"pages\":\"Article 101782\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Radiation Research and Applied Sciences\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1687850725004947\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Radiation Research and Applied Sciences","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1687850725004947","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Automated multi-class classification of thyroid nodules in ultrasound imaging using transformer-based segmentation and hybrid feature learning
Objective
To develop and validate an end-to-end machine learning pipeline for automated multi-class classification of thyroid nodules in ultrasound imaging, using transformer-based segmentation and hybrid radiomic-deep feature integration to enhance clinical accuracy and reproducibility.
Materials and methods
In this multi-center study, 2654 ultrasound cases from five hospitals were used for model development, and 873 additional cases from an independent center were used for external validation. Thyroid nodules were segmented using four architectures: UNETR, nnU-Net, Swin-UNet, and UNet. From the segmented regions, handcrafted radiomic features and deep features extracted via Vision Transformer encoder layers were obtained. Features were filtered using ICC ≥0.75, followed by variance and correlation-based refinement. Three feature selection methods—Lasso, PCA, and Mutual Information—were evaluated. Classification was performed using XGBoost, Random Forest, and TabTransformer across six TI-RADS categories. Five-fold stratified cross-validation and external testing ensured robustness. Segmentation was assessed using Dice, Jaccard, and Hausdorff metrics; classification performance was evaluated via accuracy, AUC, and recall.
Results
UNETR achieved the highest segmentation performance and enabled the most accurate classification. The best outcome was observed with radiomic features selected via Lasso and classified with XGBoost (external accuracy: 93.0 %, AUC: 93.6 %, recall: 92.0 %). Deep features showed comparable results (accuracy: 92.8 %). Q-value analysis confirmed statistical superiority of the best-performing models. Differences across segmentation models significantly impacted classification performance, highlighting the importance of boundary quality. All models demonstrated strong generalizability and minimal overfitting.
Conclusions
The study demonstrates the feasibility and clinical value of a fully automated pipeline for TI-RADS-based thyroid nodule classification. The proposed framework is generalizable, interpretable, and suitable for integration into real-time diagnostic systems.
期刊介绍:
Journal of Radiation Research and Applied Sciences provides a high quality medium for the publication of substantial, original and scientific and technological papers on the development and applications of nuclear, radiation and isotopes in biology, medicine, drugs, biochemistry, microbiology, agriculture, entomology, food technology, chemistry, physics, solid states, engineering, environmental and applied sciences.