Wenwu Lu, Di Zhang, Wang Zhou, Wei Wei, Xin Wu, Wenbo Ding, Chaoxue Zhang
{"title":"基于深度学习特征的超声图像诊断甲状腺结节:在线动态图和梯度加权类激活映射。","authors":"Wenwu Lu, Di Zhang, Wang Zhou, Wei Wei, Xin Wu, Wenbo Ding, Chaoxue Zhang","doi":"10.21037/qims-2025-159","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The high incidence of thyroid nodules (TNs) necessitates accurate and effective differentiation between benign and malignant cases to avoid overtreatment, which is crucial for both patients and doctors. The aim of this study was to develop and visualize an integrated model to enhance the diagnostic capability of young radiologists in evaluating TNs.</p><p><strong>Methods: </strong>A retrospective collection of 1,501 ultrasound (US) images of TNs were randomly divided into training and validation sets. An independent test set comprised 541 patients from The First Affiliated Hospital of Anhui Medical University and the Affiliated Hospital of Integration Chinese and Western Medicine with Nanjing University of Traditional Chinese Medicine. We fine-tuned five ImageNet-pretrained deep learning (DL) models via transfer learning (TL) on US images to generate prediction scores and construct the final model. Gradient-weighted class activation mapping (Grad-CAM) was employed to highlight sensitive areas of the US images that contribute to nodule classification. Additionally, a comprehensive model was established utilizing both US image features and DL features, and an online dynamic nomogram was subsequently created for practical application. Models were compared and evaluated for discrimination, calibration, and effectiveness, to ascertain whether the DL model can improve radiologists' diagnosis of TNs.</p><p><strong>Results: </strong>The DL model demonstrated superior performance compared to the US model, with area under the receiver operating characteristic (ROC) curve (AUC) values of 0.875 and 0.787 on the test set, respectively. When combined into a comprehensive diagnostic model, a significant improvement was observed (test set AUC: 0.907). The Net Reclassification Index (NRI) results indicate that the heat maps and scores output by the DL model help to improve radiologists' classification accuracy in distinguishing between benign and malignant nodules.</p><p><strong>Conclusions: </strong>The integrated model based on US image features and DL features demonstrates good diagnostic performance for distinguishing between benign and malignant TNs.</p>","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 6","pages":"5689-5702"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12209616/pdf/","citationCount":"0","resultStr":"{\"title\":\"Diagnosis of thyroid nodules using ultrasound images based on deep learning features: online dynamic nomogram and gradient-weighted class activation mapping.\",\"authors\":\"Wenwu Lu, Di Zhang, Wang Zhou, Wei Wei, Xin Wu, Wenbo Ding, Chaoxue Zhang\",\"doi\":\"10.21037/qims-2025-159\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The high incidence of thyroid nodules (TNs) necessitates accurate and effective differentiation between benign and malignant cases to avoid overtreatment, which is crucial for both patients and doctors. The aim of this study was to develop and visualize an integrated model to enhance the diagnostic capability of young radiologists in evaluating TNs.</p><p><strong>Methods: </strong>A retrospective collection of 1,501 ultrasound (US) images of TNs were randomly divided into training and validation sets. An independent test set comprised 541 patients from The First Affiliated Hospital of Anhui Medical University and the Affiliated Hospital of Integration Chinese and Western Medicine with Nanjing University of Traditional Chinese Medicine. We fine-tuned five ImageNet-pretrained deep learning (DL) models via transfer learning (TL) on US images to generate prediction scores and construct the final model. Gradient-weighted class activation mapping (Grad-CAM) was employed to highlight sensitive areas of the US images that contribute to nodule classification. Additionally, a comprehensive model was established utilizing both US image features and DL features, and an online dynamic nomogram was subsequently created for practical application. Models were compared and evaluated for discrimination, calibration, and effectiveness, to ascertain whether the DL model can improve radiologists' diagnosis of TNs.</p><p><strong>Results: </strong>The DL model demonstrated superior performance compared to the US model, with area under the receiver operating characteristic (ROC) curve (AUC) values of 0.875 and 0.787 on the test set, respectively. When combined into a comprehensive diagnostic model, a significant improvement was observed (test set AUC: 0.907). The Net Reclassification Index (NRI) results indicate that the heat maps and scores output by the DL model help to improve radiologists' classification accuracy in distinguishing between benign and malignant nodules.</p><p><strong>Conclusions: </strong>The integrated model based on US image features and DL features demonstrates good diagnostic performance for distinguishing between benign and malignant TNs.</p>\",\"PeriodicalId\":54267,\"journal\":{\"name\":\"Quantitative Imaging in Medicine and Surgery\",\"volume\":\"15 6\",\"pages\":\"5689-5702\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12209616/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Imaging in Medicine and Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/qims-2025-159\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/5/30 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-2025-159","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Diagnosis of thyroid nodules using ultrasound images based on deep learning features: online dynamic nomogram and gradient-weighted class activation mapping.
Background: The high incidence of thyroid nodules (TNs) necessitates accurate and effective differentiation between benign and malignant cases to avoid overtreatment, which is crucial for both patients and doctors. The aim of this study was to develop and visualize an integrated model to enhance the diagnostic capability of young radiologists in evaluating TNs.
Methods: A retrospective collection of 1,501 ultrasound (US) images of TNs were randomly divided into training and validation sets. An independent test set comprised 541 patients from The First Affiliated Hospital of Anhui Medical University and the Affiliated Hospital of Integration Chinese and Western Medicine with Nanjing University of Traditional Chinese Medicine. We fine-tuned five ImageNet-pretrained deep learning (DL) models via transfer learning (TL) on US images to generate prediction scores and construct the final model. Gradient-weighted class activation mapping (Grad-CAM) was employed to highlight sensitive areas of the US images that contribute to nodule classification. Additionally, a comprehensive model was established utilizing both US image features and DL features, and an online dynamic nomogram was subsequently created for practical application. Models were compared and evaluated for discrimination, calibration, and effectiveness, to ascertain whether the DL model can improve radiologists' diagnosis of TNs.
Results: The DL model demonstrated superior performance compared to the US model, with area under the receiver operating characteristic (ROC) curve (AUC) values of 0.875 and 0.787 on the test set, respectively. When combined into a comprehensive diagnostic model, a significant improvement was observed (test set AUC: 0.907). The Net Reclassification Index (NRI) results indicate that the heat maps and scores output by the DL model help to improve radiologists' classification accuracy in distinguishing between benign and malignant nodules.
Conclusions: The integrated model based on US image features and DL features demonstrates good diagnostic performance for distinguishing between benign and malignant TNs.