The impact of deep learning on diagnostic performance in the differentiation of benign and malignant thyroid nodules.

Esat Kaba, Merve Solak, Ayşenur Topçu Varlık, Yusuf Çubukçu, Lütfullah Sağır, Kubilay Muhammed Sünnetci, Ahmet Alkan, Hasan Gündoğdu, Fatma Beyazal Çeliker, Mehmet Beyazal
{"title":"The impact of deep learning on diagnostic performance in the differentiation of benign and malignant thyroid nodules.","authors":"Esat Kaba, Merve Solak, Ayşenur Topçu Varlık, Yusuf Çubukçu, Lütfullah Sağır, Kubilay Muhammed Sünnetci, Ahmet Alkan, Hasan Gündoğdu, Fatma Beyazal Çeliker, Mehmet Beyazal","doi":"10.11152/mu-4432","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>This study aims to use deep learning (DL) to classify thyroid nodules as benign and malignant with ultrasonography (US). In addition, this study investigates the impact of DL on the diagnostic success of radiologists with different experiences. Material and methods: This study included 576 US images of thyroid nodules. The dataset was divided into 80% training and 20% test sets. Four radiologists with different levels of experience classified the images in the test set as benign-malignant. A DL model was then trained with the train set and predicted benign-malignant for the test set. Then, the output of the DL model for each nodule in the test set was presented to 4 radiologists, who were asked to make a benign-malignant classification again considering these DL results.</p><p><strong>Results: </strong>The accuracy of the DL model was 0.9391. The accuracy for junior resident (JR) 1, JR 2, senior resident (SR), and senior radiologist (Srad) before DL-assisting were 0.7043, 0.7826, 0.8435, and 0.8522 respectively. The accuracy in DL-assisted classifications was 0.9130, 0.8696, 0.9304, and 0.9043 for JR 1, JR2, SR, and Srad, respectively. DL assistance changed the decisions of less experienced radiologists more than more experienced radiologists. Conclusion: The DL model has superior accuracy in classifying thyroid nodules as benign-malignant with US images than radiologists with different levels of experience. Additionally, all radiologists, and most notably less experienced radiology residents, increased their accuracy in DL-assisted predictions.</p>","PeriodicalId":94138,"journal":{"name":"Medical ultrasonography","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical ultrasonography","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11152/mu-4432","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: This study aims to use deep learning (DL) to classify thyroid nodules as benign and malignant with ultrasonography (US). In addition, this study investigates the impact of DL on the diagnostic success of radiologists with different experiences. Material and methods: This study included 576 US images of thyroid nodules. The dataset was divided into 80% training and 20% test sets. Four radiologists with different levels of experience classified the images in the test set as benign-malignant. A DL model was then trained with the train set and predicted benign-malignant for the test set. Then, the output of the DL model for each nodule in the test set was presented to 4 radiologists, who were asked to make a benign-malignant classification again considering these DL results.

Results: The accuracy of the DL model was 0.9391. The accuracy for junior resident (JR) 1, JR 2, senior resident (SR), and senior radiologist (Srad) before DL-assisting were 0.7043, 0.7826, 0.8435, and 0.8522 respectively. The accuracy in DL-assisted classifications was 0.9130, 0.8696, 0.9304, and 0.9043 for JR 1, JR2, SR, and Srad, respectively. DL assistance changed the decisions of less experienced radiologists more than more experienced radiologists. Conclusion: The DL model has superior accuracy in classifying thyroid nodules as benign-malignant with US images than radiologists with different levels of experience. Additionally, all radiologists, and most notably less experienced radiology residents, increased their accuracy in DL-assisted predictions.

深度学习对甲状腺结节良恶性鉴别诊断性能的影响。
目的:本研究旨在利用深度学习(DL)对甲状腺结节进行良性和恶性超声波成像(US)分类。此外,本研究还探讨了深度学习对具有不同经验的放射科医生诊断成功率的影响。材料和方法:本研究包括 576 张甲状腺结节的 US 图像。数据集分为 80% 的训练集和 20% 的测试集。四名具有不同经验水平的放射科医生对测试集中的图像进行良恶性分类。然后用训练集训练 DL 模型,并预测测试集的良恶性。然后,将测试集中每个结节的 DL 模型输出结果提交给 4 位放射科医生,要求他们根据这些 DL 结果再次进行良恶性分类:结果:DL 模型的准确率为 0.9391。在 DL 辅助之前,初级住院医师 (JR) 1、JR 2、高级住院医师 (SR) 和高级放射科医师 (Srad) 的准确率分别为 0.7043、0.7826、0.8435 和 0.8522。在 DL 辅助下,JR 1、JR 2、SR 和 Srad 的分类准确率分别为 0.9130、0.8696、0.9304 和 0.9043。与经验丰富的放射科医生相比,经验不足的放射科医生在 DL 辅助下做出的决定变化更大。结论DL 模型在利用 US 图像将甲状腺结节分为良性-恶性方面的准确性优于不同经验水平的放射科医生。此外,所有放射科医生,尤其是经验较少的放射科住院医生,在 DL 辅助预测中的准确率都有所提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信