{"title":"超声影像诊断滤泡样甲状腺肿瘤的无创深度学习系统:一项多中心回顾性研究。","authors":"Hui Shen,Yue Huang,Wenxiao Yan,Chifa Zhang,Ting Liang,Dan Yang,Xiaoxiao Feng,Shuyi Liu,Yu Wang,Weihan Cao,Ying Cheng,Hongyan Chen,Qiujie Ni,Fei Wang,Jingjing You,Zhe Jin,Wenle He,Jie Sun,Dexing Yang,Lijuan Liu,Boling Cao,Xiao Zhang,Yingjia Li,Shufang Pei,Shuixing Zhang,Bin Zhang","doi":"10.1097/sla.0000000000006841","DOIUrl":null,"url":null,"abstract":"OBJECTIVE\r\nTo propose a deep learning (DL) system for the preoperative diagnosis of follicular-like thyroid neoplasms (FNs) using routine ultrasound images.\r\n\r\nSUMMARY BACKGROUND DATA\r\nPreoperative diagnosis of malignancy in nodules suspicious for an FN remains challenging. Ultrasound, fine-needle aspiration cytology, and intraoperative frozen section pathology cannot unambiguously distinguish between benign and malignant FNs, leading to unnecessary biopsies and operations in benign nodules.\r\n\r\nMETHODS\r\nThis multicenter, retrospective study included 3634 patients who underwent ultrasound and received a definite diagnosis of FN from 11 centers, comprising thyroid follicular adenoma (n=1748), follicular carcinoma (n=299), and follicular variant of papillary thyroid carcinoma (n=1587). Four DL models including Inception-v3, ResNet50, Inception-ResNet-v2, and DenseNet161 were constructed on a training set (n=2587, 6178 images) and were verified on an internal validation set (n=648, 1633 images) and an external validation set (n=399, 847 images). The diagnostic efficacy of the DL models was evaluated against the ACR TI-RADS regarding the area under the curve (AUC), sensitivity, specificity, and unnecessary biopsy rate.\r\n\r\nRESULTS\r\nWhen externally validated, the four DL models yielded robust and comparable performance, with AUCs of 82.2%-85.2%, sensitivities of 69.6%-76.0%, and specificities of 84.1%-89.2%, which outperformed the ACR TI-RADS. Compared to ACR TI-RADS, the DL models showed a higher biopsy rate of malignancy (71.6% -79.9% vs 37.7%, P<0.001) and a significantly lower unnecessary FNAB rate (8.5% -12.8% vs 40.7%, P<0.001).\r\n\r\nCONCLUSION\r\nThis study provides a noninvasive DL tool for accurate preoperative diagnosis of FNs, showing better performance than ACR TI-RADS and reducing unnecessary invasive interventions.","PeriodicalId":8017,"journal":{"name":"Annals of surgery","volume":"109 1","pages":""},"PeriodicalIF":6.4000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Noninvasive Deep Learning System for Preoperative Diagnosis of Follicular-Like Thyroid Neoplasms Using Ultrasound Images: A Multicenter, Retrospective Study.\",\"authors\":\"Hui Shen,Yue Huang,Wenxiao Yan,Chifa Zhang,Ting Liang,Dan Yang,Xiaoxiao Feng,Shuyi Liu,Yu Wang,Weihan Cao,Ying Cheng,Hongyan Chen,Qiujie Ni,Fei Wang,Jingjing You,Zhe Jin,Wenle He,Jie Sun,Dexing Yang,Lijuan Liu,Boling Cao,Xiao Zhang,Yingjia Li,Shufang Pei,Shuixing Zhang,Bin Zhang\",\"doi\":\"10.1097/sla.0000000000006841\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"OBJECTIVE\\r\\nTo propose a deep learning (DL) system for the preoperative diagnosis of follicular-like thyroid neoplasms (FNs) using routine ultrasound images.\\r\\n\\r\\nSUMMARY BACKGROUND DATA\\r\\nPreoperative diagnosis of malignancy in nodules suspicious for an FN remains challenging. Ultrasound, fine-needle aspiration cytology, and intraoperative frozen section pathology cannot unambiguously distinguish between benign and malignant FNs, leading to unnecessary biopsies and operations in benign nodules.\\r\\n\\r\\nMETHODS\\r\\nThis multicenter, retrospective study included 3634 patients who underwent ultrasound and received a definite diagnosis of FN from 11 centers, comprising thyroid follicular adenoma (n=1748), follicular carcinoma (n=299), and follicular variant of papillary thyroid carcinoma (n=1587). Four DL models including Inception-v3, ResNet50, Inception-ResNet-v2, and DenseNet161 were constructed on a training set (n=2587, 6178 images) and were verified on an internal validation set (n=648, 1633 images) and an external validation set (n=399, 847 images). The diagnostic efficacy of the DL models was evaluated against the ACR TI-RADS regarding the area under the curve (AUC), sensitivity, specificity, and unnecessary biopsy rate.\\r\\n\\r\\nRESULTS\\r\\nWhen externally validated, the four DL models yielded robust and comparable performance, with AUCs of 82.2%-85.2%, sensitivities of 69.6%-76.0%, and specificities of 84.1%-89.2%, which outperformed the ACR TI-RADS. Compared to ACR TI-RADS, the DL models showed a higher biopsy rate of malignancy (71.6% -79.9% vs 37.7%, P<0.001) and a significantly lower unnecessary FNAB rate (8.5% -12.8% vs 40.7%, P<0.001).\\r\\n\\r\\nCONCLUSION\\r\\nThis study provides a noninvasive DL tool for accurate preoperative diagnosis of FNs, showing better performance than ACR TI-RADS and reducing unnecessary invasive interventions.\",\"PeriodicalId\":8017,\"journal\":{\"name\":\"Annals of surgery\",\"volume\":\"109 1\",\"pages\":\"\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1097/sla.0000000000006841\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/sla.0000000000006841","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0
摘要
目的建立一种基于常规超声图像的甲状腺滤泡样肿瘤(FNs)术前诊断深度学习系统。背景资料疑似FN结节的术前诊断仍然具有挑战性。超声、细针穿刺细胞学和术中冷冻切片病理不能明确区分良性和恶性结节,导致不必要的良性结节活检和手术。方法本研究是一项多中心回顾性研究,纳入了来自11个中心的3634例经超声确诊为FN的患者,包括甲状腺滤泡性腺瘤(n=1748)、滤泡性癌(n=299)和滤泡变异型甲状腺乳头状癌(n=1587)。在训练集(n=2587、6178张图片)上构建了Inception-v3、ResNet50、Inception-ResNet-v2和DenseNet161四个深度学习模型,并在内部验证集(n=648、1633张图片)和外部验证集(n=399、847张图片)上进行了验证。根据ACR TI-RADS对DL模型的诊断效果进行评估,包括曲线下面积(AUC)、敏感性、特异性和不必要的活检率。结果经外部验证,4种DL模型的auc值为82.2% ~ 85.2%,灵敏度为69.6% ~ 76.0%,特异性为84.1% ~ 89.2%,均优于ACR TI-RADS。与ACR TI-RADS相比,DL模型显示更高的恶性肿瘤活检率(71.6% -79.9% vs 37.7%, P<0.001)和显著降低的不必要的FNAB率(8.5% -12.8% vs 40.7%, P<0.001)。结论本研究为FNs术前准确诊断提供了一种无创DL工具,其性能优于ACR TI-RADS,减少了不必要的侵入性干预。
Noninvasive Deep Learning System for Preoperative Diagnosis of Follicular-Like Thyroid Neoplasms Using Ultrasound Images: A Multicenter, Retrospective Study.
OBJECTIVE
To propose a deep learning (DL) system for the preoperative diagnosis of follicular-like thyroid neoplasms (FNs) using routine ultrasound images.
SUMMARY BACKGROUND DATA
Preoperative diagnosis of malignancy in nodules suspicious for an FN remains challenging. Ultrasound, fine-needle aspiration cytology, and intraoperative frozen section pathology cannot unambiguously distinguish between benign and malignant FNs, leading to unnecessary biopsies and operations in benign nodules.
METHODS
This multicenter, retrospective study included 3634 patients who underwent ultrasound and received a definite diagnosis of FN from 11 centers, comprising thyroid follicular adenoma (n=1748), follicular carcinoma (n=299), and follicular variant of papillary thyroid carcinoma (n=1587). Four DL models including Inception-v3, ResNet50, Inception-ResNet-v2, and DenseNet161 were constructed on a training set (n=2587, 6178 images) and were verified on an internal validation set (n=648, 1633 images) and an external validation set (n=399, 847 images). The diagnostic efficacy of the DL models was evaluated against the ACR TI-RADS regarding the area under the curve (AUC), sensitivity, specificity, and unnecessary biopsy rate.
RESULTS
When externally validated, the four DL models yielded robust and comparable performance, with AUCs of 82.2%-85.2%, sensitivities of 69.6%-76.0%, and specificities of 84.1%-89.2%, which outperformed the ACR TI-RADS. Compared to ACR TI-RADS, the DL models showed a higher biopsy rate of malignancy (71.6% -79.9% vs 37.7%, P<0.001) and a significantly lower unnecessary FNAB rate (8.5% -12.8% vs 40.7%, P<0.001).
CONCLUSION
This study provides a noninvasive DL tool for accurate preoperative diagnosis of FNs, showing better performance than ACR TI-RADS and reducing unnecessary invasive interventions.
期刊介绍:
The Annals of Surgery is a renowned surgery journal, recognized globally for its extensive scholarly references. It serves as a valuable resource for the international medical community by disseminating knowledge regarding important developments in surgical science and practice. Surgeons regularly turn to the Annals of Surgery to stay updated on innovative practices and techniques. The journal also offers special editorial features such as "Advances in Surgical Technique," offering timely coverage of ongoing clinical issues. Additionally, the journal publishes monthly review articles that address the latest concerns in surgical practice.