深度学习模型在预测乳腺成像报告和数据系统3和乳房x光检查4A病变中的价值。

IF 2.9 2区医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Quantitative Imaging in Medicine and Surgery Pub Date : 2025-05-01 Epub Date: 2025-04-25 DOI:10.21037/qims-24-1523

Xiaohui Lin, Tingting Liao, Yuting Yang, Rushan Ouyang, Yunshu Zhou, Xiaohui Lai, Jie Ma

{"title":"深度学习模型在预测乳腺成像报告和数据系统3和乳房x光检查4A病变中的价值。","authors":"Xiaohui Lin, Tingting Liao, Yuting Yang, Rushan Ouyang, Yunshu Zhou, Xiaohui Lai, Jie Ma","doi":"10.21037/qims-24-1523","DOIUrl":null,"url":null,"abstract":"Background: The diagnostic categorization of a lesion as Breast Imaging Reporting and Data System (BI-RADS) category 3 or 4A determines whether a patient needs a biopsy; however, interobserver variability exists among radiologists in mammographic interpretation. This variability may lead to underdiagnoses of BI-RADS 3 lesions and unnecessary biopsies of benign BI-RADS 4A lesions. Therefore, we assessed the diagnostic value of a mammography-based deep learning (DL) model for differentiating BI-RADS 3 and 4A lesions and its impact on radiologists' decision-making.Methods: This retrospective multicenter study analyzed 846 mammographically detected breast lesions (BI-RADS 3 and 4A) from 824 patients at Shenzhen People's Hospital and Shenzhen Luohu People's Hospital between January and December 2020. Six breast imaging specialists (three junior and three senior) independently reviewed all mammograms with and without DL model assistance. The follow-up or biopsy results were used as the reference standard. Diagnostic performance was evaluated using receiver operating characteristic (ROC) curves and the area under the curve (AUC), with the DeLong test being used to compare AUCs.Results: The DL model yielded an AUC of 0.74 for distinguishing BI-RADS 3 and 4A lesions, outperforming junior radiologists' standalone performance (AUC =0.57, AUC =0.55, and AUC =0.58) but remaining inferior to senior radiologists (AUC =0.78, AUC =0.77, and AUC =0.76). With DL model assistance, all six radiologists had higher AUCs for diagnosing BI-RADS 3 and 4A lesions as compared to their unassisted performance. Importantly, DL integration significantly increased junior radiologists' AUCs to 0.74-0.77 (P<0.001), whereas the increase in the AUCs of the senior radiologists (to 0.79, 0.78, and 0.78) was not significant (P>0.05).Conclusions: The mammography-based DL model significantly improved the diagnostic performance of junior radiologists for BI-RADS 3 and 4A lesions, effectively reducing missed diagnoses and unnecessary biopsies.","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 5","pages":"4047-4058"},"PeriodicalIF":2.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12084680/pdf/","citationCount":"0","resultStr":"{\"title\":\"Value of deep learning model for predicting Breast Imaging Reporting and Data System 3 and 4A lesions on mammography.\",\"authors\":\"Xiaohui Lin, Tingting Liao, Yuting Yang, Rushan Ouyang, Yunshu Zhou, Xiaohui Lai, Jie Ma\",\"doi\":\"10.21037/qims-24-1523\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: The diagnostic categorization of a lesion as Breast Imaging Reporting and Data System (BI-RADS) category 3 or 4A determines whether a patient needs a biopsy; however, interobserver variability exists among radiologists in mammographic interpretation. This variability may lead to underdiagnoses of BI-RADS 3 lesions and unnecessary biopsies of benign BI-RADS 4A lesions. Therefore, we assessed the diagnostic value of a mammography-based deep learning (DL) model for differentiating BI-RADS 3 and 4A lesions and its impact on radiologists' decision-making.Methods: This retrospective multicenter study analyzed 846 mammographically detected breast lesions (BI-RADS 3 and 4A) from 824 patients at Shenzhen People's Hospital and Shenzhen Luohu People's Hospital between January and December 2020. Six breast imaging specialists (three junior and three senior) independently reviewed all mammograms with and without DL model assistance. The follow-up or biopsy results were used as the reference standard. Diagnostic performance was evaluated using receiver operating characteristic (ROC) curves and the area under the curve (AUC), with the DeLong test being used to compare AUCs.Results: The DL model yielded an AUC of 0.74 for distinguishing BI-RADS 3 and 4A lesions, outperforming junior radiologists' standalone performance (AUC =0.57, AUC =0.55, and AUC =0.58) but remaining inferior to senior radiologists (AUC =0.78, AUC =0.77, and AUC =0.76). With DL model assistance, all six radiologists had higher AUCs for diagnosing BI-RADS 3 and 4A lesions as compared to their unassisted performance. Importantly, DL integration significantly increased junior radiologists' AUCs to 0.74-0.77 (P<0.001), whereas the increase in the AUCs of the senior radiologists (to 0.79, 0.78, and 0.78) was not significant (P>0.05).Conclusions: The mammography-based DL model significantly improved the diagnostic performance of junior radiologists for BI-RADS 3 and 4A lesions, effectively reducing missed diagnoses and unnecessary biopsies.\",\"PeriodicalId\":54267,\"journal\":{\"name\":\"Quantitative Imaging in Medicine and Surgery\",\"volume\":\"15 5\",\"pages\":\"4047-4058\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12084680/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Imaging in Medicine and Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/qims-24-1523\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-24-1523","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

背景：病变的诊断分类为乳腺成像报告和数据系统（BI-RADS）分类3或4A决定患者是否需要活检；然而，在乳房x线摄影解释中，放射科医生之间存在观察者之间的差异。这种变异性可能导致BI-RADS 3病变的漏诊，以及对良性BI-RADS 4A病变进行不必要的活检。因此，我们评估了基于乳腺x线摄影的深度学习（DL）模型在区分BI-RADS 3和4A病变方面的诊断价值及其对放射科医生决策的影响。方法：本回顾性多中心研究分析了2020年1月至12月在深圳市人民医院和深圳市罗湖人民医院就诊的824例患者的846例乳房x线检查发现的乳腺病变（BI-RADS 3和4A）。六名乳腺成像专家（三名初级和三名高级）独立审查了所有有或没有DL模型辅助的乳房x光片。随访或活检结果作为参考标准。使用受试者工作特征（ROC）曲线和曲线下面积（AUC）评估诊断效果，并使用DeLong检验比较AUC。结果：DL模型区分BI-RADS 3和4A病变的AUC为0.74，优于初级放射科医师的独立表现（AUC =0.57, AUC =0.55, AUC =0.58），但仍低于高级放射科医师（AUC =0.78, AUC =0.77, AUC =0.76）。在DL模型的帮助下，所有六名放射科医生在诊断BI-RADS 3和4A病变时的auc都高于他们没有辅助的表现。重要的是，DL整合显著提高了初级放射科医师的auc，达到0.74-0.77 （P0.05）。结论：基于乳腺x线摄影的DL模型显著提高了初级放射科医师对BI-RADS 3和4A病变的诊断能力，有效减少了漏诊和不必要的活检。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Value of deep learning model for predicting Breast Imaging Reporting and Data System 3 and 4A lesions on mammography.

Background: The diagnostic categorization of a lesion as Breast Imaging Reporting and Data System (BI-RADS) category 3 or 4A determines whether a patient needs a biopsy; however, interobserver variability exists among radiologists in mammographic interpretation. This variability may lead to underdiagnoses of BI-RADS 3 lesions and unnecessary biopsies of benign BI-RADS 4A lesions. Therefore, we assessed the diagnostic value of a mammography-based deep learning (DL) model for differentiating BI-RADS 3 and 4A lesions and its impact on radiologists' decision-making.

Methods: This retrospective multicenter study analyzed 846 mammographically detected breast lesions (BI-RADS 3 and 4A) from 824 patients at Shenzhen People's Hospital and Shenzhen Luohu People's Hospital between January and December 2020. Six breast imaging specialists (three junior and three senior) independently reviewed all mammograms with and without DL model assistance. The follow-up or biopsy results were used as the reference standard. Diagnostic performance was evaluated using receiver operating characteristic (ROC) curves and the area under the curve (AUC), with the DeLong test being used to compare AUCs.

Results: The DL model yielded an AUC of 0.74 for distinguishing BI-RADS 3 and 4A lesions, outperforming junior radiologists' standalone performance (AUC =0.57, AUC =0.55, and AUC =0.58) but remaining inferior to senior radiologists (AUC =0.78, AUC =0.77, and AUC =0.76). With DL model assistance, all six radiologists had higher AUCs for diagnosing BI-RADS 3 and 4A lesions as compared to their unassisted performance. Importantly, DL integration significantly increased junior radiologists' AUCs to 0.74-0.77 (P<0.001), whereas the increase in the AUCs of the senior radiologists (to 0.79, 0.78, and 0.78) was not significant (P>0.05).

Conclusions: The mammography-based DL model significantly improved the diagnostic performance of junior radiologists for BI-RADS 3 and 4A lesions, effectively reducing missed diagnoses and unnecessary biopsies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Quantitative Imaging in Medicine and Surgery Medicine-Radiology, Nuclear Medicine and Imaging

CiteScore

4.20

自引率

17.90%

发文量

252

期刊介绍： Information not localized