Xiaohui Lin, Tingting Liao, Yuting Yang, Rushan Ouyang, Yunshu Zhou, Xiaohui Lai, Jie Ma
{"title":"深度学习模型在预测乳腺成像报告和数据系统3和乳房x光检查4A病变中的价值。","authors":"Xiaohui Lin, Tingting Liao, Yuting Yang, Rushan Ouyang, Yunshu Zhou, Xiaohui Lai, Jie Ma","doi":"10.21037/qims-24-1523","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The diagnostic categorization of a lesion as Breast Imaging Reporting and Data System (BI-RADS) category 3 or 4A determines whether a patient needs a biopsy; however, interobserver variability exists among radiologists in mammographic interpretation. This variability may lead to underdiagnoses of BI-RADS 3 lesions and unnecessary biopsies of benign BI-RADS 4A lesions. Therefore, we assessed the diagnostic value of a mammography-based deep learning (DL) model for differentiating BI-RADS 3 and 4A lesions and its impact on radiologists' decision-making.</p><p><strong>Methods: </strong>This retrospective multicenter study analyzed 846 mammographically detected breast lesions (BI-RADS 3 and 4A) from 824 patients at Shenzhen People's Hospital and Shenzhen Luohu People's Hospital between January and December 2020. Six breast imaging specialists (three junior and three senior) independently reviewed all mammograms with and without DL model assistance. The follow-up or biopsy results were used as the reference standard. Diagnostic performance was evaluated using receiver operating characteristic (ROC) curves and the area under the curve (AUC), with the DeLong test being used to compare AUCs.</p><p><strong>Results: </strong>The DL model yielded an AUC of 0.74 for distinguishing BI-RADS 3 and 4A lesions, outperforming junior radiologists' standalone performance (AUC =0.57, AUC =0.55, and AUC =0.58) but remaining inferior to senior radiologists (AUC =0.78, AUC =0.77, and AUC =0.76). With DL model assistance, all six radiologists had higher AUCs for diagnosing BI-RADS 3 and 4A lesions as compared to their unassisted performance. Importantly, DL integration significantly increased junior radiologists' AUCs to 0.74-0.77 (P<0.001), whereas the increase in the AUCs of the senior radiologists (to 0.79, 0.78, and 0.78) was not significant (P>0.05).</p><p><strong>Conclusions: </strong>The mammography-based DL model significantly improved the diagnostic performance of junior radiologists for BI-RADS 3 and 4A lesions, effectively reducing missed diagnoses and unnecessary biopsies.</p>","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 5","pages":"4047-4058"},"PeriodicalIF":2.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12084680/pdf/","citationCount":"0","resultStr":"{\"title\":\"Value of deep learning model for predicting Breast Imaging Reporting and Data System 3 and 4A lesions on mammography.\",\"authors\":\"Xiaohui Lin, Tingting Liao, Yuting Yang, Rushan Ouyang, Yunshu Zhou, Xiaohui Lai, Jie Ma\",\"doi\":\"10.21037/qims-24-1523\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The diagnostic categorization of a lesion as Breast Imaging Reporting and Data System (BI-RADS) category 3 or 4A determines whether a patient needs a biopsy; however, interobserver variability exists among radiologists in mammographic interpretation. This variability may lead to underdiagnoses of BI-RADS 3 lesions and unnecessary biopsies of benign BI-RADS 4A lesions. Therefore, we assessed the diagnostic value of a mammography-based deep learning (DL) model for differentiating BI-RADS 3 and 4A lesions and its impact on radiologists' decision-making.</p><p><strong>Methods: </strong>This retrospective multicenter study analyzed 846 mammographically detected breast lesions (BI-RADS 3 and 4A) from 824 patients at Shenzhen People's Hospital and Shenzhen Luohu People's Hospital between January and December 2020. Six breast imaging specialists (three junior and three senior) independently reviewed all mammograms with and without DL model assistance. The follow-up or biopsy results were used as the reference standard. Diagnostic performance was evaluated using receiver operating characteristic (ROC) curves and the area under the curve (AUC), with the DeLong test being used to compare AUCs.</p><p><strong>Results: </strong>The DL model yielded an AUC of 0.74 for distinguishing BI-RADS 3 and 4A lesions, outperforming junior radiologists' standalone performance (AUC =0.57, AUC =0.55, and AUC =0.58) but remaining inferior to senior radiologists (AUC =0.78, AUC =0.77, and AUC =0.76). With DL model assistance, all six radiologists had higher AUCs for diagnosing BI-RADS 3 and 4A lesions as compared to their unassisted performance. Importantly, DL integration significantly increased junior radiologists' AUCs to 0.74-0.77 (P<0.001), whereas the increase in the AUCs of the senior radiologists (to 0.79, 0.78, and 0.78) was not significant (P>0.05).</p><p><strong>Conclusions: </strong>The mammography-based DL model significantly improved the diagnostic performance of junior radiologists for BI-RADS 3 and 4A lesions, effectively reducing missed diagnoses and unnecessary biopsies.</p>\",\"PeriodicalId\":54267,\"journal\":{\"name\":\"Quantitative Imaging in Medicine and Surgery\",\"volume\":\"15 5\",\"pages\":\"4047-4058\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12084680/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Imaging in Medicine and Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/qims-24-1523\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-24-1523","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Value of deep learning model for predicting Breast Imaging Reporting and Data System 3 and 4A lesions on mammography.
Background: The diagnostic categorization of a lesion as Breast Imaging Reporting and Data System (BI-RADS) category 3 or 4A determines whether a patient needs a biopsy; however, interobserver variability exists among radiologists in mammographic interpretation. This variability may lead to underdiagnoses of BI-RADS 3 lesions and unnecessary biopsies of benign BI-RADS 4A lesions. Therefore, we assessed the diagnostic value of a mammography-based deep learning (DL) model for differentiating BI-RADS 3 and 4A lesions and its impact on radiologists' decision-making.
Methods: This retrospective multicenter study analyzed 846 mammographically detected breast lesions (BI-RADS 3 and 4A) from 824 patients at Shenzhen People's Hospital and Shenzhen Luohu People's Hospital between January and December 2020. Six breast imaging specialists (three junior and three senior) independently reviewed all mammograms with and without DL model assistance. The follow-up or biopsy results were used as the reference standard. Diagnostic performance was evaluated using receiver operating characteristic (ROC) curves and the area under the curve (AUC), with the DeLong test being used to compare AUCs.
Results: The DL model yielded an AUC of 0.74 for distinguishing BI-RADS 3 and 4A lesions, outperforming junior radiologists' standalone performance (AUC =0.57, AUC =0.55, and AUC =0.58) but remaining inferior to senior radiologists (AUC =0.78, AUC =0.77, and AUC =0.76). With DL model assistance, all six radiologists had higher AUCs for diagnosing BI-RADS 3 and 4A lesions as compared to their unassisted performance. Importantly, DL integration significantly increased junior radiologists' AUCs to 0.74-0.77 (P<0.001), whereas the increase in the AUCs of the senior radiologists (to 0.79, 0.78, and 0.78) was not significant (P>0.05).
Conclusions: The mammography-based DL model significantly improved the diagnostic performance of junior radiologists for BI-RADS 3 and 4A lesions, effectively reducing missed diagnoses and unnecessary biopsies.