基于课程方法的高效注释、基于补丁、可解释的深度学习用于乳房x光筛查中的乳腺癌检测。

IF 4.1 2区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Ozden Camurdan, Toygar Tanyel, Esma Aktufan Cerekci, Deniz Alis, Emine Meltem, Nurper Denizoglu, Mustafa Ege Seker, Ilkay Oksuz, Ercan Karaarslan
{"title":"基于课程方法的高效注释、基于补丁、可解释的深度学习用于乳房x光筛查中的乳腺癌检测。","authors":"Ozden Camurdan, Toygar Tanyel, Esma Aktufan Cerekci, Deniz Alis, Emine Meltem, Nurper Denizoglu, Mustafa Ege Seker, Ilkay Oksuz, Ercan Karaarslan","doi":"10.1186/s13244-025-01922-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To develop an efficient deep learning (DL) model for breast cancer detection in mammograms, utilizing both weak (image-level) and strong (bounding boxes) annotations and providing explainable artificial intelligence (XAI) with gradient-weighted class activation mapping (Grad-CAM), assessed by the ground truth overlap ratio.</p><p><strong>Methods: </strong>Three radiologists annotated a balanced dataset of 1976 mammograms (cancer-positive and -negative) from three centers. We developed a patch-based DL model using curriculum learning, progressively increasing patch sizes during training. The model was trained under varying levels of strong supervision (0%, 20%, 40%, and 100% of the dataset), resulting in baseline, curriculum 20, curriculum 40, and curriculum 100 models. Training for each model was repeated ten times, with results presented as mean ± standard deviation. Model performance was also tested on an external dataset of 4276 mammograms to assess generalizability.</p><p><strong>Results: </strong>F1 scores for the baseline, curriculum 20, curriculum 40, and curriculum 100 models were 80.55 ± 0.88, 82.41 ± 0.47, 83.03 ± 0.31, and 83.95 ± 0.55, respectively, with ground truth overlap ratios of 60.26 ± 1.91, 62.13 ± 1.2, 62.26 ± 1.52, and 64.18 ± 1.37. In the external dataset, F1 scores were 74.65 ± 1.35, 77.77 ± 0.73, 78.23 ± 1.78, and 78.73 ± 1.25, respectively, maintaining a similar performance trend.</p><p><strong>Conclusion: </strong>Training DL models with a curriculum method and a patch-based approach yields satisfactory performance and XAI, even with a limited set of densely annotated data, offering a promising avenue for deploying DL in large-scale mammography datasets.</p><p><strong>Critical relevance: </strong>This study introduces a DL model for mammography-based breast cancer detection, utilizing curriculum learning with limited, strongly labeled data. It showcases performance gains and better explainability, addressing challenges of extensive dataset needs and DL's \"black-box\" nature.</p><p><strong>Key points: </strong>Increasing numbers of mammograms for radiologists to interpret pose a logistical challenge. We trained a DL model leveraging curriculum learning with mixed annotations for mammography. The DL model outperformed the baseline model with image-level annotations using only 20% of the strong labels. The study addresses the challenge of requiring extensive datasets and strong supervision for DL efficacy. The model demonstrated improved explainability through Grad-CAM, verified by a higher ground truth overlap ratio. He proposed approach also yielded robust performance on external testing data.</p>","PeriodicalId":13639,"journal":{"name":"Insights into Imaging","volume":"16 1","pages":"60"},"PeriodicalIF":4.1000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11923329/pdf/","citationCount":"0","resultStr":"{\"title\":\"Annotation-efficient, patch-based, explainable deep learning using curriculum method for breast cancer detection in screening mammography.\",\"authors\":\"Ozden Camurdan, Toygar Tanyel, Esma Aktufan Cerekci, Deniz Alis, Emine Meltem, Nurper Denizoglu, Mustafa Ege Seker, Ilkay Oksuz, Ercan Karaarslan\",\"doi\":\"10.1186/s13244-025-01922-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>To develop an efficient deep learning (DL) model for breast cancer detection in mammograms, utilizing both weak (image-level) and strong (bounding boxes) annotations and providing explainable artificial intelligence (XAI) with gradient-weighted class activation mapping (Grad-CAM), assessed by the ground truth overlap ratio.</p><p><strong>Methods: </strong>Three radiologists annotated a balanced dataset of 1976 mammograms (cancer-positive and -negative) from three centers. We developed a patch-based DL model using curriculum learning, progressively increasing patch sizes during training. The model was trained under varying levels of strong supervision (0%, 20%, 40%, and 100% of the dataset), resulting in baseline, curriculum 20, curriculum 40, and curriculum 100 models. Training for each model was repeated ten times, with results presented as mean ± standard deviation. Model performance was also tested on an external dataset of 4276 mammograms to assess generalizability.</p><p><strong>Results: </strong>F1 scores for the baseline, curriculum 20, curriculum 40, and curriculum 100 models were 80.55 ± 0.88, 82.41 ± 0.47, 83.03 ± 0.31, and 83.95 ± 0.55, respectively, with ground truth overlap ratios of 60.26 ± 1.91, 62.13 ± 1.2, 62.26 ± 1.52, and 64.18 ± 1.37. In the external dataset, F1 scores were 74.65 ± 1.35, 77.77 ± 0.73, 78.23 ± 1.78, and 78.73 ± 1.25, respectively, maintaining a similar performance trend.</p><p><strong>Conclusion: </strong>Training DL models with a curriculum method and a patch-based approach yields satisfactory performance and XAI, even with a limited set of densely annotated data, offering a promising avenue for deploying DL in large-scale mammography datasets.</p><p><strong>Critical relevance: </strong>This study introduces a DL model for mammography-based breast cancer detection, utilizing curriculum learning with limited, strongly labeled data. It showcases performance gains and better explainability, addressing challenges of extensive dataset needs and DL's \\\"black-box\\\" nature.</p><p><strong>Key points: </strong>Increasing numbers of mammograms for radiologists to interpret pose a logistical challenge. We trained a DL model leveraging curriculum learning with mixed annotations for mammography. The DL model outperformed the baseline model with image-level annotations using only 20% of the strong labels. The study addresses the challenge of requiring extensive datasets and strong supervision for DL efficacy. The model demonstrated improved explainability through Grad-CAM, verified by a higher ground truth overlap ratio. He proposed approach also yielded robust performance on external testing data.</p>\",\"PeriodicalId\":13639,\"journal\":{\"name\":\"Insights into Imaging\",\"volume\":\"16 1\",\"pages\":\"60\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11923329/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Insights into Imaging\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13244-025-01922-w\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Insights into Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13244-025-01922-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

摘要

目的:开发一种有效的深度学习(DL)模型,用于乳房x线照片中的乳腺癌检测,利用弱(图像级)和强(边界框)注释,并提供可解释的人工智能(XAI),具有梯度加权类激活映射(gradcam),通过地面真实重叠比进行评估。方法:三名放射科医生注释了来自三个中心的1976年乳房x光片(癌症阳性和阴性)的平衡数据集。我们使用课程学习开发了一个基于补丁的深度学习模型,在训练过程中逐步增加补丁大小。该模型在不同程度的强监督下(0%、20%、40%和100%的数据集)进行训练,产生基线、课程20、课程40和课程100模型。每个模型重复训练10次,结果以均数±标准差表示。模型的性能也在4276张乳房x光片的外部数据集上进行了测试,以评估通用性。结果:基线模型、课程20模型、课程40模型、课程100模型的F1得分分别为80.55±0.88、82.41±0.47、83.03±0.31、83.95±0.55,真实度重叠比分别为60.26±1.91、62.13±1.2、62.26±1.52、64.18±1.37。在外部数据集中,F1得分分别为74.65±1.35、77.77±0.73、78.23±1.78和78.73±1.25,表现趋势相似。结论:使用课程方法和基于补丁的方法训练DL模型可以产生令人满意的性能和XAI,即使是有限的密集注释数据集,也为在大规模乳房x线摄影数据集中部署DL提供了有希望的途径。关键相关性:本研究介绍了基于乳房x线摄影的乳腺癌检测的DL模型,利用有限的、强标记数据的课程学习。它展示了性能提升和更好的可解释性,解决了广泛数据集需求和深度学习“黑箱”性质的挑战。要点:越来越多的乳房x光片给放射科医生带来了后勤方面的挑战。我们训练了一个利用课程学习和乳腺x线摄影混合注释的深度学习模型。DL模型仅使用20%的强标签,在图像级注释方面优于基线模型。该研究解决了需要广泛的数据集和对DL疗效的强有力监督的挑战。通过Grad-CAM,该模型的可解释性得到了提高,并得到了较高的地面真值重叠比的验证。他提出的方法在外部测试数据上也产生了健壮的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Annotation-efficient, patch-based, explainable deep learning using curriculum method for breast cancer detection in screening mammography.

Objectives: To develop an efficient deep learning (DL) model for breast cancer detection in mammograms, utilizing both weak (image-level) and strong (bounding boxes) annotations and providing explainable artificial intelligence (XAI) with gradient-weighted class activation mapping (Grad-CAM), assessed by the ground truth overlap ratio.

Methods: Three radiologists annotated a balanced dataset of 1976 mammograms (cancer-positive and -negative) from three centers. We developed a patch-based DL model using curriculum learning, progressively increasing patch sizes during training. The model was trained under varying levels of strong supervision (0%, 20%, 40%, and 100% of the dataset), resulting in baseline, curriculum 20, curriculum 40, and curriculum 100 models. Training for each model was repeated ten times, with results presented as mean ± standard deviation. Model performance was also tested on an external dataset of 4276 mammograms to assess generalizability.

Results: F1 scores for the baseline, curriculum 20, curriculum 40, and curriculum 100 models were 80.55 ± 0.88, 82.41 ± 0.47, 83.03 ± 0.31, and 83.95 ± 0.55, respectively, with ground truth overlap ratios of 60.26 ± 1.91, 62.13 ± 1.2, 62.26 ± 1.52, and 64.18 ± 1.37. In the external dataset, F1 scores were 74.65 ± 1.35, 77.77 ± 0.73, 78.23 ± 1.78, and 78.73 ± 1.25, respectively, maintaining a similar performance trend.

Conclusion: Training DL models with a curriculum method and a patch-based approach yields satisfactory performance and XAI, even with a limited set of densely annotated data, offering a promising avenue for deploying DL in large-scale mammography datasets.

Critical relevance: This study introduces a DL model for mammography-based breast cancer detection, utilizing curriculum learning with limited, strongly labeled data. It showcases performance gains and better explainability, addressing challenges of extensive dataset needs and DL's "black-box" nature.

Key points: Increasing numbers of mammograms for radiologists to interpret pose a logistical challenge. We trained a DL model leveraging curriculum learning with mixed annotations for mammography. The DL model outperformed the baseline model with image-level annotations using only 20% of the strong labels. The study addresses the challenge of requiring extensive datasets and strong supervision for DL efficacy. The model demonstrated improved explainability through Grad-CAM, verified by a higher ground truth overlap ratio. He proposed approach also yielded robust performance on external testing data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Insights into Imaging
Insights into Imaging Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
7.30
自引率
4.30%
发文量
182
审稿时长
13 weeks
期刊介绍: Insights into Imaging (I³) is a peer-reviewed open access journal published under the brand SpringerOpen. All content published in the journal is freely available online to anyone, anywhere! I³ continuously updates scientific knowledge and progress in best-practice standards in radiology through the publication of original articles and state-of-the-art reviews and opinions, along with recommendations and statements from the leading radiological societies in Europe. Founded by the European Society of Radiology (ESR), I³ creates a platform for educational material, guidelines and recommendations, and a forum for topics of controversy. A balanced combination of review articles, original papers, short communications from European radiological congresses and information on society matters makes I³ an indispensable source for current information in this field. I³ is owned by the ESR, however authors retain copyright to their article according to the Creative Commons Attribution License (see Copyright and License Agreement). All articles can be read, redistributed and reused for free, as long as the author of the original work is cited properly. The open access fees (article-processing charges) for this journal are kindly sponsored by ESR for all Members. The journal went open access in 2012, which means that all articles published since then are freely available online.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信