多源数据增强对卷积神经网络乳腺造影异常分类性能的影响。

Frontiers in radiology Pub Date : 2023-01-01 DOI:10.3389/fradi.2023.1181190

InChan Hwang, Hari Trivedi, Beatrice Brown-Mulry, Linglin Zhang, Vineela Nalla, Aimilia Gastounioti, Judy Gichoya, Laleh Seyyed-Kalantari, Imon Banerjee, MinJae Woo

{"title":"多源数据增强对卷积神经网络乳腺造影异常分类性能的影响。","authors":"InChan Hwang, Hari Trivedi, Beatrice Brown-Mulry, Linglin Zhang, Vineela Nalla, Aimilia Gastounioti, Judy Gichoya, Laleh Seyyed-Kalantari, Imon Banerjee, MinJae Woo","doi":"10.3389/fradi.2023.1181190","DOIUrl":null,"url":null,"abstract":"Introduction: To date, most mammography-related AI models have been trained using either film or digital mammogram datasets with little overlap. We investigated whether or not combining film and digital mammography during training will help or hinder modern models designed for use on digital mammograms.Methods: To this end, a total of six binary classifiers were trained for comparison. The first three classifiers were trained using images only from Emory Breast Imaging Dataset (EMBED) using ResNet50, ResNet101, and ResNet152 architectures. The next three classifiers were trained using images from EMBED, Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM), and Digital Database for Screening Mammography (DDSM) datasets. All six models were tested only on digital mammograms from EMBED.Results: The results showed that performance degradation to the customized ResNet models was statistically significant overall when EMBED dataset was augmented with CBIS-DDSM/DDSM. While the performance degradation was observed in all racial subgroups, some races are subject to more severe performance drop as compared to other races.Discussion: The degradation may potentially be due to ( 1) a mismatch in features between film-based and digital mammograms ( 2) a mismatch in pathologic and radiological information. In conclusion, use of both film and digital mammography during training may hinder modern models designed for breast cancer screening. Caution is required when combining film-based and digital mammograms or when utilizing pathologic and radiological information simultaneously.","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1181190"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10426498/pdf/","citationCount":"1","resultStr":"{\"title\":\"Impact of multi-source data augmentation on performance of convolutional neural networks for abnormality classification in mammography.\",\"authors\":\"InChan Hwang, Hari Trivedi, Beatrice Brown-Mulry, Linglin Zhang, Vineela Nalla, Aimilia Gastounioti, Judy Gichoya, Laleh Seyyed-Kalantari, Imon Banerjee, MinJae Woo\",\"doi\":\"10.3389/fradi.2023.1181190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: To date, most mammography-related AI models have been trained using either film or digital mammogram datasets with little overlap. We investigated whether or not combining film and digital mammography during training will help or hinder modern models designed for use on digital mammograms.Methods: To this end, a total of six binary classifiers were trained for comparison. The first three classifiers were trained using images only from Emory Breast Imaging Dataset (EMBED) using ResNet50, ResNet101, and ResNet152 architectures. The next three classifiers were trained using images from EMBED, Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM), and Digital Database for Screening Mammography (DDSM) datasets. All six models were tested only on digital mammograms from EMBED.Results: The results showed that performance degradation to the customized ResNet models was statistically significant overall when EMBED dataset was augmented with CBIS-DDSM/DDSM. While the performance degradation was observed in all racial subgroups, some races are subject to more severe performance drop as compared to other races.Discussion: The degradation may potentially be due to ( 1) a mismatch in features between film-based and digital mammograms ( 2) a mismatch in pathologic and radiological information. In conclusion, use of both film and digital mammography during training may hinder modern models designed for breast cancer screening. Caution is required when combining film-based and digital mammograms or when utilizing pathologic and radiological information simultaneously.\",\"PeriodicalId\":73101,\"journal\":{\"name\":\"Frontiers in radiology\",\"volume\":\"3 \",\"pages\":\"1181190\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10426498/pdf/\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in radiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fradi.2023.1181190\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in radiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fradi.2023.1181190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

迄今为止，大多数与乳房x光检查相关的人工智能模型都是使用胶片或数字乳房x光检查数据集进行训练的，几乎没有重叠。我们调查了在培训期间结合胶片和数字乳房x光检查是否有助于或阻碍设计用于数字乳房x光检查的现代模型。方法:为此，共训练了6个二元分类器进行比较。前三个分类器仅使用来自Emory乳腺成像数据集(EMBED)的图像，使用ResNet50、ResNet101和ResNet152架构进行训练。接下来的三个分类器使用来自EMBED、乳腺筛查数字数据库(CBIS-DDSM)和乳腺筛查数字数据库(DDSM)数据集的图像进行训练。所有六个模型都只在EMBED的数字乳房x光片上进行了测试。结果:结果表明，当嵌入数据集与CBIS-DDSM/DDSM增强时，自定义ResNet模型的性能下降总体上有统计学意义。虽然在所有种族亚组中都观察到表现下降，但与其他种族相比，某些种族的表现下降更为严重。讨论:退化可能是由于(1)基于胶片的乳房x光片和数字乳房x光片特征不匹配(2)病理和放射信息不匹配。总之，在培训期间同时使用胶片和数字乳房x光检查可能会阻碍为乳腺癌筛查设计的现代模型。当结合胶片和数字乳房x光检查或同时使用病理和放射信息时，需要谨慎。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Impact of multi-source data augmentation on performance of convolutional neural networks for abnormality classification in mammography.

查看原文本刊更多论文

Impact of multi-source data augmentation on performance of convolutional neural networks for abnormality classification in mammography.

Introduction: To date, most mammography-related AI models have been trained using either film or digital mammogram datasets with little overlap. We investigated whether or not combining film and digital mammography during training will help or hinder modern models designed for use on digital mammograms.

Methods: To this end, a total of six binary classifiers were trained for comparison. The first three classifiers were trained using images only from Emory Breast Imaging Dataset (EMBED) using ResNet50, ResNet101, and ResNet152 architectures. The next three classifiers were trained using images from EMBED, Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM), and Digital Database for Screening Mammography (DDSM) datasets. All six models were tested only on digital mammograms from EMBED.

Results: The results showed that performance degradation to the customized ResNet models was statistically significant overall when EMBED dataset was augmented with CBIS-DDSM/DDSM. While the performance degradation was observed in all racial subgroups, some races are subject to more severe performance drop as compared to other races.

Discussion: The degradation may potentially be due to ( 1) a mismatch in features between film-based and digital mammograms ( 2) a mismatch in pathologic and radiological information. In conclusion, use of both film and digital mammography during training may hinder modern models designed for breast cancer screening. Caution is required when combining film-based and digital mammograms or when utilizing pathologic and radiological information simultaneously.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Frontiers in radiology

CiteScore

1.20

自引率

0.00%

发文量