External Evaluation of a Mammography-based Deep Learning Model for Predicting Breast Cancer in an Ethnically Diverse Population.

IF 8.1 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Radiology-Artificial Intelligence Pub Date : 2023-07-26 eCollection Date: 2023-11-01 DOI:10.1148/ryai.220299
Olasubomi J Omoleye, Anna E Woodard, Frederick M Howard, Fangyuan Zhao, Toshio F Yoshimatsu, Yonglan Zheng, Alexander T Pearson, Maksim Levental, Benjamin S Aribisala, Kirti Kulkarni, Gregory S Karczmar, Olufunmilayo I Olopade, Hiroyuki Abe, Dezheng Huo
{"title":"External Evaluation of a Mammography-based Deep Learning Model for Predicting Breast Cancer in an Ethnically Diverse Population.","authors":"Olasubomi J Omoleye, Anna E Woodard, Frederick M Howard, Fangyuan Zhao, Toshio F Yoshimatsu, Yonglan Zheng, Alexander T Pearson, Maksim Levental, Benjamin S Aribisala, Kirti Kulkarni, Gregory S Karczmar, Olufunmilayo I Olopade, Hiroyuki Abe, Dezheng Huo","doi":"10.1148/ryai.220299","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To externally evaluate a mammography-based deep learning (DL) model (Mirai) in a high-risk racially diverse population and compare its performance with other mammographic measures.</p><p><strong>Materials and methods: </strong>A total of 6435 screening mammograms in 2096 female patients (median age, 56.4 years ± 11.2 [SD]) enrolled in a hospital-based case-control study from 2006 to 2020 were retrospectively evaluated. Pathologically confirmed breast cancer was the primary outcome. Mirai scores were the primary predictors. Breast density and Breast Imaging Reporting and Data System (BI-RADS) assessment categories were comparative predictors. Performance was evaluated using area under the receiver operating characteristic curve (AUC) and concordance index analyses.</p><p><strong>Results: </strong>Mirai achieved 1- and 5-year AUCs of 0.71 (95% CI: 0.68, 0.74) and 0.65 (95% CI: 0.64, 0.67), respectively. One-year AUCs for nondense versus dense breasts were 0.72 versus 0.58 (<i>P</i> = .10). There was no evidence of a difference in near-term discrimination performance between BI-RADS and Mirai (1-year AUC, 0.73 vs 0.68; <i>P</i> = .34). For longer-term prediction (2-5 years), Mirai outperformed BI-RADS assessment (5-year AUC, 0.63 vs 0.54; <i>P</i> < .001). Using only images of the unaffected breast reduced the discriminatory performance of the DL model (<i>P</i> < .001 at all time points), suggesting that its predictions are likely dependent on the detection of ipsilateral premalignant patterns.</p><p><strong>Conclusion: </strong>A mammography DL model showed good performance in a high-risk external dataset enriched for African American patients, benign breast disease, and <i>BRCA</i> mutation carriers, and study findings suggest that the model performance is likely driven by the detection of precancerous changes.<b>Keywords:</b> Breast, Cancer, Computer Applications, Convolutional Neural Network, Deep Learning Algorithms, Informatics, Epidemiology, Machine Learning, Mammography, Oncology, Radiomics <i>Supplemental material is available for this article</i>. © RSNA, 2023See also commentary by Kontos and Kalpathy-Cramer in this issue.</p>","PeriodicalId":29787,"journal":{"name":"Radiology-Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":8.1000,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10698602/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiology-Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1148/ryai.220299","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/11/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: To externally evaluate a mammography-based deep learning (DL) model (Mirai) in a high-risk racially diverse population and compare its performance with other mammographic measures.

Materials and methods: A total of 6435 screening mammograms in 2096 female patients (median age, 56.4 years ± 11.2 [SD]) enrolled in a hospital-based case-control study from 2006 to 2020 were retrospectively evaluated. Pathologically confirmed breast cancer was the primary outcome. Mirai scores were the primary predictors. Breast density and Breast Imaging Reporting and Data System (BI-RADS) assessment categories were comparative predictors. Performance was evaluated using area under the receiver operating characteristic curve (AUC) and concordance index analyses.

Results: Mirai achieved 1- and 5-year AUCs of 0.71 (95% CI: 0.68, 0.74) and 0.65 (95% CI: 0.64, 0.67), respectively. One-year AUCs for nondense versus dense breasts were 0.72 versus 0.58 (P = .10). There was no evidence of a difference in near-term discrimination performance between BI-RADS and Mirai (1-year AUC, 0.73 vs 0.68; P = .34). For longer-term prediction (2-5 years), Mirai outperformed BI-RADS assessment (5-year AUC, 0.63 vs 0.54; P < .001). Using only images of the unaffected breast reduced the discriminatory performance of the DL model (P < .001 at all time points), suggesting that its predictions are likely dependent on the detection of ipsilateral premalignant patterns.

Conclusion: A mammography DL model showed good performance in a high-risk external dataset enriched for African American patients, benign breast disease, and BRCA mutation carriers, and study findings suggest that the model performance is likely driven by the detection of precancerous changes.Keywords: Breast, Cancer, Computer Applications, Convolutional Neural Network, Deep Learning Algorithms, Informatics, Epidemiology, Machine Learning, Mammography, Oncology, Radiomics Supplemental material is available for this article. © RSNA, 2023See also commentary by Kontos and Kalpathy-Cramer in this issue.

基于乳房x线摄影的深度学习模型在不同种族人群中预测乳腺癌的外部评估
目的:在高风险种族多样化人群中对基于乳腺X光摄影的深度学习(DL)模型(Mirai)进行外部评估,并将其性能与其他乳腺X光摄影测量方法进行比较:回顾性评估了2006年至2020年期间一项基于医院的病例对照研究中2096名女性患者(中位年龄为56.4岁±11.2 [SD])的6435张乳腺X光筛查照片。病理确诊的乳腺癌是主要结果。Mirai 评分是主要预测指标。乳腺密度和乳腺成像报告和数据系统(BI-RADS)评估类别是比较预测指标。使用接收者操作特征曲线下面积(AUC)和一致性指数分析对性能进行评估:Mirai的1年和5年AUC分别为0.71(95% CI:0.68,0.74)和0.65(95% CI:0.64,0.67)。非致密乳房与致密乳房的一年 AUC 分别为 0.72 与 0.58(P = .10)。没有证据表明 BI-RADS 和 Mirai 的近期判别性能存在差异(1 年 AUC 为 0.73 vs 0.68;P = .34)。在长期预测(2-5 年)方面,Mirai 的表现优于 BI-RADS 评估(5 年 AUC,0.63 vs 0.54;P <.001)。仅使用未受影响乳房的图像降低了 DL 模型的鉴别性能(所有时间点的 P < .001),这表明其预测可能依赖于同侧恶性前模式的检测:乳腺 X 射线 DL 模型在富含非裔美国人患者、良性乳腺疾病和 BRCA 基因突变携带者的高风险外部数据集中表现出良好的性能,研究结果表明该模型的性能可能是由癌前病变的检测驱动的:乳腺癌 计算机应用 卷积神经网络 深度学习算法 信息学 流行病学 机器学习 乳房 X 线照相术 肿瘤学 放射组学 本文有补充材料。©RSNA,2023另请参阅本期 Kontos 和 Kalpathy-Cramer 的评论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
16.20
自引率
1.00%
发文量
0
期刊介绍: Radiology: Artificial Intelligence is a bi-monthly publication that focuses on the emerging applications of machine learning and artificial intelligence in the field of imaging across various disciplines. This journal is available online and accepts multiple manuscript types, including Original Research, Technical Developments, Data Resources, Review articles, Editorials, Letters to the Editor and Replies, Special Reports, and AI in Brief.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信