Contrast-enhanced mammography-based interpretable machine learning model for the prediction of the molecular subtype breast cancers.

IF 3.2 3区 医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Mengwei Ma, Weimin Xu, Jun Yang, Bowen Zheng, Chanjuan Wen, Sina Wang, Zeyuan Xu, Genggeng Qin, Weiguo Chen
{"title":"Contrast-enhanced mammography-based interpretable machine learning model for the prediction of the molecular subtype breast cancers.","authors":"Mengwei Ma, Weimin Xu, Jun Yang, Bowen Zheng, Chanjuan Wen, Sina Wang, Zeyuan Xu, Genggeng Qin, Weiguo Chen","doi":"10.1186/s12880-025-01765-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aims to establish a machine learning prediction model to explore the correlation between contrast-enhanced mammography (CEM) imaging features and molecular subtypes of mass-type breast cancer.</p><p><strong>Materials and methods: </strong>This retrospective study included women with breast cancer who underwent CEM preoperatively between 2018 and 2021. We included 241 patients, which were randomly assigned to either a training or a test set in a 7:3 ratio. Twenty-one features were visually described, including four clinical features and seventeen radiological features, these radiological features which extracted from the CEM. Three binary classifications of subtypes were performed: Luminal vs. non-Luminal, HER2-enriched vs. non-HER2-enriched, and triple-negative (TNBC) vs. non-triple-negative. A multinomial naive Bayes (MNB) machine learning scheme was employed for the classification, and the least absolute shrink age and selection operator method were used to select the most predictive features for the classifiers. The classification performance was evaluated using the area under the receiver operating characteristic curve. We also utilized SHapley Additive exPlanation (SHAP) values to explain the prediction model.</p><p><strong>Results: </strong>The model that used a combination of low energy (LE) and dual-energy subtraction (DES) achieved the best performance compared to using either of the two images alone, yielding an area under the receiver operating characteristic curve of 0.798 for Luminal vs. non-Luminal subtypes, 0.695 for TNBC vs. non-TNBC, and 0.773 for HER2-enriched vs. non-HER2-enriched. The SHAP algorithm shows that \"LE_mass_margin_spiculated,\" \"DES_mass_enhanced_margin_spiculated,\" and \"DES_mass_internal_enhancement_homogeneous\" have the most significant impact on the model's performance in predicting Luminal and non-Luminal breast cancer. \"mass_calcification_relationship_no,\" \"calcification_ type_no,\" and \"LE_mass_margin_spiculated\" have a considerable impact on the model's performance in predicting HER2 and non-HER2 breast cancer.</p><p><strong>Conclusions: </strong>The radiological characteristics of breast tumors extracted from CEM were found to be associated with breast cancer subtypes in our study. Future research is needed to validate these findings.</p>","PeriodicalId":9020,"journal":{"name":"BMC Medical Imaging","volume":"25 1","pages":"255"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220444/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12880-025-01765-3","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study aims to establish a machine learning prediction model to explore the correlation between contrast-enhanced mammography (CEM) imaging features and molecular subtypes of mass-type breast cancer.

Materials and methods: This retrospective study included women with breast cancer who underwent CEM preoperatively between 2018 and 2021. We included 241 patients, which were randomly assigned to either a training or a test set in a 7:3 ratio. Twenty-one features were visually described, including four clinical features and seventeen radiological features, these radiological features which extracted from the CEM. Three binary classifications of subtypes were performed: Luminal vs. non-Luminal, HER2-enriched vs. non-HER2-enriched, and triple-negative (TNBC) vs. non-triple-negative. A multinomial naive Bayes (MNB) machine learning scheme was employed for the classification, and the least absolute shrink age and selection operator method were used to select the most predictive features for the classifiers. The classification performance was evaluated using the area under the receiver operating characteristic curve. We also utilized SHapley Additive exPlanation (SHAP) values to explain the prediction model.

Results: The model that used a combination of low energy (LE) and dual-energy subtraction (DES) achieved the best performance compared to using either of the two images alone, yielding an area under the receiver operating characteristic curve of 0.798 for Luminal vs. non-Luminal subtypes, 0.695 for TNBC vs. non-TNBC, and 0.773 for HER2-enriched vs. non-HER2-enriched. The SHAP algorithm shows that "LE_mass_margin_spiculated," "DES_mass_enhanced_margin_spiculated," and "DES_mass_internal_enhancement_homogeneous" have the most significant impact on the model's performance in predicting Luminal and non-Luminal breast cancer. "mass_calcification_relationship_no," "calcification_ type_no," and "LE_mass_margin_spiculated" have a considerable impact on the model's performance in predicting HER2 and non-HER2 breast cancer.

Conclusions: The radiological characteristics of breast tumors extracted from CEM were found to be associated with breast cancer subtypes in our study. Future research is needed to validate these findings.

基于对比增强乳房x线照相术的可解释机器学习模型用于预测乳腺癌分子亚型。
目的:本研究旨在建立机器学习预测模型,探讨肿块型乳腺癌造影(CEM)影像特征与分子亚型的相关性。材料和方法:本回顾性研究纳入了2018年至2021年间术前接受CEM的乳腺癌患者。我们纳入了241例患者,按7:3的比例随机分配到训练组或测试组。21个特征被视觉描述,包括4个临床特征和17个放射学特征,这些放射学特征是由CEM提取的。进行了三种亚型的二元分类:Luminal与非Luminal, her2富集与非her2富集,三阴性(TNBC)与非三阴性。采用多项朴素贝叶斯(MNB)机器学习方案进行分类,并采用最小绝对收缩年龄和选择算子方法为分类器选择最具预测性的特征。用受者工作特征曲线下面积评价分类效果。我们还使用SHapley加性解释(SHAP)值来解释预测模型。结果:与单独使用两种图像中的任何一种相比,使用低能量(LE)和双能量减法(DES)组合的模型获得了最佳性能,Luminal亚型与非Luminal亚型的接收器工作特征曲线下面积为0.798,TNBC与非TNBC的面积为0.695,her2富集与非her2富集的面积为0.773。SHAP算法表明,“LE_mass_margin_spiculated”、“DES_mass_enhanced_margin_spiculated”和“DES_mass_internal_enhancement_homogeneous”对模型预测腔腔和非腔腔乳腺癌的性能影响最为显著。“mass_calcification_relationship_no”、“calcification_type_no”和“LE_mass_margin_spiculated”对模型预测HER2和非HER2乳腺癌的性能有相当大的影响。结论:我们的研究发现CEM提取的乳腺肿瘤的放射学特征与乳腺癌亚型有关。需要进一步的研究来验证这些发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Medical Imaging
BMC Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-
CiteScore
4.60
自引率
3.70%
发文量
198
审稿时长
27 weeks
期刊介绍: BMC Medical Imaging is an open access journal publishing original peer-reviewed research articles in the development, evaluation, and use of imaging techniques and image processing tools to diagnose and manage disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信