Multiparametric MRI model to predict molecular subtypes of breast cancer using Shapley additive explanations interpretability analysis

IF 4.9 2区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Yao Huang , Xiaoxia Wang , Ying Cao , Mengfei Li , Lan Li , Huifang Chen , Sun Tang , Xiaosong Lan , Fujie Jiang , Jiuquan Zhang
{"title":"Multiparametric MRI model to predict molecular subtypes of breast cancer using Shapley additive explanations interpretability analysis","authors":"Yao Huang ,&nbsp;Xiaoxia Wang ,&nbsp;Ying Cao ,&nbsp;Mengfei Li ,&nbsp;Lan Li ,&nbsp;Huifang Chen ,&nbsp;Sun Tang ,&nbsp;Xiaosong Lan ,&nbsp;Fujie Jiang ,&nbsp;Jiuquan Zhang","doi":"10.1016/j.diii.2024.01.004","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>The purpose of this study was to assess the predictive performance of multiparametric magnetic resonance imaging (MRI) for molecular subtypes and interpret features using SHapley Additive exPlanations (SHAP) analysis.</p></div><div><h3>Material and methods</h3><p>Patients with breast cancer who underwent pre-treatment MRI (including ultrafast dynamic contrast-enhanced MRI, magnetic resonance spectroscopy, diffusion kurtosis imaging and intravoxel incoherent motion) were recruited between February 2019 and January 2022. Thirteen semantic and thirteen multiparametric features were collected and the key features were selected to develop machine-learning models for predicting molecular subtypes of breast cancers (luminal A, luminal B, triple-negative and HER2-enriched) by using stepwise logistic regression. Semantic model and multiparametric model were built and compared based on five machine-learning classifiers. Model decision-making was interpreted using SHAP analysis.</p></div><div><h3>Results</h3><p>A total of 188 women (mean age, 53 ± 11 [standard deviation] years; age range: 25–75 years) were enrolled and further divided into training cohort (131 women) and validation cohort (57 women). XGBoost demonstrated good predictive performance among five machine-learning classifiers. Within the validation cohort, the areas under the receiver operating characteristic curves (AUCs) for the semantic models ranged from 0.693 (95% confidence interval [CI]: 0.478–0.839) for HER2-enriched subtype to 0.764 (95% CI: 0.681–0.908) for luminal A subtype, inferior to multiparametric models that yielded AUCs ranging from 0.771 (95% CI: 0.630–0.888) for HER2-enriched subtype to 0.857 (95% CI: 0.717–0.957) for triple-negative subtype. The AUCs between the semantic and the multiparametric models did not show significant differences (<em>P</em> range: 0.217–0.640). SHAP analysis revealed that lower iAUC, higher kurtosis, lower D*, and lower kurtosis were distinctive features for luminal A, luminal B, triple-negative breast cancer, and HER2-enriched subtypes, respectively.</p></div><div><h3>Conclusion</h3><p>Multiparametric MRI is superior to semantic models to effectively predict the molecular subtypes of breast cancer.</p></div>","PeriodicalId":48656,"journal":{"name":"Diagnostic and Interventional Imaging","volume":null,"pages":null},"PeriodicalIF":4.9000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostic and Interventional Imaging","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211568424000160","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose

The purpose of this study was to assess the predictive performance of multiparametric magnetic resonance imaging (MRI) for molecular subtypes and interpret features using SHapley Additive exPlanations (SHAP) analysis.

Material and methods

Patients with breast cancer who underwent pre-treatment MRI (including ultrafast dynamic contrast-enhanced MRI, magnetic resonance spectroscopy, diffusion kurtosis imaging and intravoxel incoherent motion) were recruited between February 2019 and January 2022. Thirteen semantic and thirteen multiparametric features were collected and the key features were selected to develop machine-learning models for predicting molecular subtypes of breast cancers (luminal A, luminal B, triple-negative and HER2-enriched) by using stepwise logistic regression. Semantic model and multiparametric model were built and compared based on five machine-learning classifiers. Model decision-making was interpreted using SHAP analysis.

Results

A total of 188 women (mean age, 53 ± 11 [standard deviation] years; age range: 25–75 years) were enrolled and further divided into training cohort (131 women) and validation cohort (57 women). XGBoost demonstrated good predictive performance among five machine-learning classifiers. Within the validation cohort, the areas under the receiver operating characteristic curves (AUCs) for the semantic models ranged from 0.693 (95% confidence interval [CI]: 0.478–0.839) for HER2-enriched subtype to 0.764 (95% CI: 0.681–0.908) for luminal A subtype, inferior to multiparametric models that yielded AUCs ranging from 0.771 (95% CI: 0.630–0.888) for HER2-enriched subtype to 0.857 (95% CI: 0.717–0.957) for triple-negative subtype. The AUCs between the semantic and the multiparametric models did not show significant differences (P range: 0.217–0.640). SHAP analysis revealed that lower iAUC, higher kurtosis, lower D*, and lower kurtosis were distinctive features for luminal A, luminal B, triple-negative breast cancer, and HER2-enriched subtypes, respectively.

Conclusion

Multiparametric MRI is superior to semantic models to effectively predict the molecular subtypes of breast cancer.

利用沙普利加法解释可解释性分析预测乳腺癌分子亚型的多参数磁共振成像模型
材料与方法在2019年2月至2022年1月期间招募了接受治疗前磁共振成像(包括超快动态对比增强磁共振成像、磁共振波谱、弥散峰度成像和体细胞内不连贯运动)的乳腺癌患者。收集了13个语义特征和13个多参数特征,并选择其中的关键特征,利用逐步逻辑回归法开发了预测乳腺癌分子亚型(管腔A型、管腔B型、三阴性和HER2富集)的机器学习模型。建立了语义模型和多参数模型,并基于五个机器学习分类器进行了比较。结果 共有 188 名妇女(平均年龄为 53 ± 11 [标准差]岁;年龄范围:25-75 岁)参加了研究,并进一步分为训练队列(131 名妇女)和验证队列(57 名妇女)。在五种机器学习分类器中,XGBoost 表现出良好的预测性能。在验证队列中,语义模型的接收者操作特征曲线下面积(AUC)从HER2富集亚型的0.693(95%置信区间[CI]:0.478-0.839)到HER2富集亚型的0.764(95%置信区间[CI]:0.对于管腔 A 亚型,其 AUC 为 0.771(95% CI:0.630-0.888),而对于三阴性亚型,其 AUC 为 0.857(95% CI:0.717-0.957)。语义模型和多参数模型之间的 AUC 并无显著差异(P 范围:0.217-0.640)。SHAP分析显示,较低的iAUC、较高的峰度、较低的D*和较低的峰度分别是管腔A型、管腔B型、三阴性乳腺癌和HER2富集亚型的显著特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Diagnostic and Interventional Imaging
Diagnostic and Interventional Imaging Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
8.50
自引率
29.10%
发文量
126
审稿时长
11 days
期刊介绍: Diagnostic and Interventional Imaging accepts publications originating from any part of the world based only on their scientific merit. The Journal focuses on illustrated articles with great iconographic topics and aims at aiding sharpening clinical decision-making skills as well as following high research topics. All articles are published in English. Diagnostic and Interventional Imaging publishes editorials, technical notes, letters, original and review articles on abdominal, breast, cancer, cardiac, emergency, forensic medicine, head and neck, musculoskeletal, gastrointestinal, genitourinary, interventional, obstetric, pediatric, thoracic and vascular imaging, neuroradiology, nuclear medicine, as well as contrast material, computer developments, health policies and practice, and medical physics relevant to imaging.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信