Multiparametric MRI Radiomics With Machine Learning for Differentiating HER2-Zero, -Low, and -Positive Breast Cancer: Model Development, Testing, and Interpretability Analysis.

IF 4.7 2区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

American Journal of Roentgenology Pub Date : 2024-10-16 DOI:10.2214/AJR.24.31717

Yongxin Chen, Siyi Chen, Wenjie Tang, Qingcong Konge, Zhidan Zhong, Xiaomeng Yu, Yi Sui, Wenke Hu, Xinqing Jiang, Yuan Guo

{"title":"Multiparametric MRI Radiomics With Machine Learning for Differentiating HER2-Zero, -Low, and -Positive Breast Cancer: Model Development, Testing, and Interpretability Analysis.","authors":"Yongxin Chen, Siyi Chen, Wenjie Tang, Qingcong Konge, Zhidan Zhong, Xiaomeng Yu, Yi Sui, Wenke Hu, Xinqing Jiang, Yuan Guo","doi":"10.2214/AJR.24.31717","DOIUrl":null,"url":null,"abstract":"BACKGROUND: MRI radiomics has been explored for three-tiered classification of breast cancer HER2 expression (i.e., HER2-zero, HER2-low, or HER2-positive), although understanding of how such models reach their predictions is lacking. OBJECTIVE: To develop and test multiparametric MRI radiomics machine-learning models for differentiating three-tiered HER2 expression levels in patients with breast cancer, and to explain the contributions of model features through local and global interpretations using SHapley Additive exPlanation (SHAP) analysis. METHODS: This retrospective study included 737 patients (mean age, 54.1±10.6 years) with breast cancer from two centers (center 1: n=578; center 2: n=159), who underwent breast MRI and had HER2 expression determined after excisional biopsy. Analysis entailed two tasks: differentiating HER2-negative (i.e., HER2-zero or HER2-low) from HER2-positive tumors (task 1), and differentiating HER2-zero from HER2-low tumors (task 2). For each task, patients from center 1 were randomly assigned in 7:3 ratio to training (task 1: n=405; task 2: n=284) or internal test (task 1: n=173; task 2: n=122) sets; those from center 2 formed an external test set (task 1: n=159; task 2: n=105). Radiomics features were extracted from early-phase dynamic contrast-enhanced images (DCE), T2-weighted images (T2WI), and DWI. For each task, a support vector machine (SVM) was used for feature selection; a multiparametric radiomics score (radscore) was computed using feature weights from SVM correlation coefficients; conventional MRI and combined models were constructed; and model performances were evaluated. SHAP analysis was used to provide local and global interpretations for model outputs. RESULTS: In the external test set, for task 1, AUCs for the conventional MRI model, radscore, and combined model were 0.624, 0.757, and 0.762, respectively; for task 2, AUC for radscore was 0.754, and no conventional MRI model or combined model could be constructed. SHAP analysis identified early-phase DCE features as having the strongest influence for both tasks; T2WI features also had a prominent role for task 2. CONCLUSION: The findings indicate suboptimal performance of MRI radiomics models for noninvasive characterization of HER2 expression. CLINICAL IMPACT: The study provides an example of the use of SHAP interpretation analysis to better understand predictions of imaging-based machine learning models.","PeriodicalId":55529,"journal":{"name":"American Journal of Roentgenology","volume":null,"pages":null},"PeriodicalIF":4.7000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Roentgenology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2214/AJR.24.31717","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

BACKGROUND: MRI radiomics has been explored for three-tiered classification of breast cancer HER2 expression (i.e., HER2-zero, HER2-low, or HER2-positive), although understanding of how such models reach their predictions is lacking. OBJECTIVE: To develop and test multiparametric MRI radiomics machine-learning models for differentiating three-tiered HER2 expression levels in patients with breast cancer, and to explain the contributions of model features through local and global interpretations using SHapley Additive exPlanation (SHAP) analysis. METHODS: This retrospective study included 737 patients (mean age, 54.1±10.6 years) with breast cancer from two centers (center 1: n=578; center 2: n=159), who underwent breast MRI and had HER2 expression determined after excisional biopsy. Analysis entailed two tasks: differentiating HER2-negative (i.e., HER2-zero or HER2-low) from HER2-positive tumors (task 1), and differentiating HER2-zero from HER2-low tumors (task 2). For each task, patients from center 1 were randomly assigned in 7:3 ratio to training (task 1: n=405; task 2: n=284) or internal test (task 1: n=173; task 2: n=122) sets; those from center 2 formed an external test set (task 1: n=159; task 2: n=105). Radiomics features were extracted from early-phase dynamic contrast-enhanced images (DCE), T2-weighted images (T2WI), and DWI. For each task, a support vector machine (SVM) was used for feature selection; a multiparametric radiomics score (radscore) was computed using feature weights from SVM correlation coefficients; conventional MRI and combined models were constructed; and model performances were evaluated. SHAP analysis was used to provide local and global interpretations for model outputs. RESULTS: In the external test set, for task 1, AUCs for the conventional MRI model, radscore, and combined model were 0.624, 0.757, and 0.762, respectively; for task 2, AUC for radscore was 0.754, and no conventional MRI model or combined model could be constructed. SHAP analysis identified early-phase DCE features as having the strongest influence for both tasks; T2WI features also had a prominent role for task 2. CONCLUSION: The findings indicate suboptimal performance of MRI radiomics models for noninvasive characterization of HER2 expression. CLINICAL IMPACT: The study provides an example of the use of SHAP interpretation analysis to better understand predictions of imaging-based machine learning models.

查看原文本刊更多论文

利用机器学习的多参数 MRI 放射组学用于区分 HER2 零、低和阳性乳腺癌：模型开发、测试和可解释性分析。

背景：磁共振成像放射组学已被探索用于乳腺癌 HER2 表达的三级分类（即 HER2-零、HER2-低或 HER2-阳性），但人们对此类模型如何得出预测结果还缺乏了解。目的：开发并测试用于区分乳腺癌患者三级 HER2 表达水平的多参数 MRI 放射组学机器学习模型，并使用 SHapley Additive exPlanation（SHAP）分析法通过局部和全局解释模型特征的贡献。方法：这项回顾性研究纳入了来自两个中心（中心1：578人；中心2：159人）的737名乳腺癌患者（平均年龄为54.1±10.6岁），这些患者接受了乳腺磁共振成像检查，并在切除活检后确定了HER2的表达。分析包括两项任务：区分HER2阴性（即HER2-0或HER2-低）和HER2阳性肿瘤（任务1），以及区分HER2-0和HER2-低肿瘤（任务2）。对于每项任务，中心1的患者按7:3的比例随机分配到训练集（任务1：n=405；任务2：n=284）或内部测试集（任务1：n=173；任务2：n=122）；中心2的患者组成外部测试集（任务1：n=159；任务2：n=105）。放射组学特征从早期动态对比增强图像（DCE）、T2 加权图像（T2WI）和 DWI 中提取。每个任务都使用支持向量机（SVM）进行特征选择；使用 SVM 相关系数的特征权重计算多参数放射组学评分（radscore）；构建传统 MRI 模型和组合模型；评估模型性能。使用 SHAP 分析为模型输出提供局部和全局解释。结果：在外部测试集中，对于任务 1，传统 MRI 模型、radscore 和组合模型的 AUC 分别为 0.624、0.757 和 0.762；对于任务 2，radscore 的 AUC 为 0.754，且无法构建传统 MRI 模型或组合模型。SHAP分析表明，早期DCE特征对这两项任务的影响最大；T2WI特征对任务2的影响也很显著。结论：研究结果表明，磁共振成像放射组学模型在表征 HER2 表达的非侵入性方面表现欠佳。临床影响：该研究提供了一个使用 SHAP 解释分析来更好地理解基于成像的机器学习模型预测的例子。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

American Journal of Roentgenology 医学-核医学

CiteScore

12.80

自引率

4.00%

发文量

920

审稿时长

3 months

期刊介绍： Founded in 1907, the monthly American Journal of Roentgenology (AJR) is the world’s longest continuously published general radiology journal. AJR is recognized as among the specialty’s leading peer-reviewed journals and has a worldwide circulation of close to 25,000. The journal publishes clinically-oriented articles across all radiology subspecialties, seeking relevance to radiologists’ daily practice. The journal publishes hundreds of articles annually with a diverse range of formats, including original research, reviews, clinical perspectives, editorials, and other short reports. The journal engages its audience through a spectrum of social media and digital communication activities.