Multiparametric MRI Radiomics With Machine Learning for Differentiating HER2-Zero, -Low, and -Positive Breast Cancer: Model Development, Testing, and Interpretability Analysis.
IF 4.7 2区 医学Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
{"title":"Multiparametric MRI Radiomics With Machine Learning for Differentiating HER2-Zero, -Low, and -Positive Breast Cancer: Model Development, Testing, and Interpretability Analysis.","authors":"Yongxin Chen, Siyi Chen, Wenjie Tang, Qingcong Konge, Zhidan Zhong, Xiaomeng Yu, Yi Sui, Wenke Hu, Xinqing Jiang, Yuan Guo","doi":"10.2214/AJR.24.31717","DOIUrl":null,"url":null,"abstract":"<p><p><b>BACKGROUND:</b> MRI radiomics has been explored for three-tiered classification of breast cancer HER2 expression (i.e., HER2-zero, HER2-low, or HER2-positive), although understanding of how such models reach their predictions is lacking. <b>OBJECTIVE:</b> To develop and test multiparametric MRI radiomics machine-learning models for differentiating three-tiered HER2 expression levels in patients with breast cancer, and to explain the contributions of model features through local and global interpretations using SHapley Additive exPlanation (SHAP) analysis. <b>METHODS:</b> This retrospective study included 737 patients (mean age, 54.1±10.6 years) with breast cancer from two centers (center 1: n=578; center 2: n=159), who underwent breast MRI and had HER2 expression determined after excisional biopsy. Analysis entailed two tasks: differentiating HER2-negative (i.e., HER2-zero or HER2-low) from HER2-positive tumors (task 1), and differentiating HER2-zero from HER2-low tumors (task 2). For each task, patients from center 1 were randomly assigned in 7:3 ratio to training (task 1: n=405; task 2: n=284) or internal test (task 1: n=173; task 2: n=122) sets; those from center 2 formed an external test set (task 1: n=159; task 2: n=105). Radiomics features were extracted from early-phase dynamic contrast-enhanced images (DCE), T2-weighted images (T2WI), and DWI. For each task, a support vector machine (SVM) was used for feature selection; a multiparametric radiomics score (radscore) was computed using feature weights from SVM correlation coefficients; conventional MRI and combined models were constructed; and model performances were evaluated. SHAP analysis was used to provide local and global interpretations for model outputs. <b>RESULTS:</b> In the external test set, for task 1, AUCs for the conventional MRI model, radscore, and combined model were 0.624, 0.757, and 0.762, respectively; for task 2, AUC for radscore was 0.754, and no conventional MRI model or combined model could be constructed. SHAP analysis identified early-phase DCE features as having the strongest influence for both tasks; T2WI features also had a prominent role for task 2. <b>CONCLUSION:</b> The findings indicate suboptimal performance of MRI radiomics models for noninvasive characterization of HER2 expression. <b>CLINICAL IMPACT:</b> The study provides an example of the use of SHAP interpretation analysis to better understand predictions of imaging-based machine learning models.</p>","PeriodicalId":55529,"journal":{"name":"American Journal of Roentgenology","volume":null,"pages":null},"PeriodicalIF":4.7000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Roentgenology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2214/AJR.24.31717","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
BACKGROUND: MRI radiomics has been explored for three-tiered classification of breast cancer HER2 expression (i.e., HER2-zero, HER2-low, or HER2-positive), although understanding of how such models reach their predictions is lacking. OBJECTIVE: To develop and test multiparametric MRI radiomics machine-learning models for differentiating three-tiered HER2 expression levels in patients with breast cancer, and to explain the contributions of model features through local and global interpretations using SHapley Additive exPlanation (SHAP) analysis. METHODS: This retrospective study included 737 patients (mean age, 54.1±10.6 years) with breast cancer from two centers (center 1: n=578; center 2: n=159), who underwent breast MRI and had HER2 expression determined after excisional biopsy. Analysis entailed two tasks: differentiating HER2-negative (i.e., HER2-zero or HER2-low) from HER2-positive tumors (task 1), and differentiating HER2-zero from HER2-low tumors (task 2). For each task, patients from center 1 were randomly assigned in 7:3 ratio to training (task 1: n=405; task 2: n=284) or internal test (task 1: n=173; task 2: n=122) sets; those from center 2 formed an external test set (task 1: n=159; task 2: n=105). Radiomics features were extracted from early-phase dynamic contrast-enhanced images (DCE), T2-weighted images (T2WI), and DWI. For each task, a support vector machine (SVM) was used for feature selection; a multiparametric radiomics score (radscore) was computed using feature weights from SVM correlation coefficients; conventional MRI and combined models were constructed; and model performances were evaluated. SHAP analysis was used to provide local and global interpretations for model outputs. RESULTS: In the external test set, for task 1, AUCs for the conventional MRI model, radscore, and combined model were 0.624, 0.757, and 0.762, respectively; for task 2, AUC for radscore was 0.754, and no conventional MRI model or combined model could be constructed. SHAP analysis identified early-phase DCE features as having the strongest influence for both tasks; T2WI features also had a prominent role for task 2. CONCLUSION: The findings indicate suboptimal performance of MRI radiomics models for noninvasive characterization of HER2 expression. CLINICAL IMPACT: The study provides an example of the use of SHAP interpretation analysis to better understand predictions of imaging-based machine learning models.
期刊介绍:
Founded in 1907, the monthly American Journal of Roentgenology (AJR) is the world’s longest continuously published general radiology journal. AJR is recognized as among the specialty’s leading peer-reviewed journals and has a worldwide circulation of close to 25,000. The journal publishes clinically-oriented articles across all radiology subspecialties, seeking relevance to radiologists’ daily practice. The journal publishes hundreds of articles annually with a diverse range of formats, including original research, reviews, clinical perspectives, editorials, and other short reports. The journal engages its audience through a spectrum of social media and digital communication activities.