Yao Huang, Xiaoxia Wang, Ying Cao, Mengfei Li, Lan Li, Huifang Chen, Sun Tang, Xiaosong Lan, Fujie Jiang, Jiuquan Zhang
Diagnostic and Interventional Imaging. Journal Article. Publication date: 2024-05-01. DOI: 10.1016/j.diii.2024.01.004. URL: https://www.sciencedirect.com/science/article/pii/S2211568424000160
Multiparametric MRI model to predict molecular subtypes of breast cancer using Shapley additive explanations interpretability analysis
Purpose
The purpose of this study was to assess the predictive performance of multiparametric magnetic resonance imaging (MRI) for molecular subtypes and interpret features using SHapley Additive exPlanations (SHAP) analysis.
Material and methods
Patients with breast cancer who underwent pre-treatment MRI (including ultrafast dynamic contrast-enhanced MRI, magnetic resonance spectroscopy, diffusion kurtosis imaging and intravoxel incoherent motion) were recruited between February 2019 and January 2022. Thirteen semantic and thirteen multiparametric features were collected, and key features were selected using stepwise logistic regression to develop machine-learning models for predicting the molecular subtypes of breast cancer (luminal A, luminal B, triple-negative and HER2-enriched). A semantic model and a multiparametric model were built with five machine-learning classifiers and compared. Model decision-making was interpreted using SHAP analysis.
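To illustrate what a SHAP attribution measures, the following is a minimal sketch that computes exact Shapley values for a single case by enumerating every feature coalition. It is not the study's pipeline: the scoring function `toy_score`, its coefficients, the feature values and the baseline are hypothetical, and features absent from a coalition are simply replaced by a baseline value, which is one common convention and an assumption here.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one sample x.

    Features outside a coalition take their baseline value (an assumption
    of this sketch). Cost grows as 2**n, so this only suits a few features.
    """
    n = len(x)
    features = range(n)

    def value(coalition):
        # Model output when only the coalition's features take the sample's values.
        mixed = [x[i] if i in coalition else baseline[i] for i in features]
        return predict(mixed)

    phi = [0.0] * n
    for i in features:
        others = [j for j in features if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size that exclude feature i.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_score(features):
    """Hypothetical score with made-up coefficients, not the study's model."""
    iauc, kurtosis, d_star = features
    return 0.5 - 0.2 * iauc + 0.3 * kurtosis - 0.1 * d_star * kurtosis


baseline = [1.0, 1.0, 1.0]   # hypothetical reference case
sample = [0.4, 1.5, 0.8]     # hypothetical case: iAUC, kurtosis, D*
phi = shapley_values(toy_score, sample, baseline)

for name, contribution in zip(["iAUC", "kurtosis", "D*"], phi):
    print(f"{name}: {contribution:+.3f}")
```

The contributions sum to the difference between the score for the sample and the score for the baseline, which is the additivity property that gives SHAP its name. Libraries that explain tree ensembles use faster algorithms than this brute-force enumeration.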
Results
A total of 188 women (mean age, 53 ± 11 [standard deviation] years; age range: 25–75 years) were enrolled and divided into a training cohort (131 women) and a validation cohort (57 women). Of the five machine-learning classifiers, XGBoost demonstrated good predictive performance. Within the validation cohort, the areas under the receiver operating characteristic curves (AUCs) for the semantic models ranged from 0.693 (95% confidence interval [CI]: 0.478–0.839) for the HER2-enriched subtype to 0.764 (95% CI: 0.681–0.908) for the luminal A subtype, lower than those of the multiparametric models, which ranged from 0.771 (95% CI: 0.630–0.888) for the HER2-enriched subtype to 0.857 (95% CI: 0.717–0.957) for the triple-negative subtype. However, the differences in AUC between the semantic and multiparametric models were not significant (P range: 0.217–0.640). SHAP analysis revealed that lower iAUC, higher kurtosis, lower D*, and lower kurtosis were distinctive features for the luminal A, luminal B, triple-negative, and HER2-enriched subtypes, respectively.
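As a rough illustration of how an AUC and its 95% confidence interval can be obtained for a one-subtype-versus-rest prediction, the sketch below computes the AUC from pairwise comparisons (the Mann-Whitney formulation) and a percentile bootstrap interval. The abstract does not state how the intervals were derived, so the bootstrap is an assumption, and the labels and scores are invented toy values rather than study data.

```python
import random


def auc(labels, scores):
    """AUC as the share of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(labels, scores, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (an assumed method for this sketch)."""
    rng = random.Random(seed)
    idx = range(len(labels))
    stats = []
    while len(stats) < n_boot:
        resample = [rng.choice(idx) for _ in idx]
        ys = [labels[i] for i in resample]
        # Skip resamples that contain only one class; the AUC is undefined there.
        if 0 < sum(ys) < len(ys):
            stats.append(auc(ys, [scores[i] for i in resample]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Toy data: 1 = target subtype, 0 = any other subtype (invented values).
labels = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.35, 0.6, 0.7, 0.4, 0.3, 0.2, 0.5, 0.1]

point_estimate = auc(labels, scores)
lower, upper = bootstrap_ci(labels, scores)
print(f"AUC = {point_estimate:.3f} (95% CI: {lower:.3f}-{upper:.3f})")
```

With small cohorts the resulting interval is wide, which is consistent with the broad intervals reported for a validation cohort of 57 women.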
Conclusion
Multiparametric MRI models are superior to semantic models for predicting the molecular subtypes of breast cancer.
About the journal:
Diagnostic and Interventional Imaging accepts publications originating from any part of the world based only on their scientific merit. The Journal focuses on richly illustrated articles and aims to help sharpen clinical decision-making skills while following high-level research topics. All articles are published in English.
Diagnostic and Interventional Imaging publishes editorials, technical notes, letters, original and review articles on abdominal, breast, cancer, cardiac, emergency, forensic medicine, head and neck, musculoskeletal, gastrointestinal, genitourinary, interventional, obstetric, pediatric, thoracic and vascular imaging, neuroradiology, nuclear medicine, as well as contrast material, computer developments, health policies and practice, and medical physics relevant to imaging.