Multicenter investigation of preoperative distinction between primary central nervous system lymphomas and glioblastomas through interpretable artificial intelligence models.
Yun-Feng Yang, Endong Zhao, Yutong Shi, Hao Zhang, Yuan-Yuan Yang
{"title":"Multicenter investigation of preoperative distinction between primary central nervous system lymphomas and glioblastomas through interpretable artificial intelligence models.","authors":"Yun-Feng Yang, Endong Zhao, Yutong Shi, Hao Zhang, Yuan-Yuan Yang","doi":"10.1007/s00234-024-03451-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Research into the effectiveness and applicability of deep learning, radiomics, and their integrated models based on Magnetic Resonance Imaging (MRI) for preoperative differentiation between Primary Central Nervous System Lymphoma (PCNSL) and Glioblastoma (GBM), along with an exploration of the interpretability of these models.</p><p><strong>Materials and methods: </strong>A retrospective analysis was performed on MRI images and clinical data from 261 patients across two medical centers. The data were split into a training set (n = 153, medical center 1) and an external test set (n = 108, medical center 2). Radiomic features were extracted using Pyradiomics to build the Radiomics Model. Deep learning networks, including the transformer-based MobileVIT Model and Convolutional Neural Networks (CNN) based ConvNeXt Model, were trained separately. By applying the \"late fusion\" theory, the radiomics model and deep learning model were fused to produce the optimal Max-Fusion Model. Additionally, Shapley Additive exPlanations (SHAP) and Grad-CAM were employed for interpretability analysis.</p><p><strong>Results: </strong>In the external test set, the Radiomics Model achieved an Area under the receiver operating characteristic curve (AUC) of 0.86, the MobileVIT Model had an AUC of 0.91, the ConvNeXt Model demonstrated an AUC of 0.89, and the Max-Fusion Model showed an AUC of 0.92. The Delong test revealed a significant difference in AUC between the Max-Fusion Model and the Radiomics Model (P = 0.02).</p><p><strong>Conclusion: </strong>The Max-Fusion Model, combining different models, presents superior performance in distinguishing PCNSL and GBM, highlighting the effectiveness of model fusion for enhanced decision-making in medical applications.</p><p><strong>Clinical relevance statement: </strong>The preoperative non-invasive differentiation between PCNSL and GBM assists clinicians in selecting appropriate treatment regimens and clinical management strategies.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00234-024-03451-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/3 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: Research into the effectiveness and applicability of deep learning, radiomics, and their integrated models based on Magnetic Resonance Imaging (MRI) for preoperative differentiation between Primary Central Nervous System Lymphoma (PCNSL) and Glioblastoma (GBM), along with an exploration of the interpretability of these models.
Materials and methods: A retrospective analysis was performed on MRI images and clinical data from 261 patients across two medical centers. The data were split into a training set (n = 153, medical center 1) and an external test set (n = 108, medical center 2). Radiomic features were extracted using Pyradiomics to build the Radiomics Model. Deep learning networks, including the transformer-based MobileVIT Model and Convolutional Neural Networks (CNN) based ConvNeXt Model, were trained separately. By applying the "late fusion" theory, the radiomics model and deep learning model were fused to produce the optimal Max-Fusion Model. Additionally, Shapley Additive exPlanations (SHAP) and Grad-CAM were employed for interpretability analysis.
Results: In the external test set, the Radiomics Model achieved an Area under the receiver operating characteristic curve (AUC) of 0.86, the MobileVIT Model had an AUC of 0.91, the ConvNeXt Model demonstrated an AUC of 0.89, and the Max-Fusion Model showed an AUC of 0.92. The Delong test revealed a significant difference in AUC between the Max-Fusion Model and the Radiomics Model (P = 0.02).
Conclusion: The Max-Fusion Model, combining different models, presents superior performance in distinguishing PCNSL and GBM, highlighting the effectiveness of model fusion for enhanced decision-making in medical applications.
Clinical relevance statement: The preoperative non-invasive differentiation between PCNSL and GBM assists clinicians in selecting appropriate treatment regimens and clinical management strategies.