可解释的脑年龄预测：形态计量学和深度学习管道的比较评估。

IF 4.5 Q1 Computer Science

Brain Informatics Pub Date : 2024-12-18 DOI:10.1186/s40708-024-00244-9

Maria Luigia Natalia De Bonis, Giuseppe Fasano, Angela Lombardi, Carmelo Ardito, Antonio Ferrara, Eugenio Di Sciascio, Tommaso Di Noia

{"title":"可解释的脑年龄预测：形态计量学和深度学习管道的比较评估。","authors":"Maria Luigia Natalia De Bonis, Giuseppe Fasano, Angela Lombardi, Carmelo Ardito, Antonio Ferrara, Eugenio Di Sciascio, Tommaso Di Noia","doi":"10.1186/s40708-024-00244-9","DOIUrl":null,"url":null,"abstract":"Brain age, a biomarker reflecting brain health relative to chronological age, is increasingly used in neuroimaging to detect early signs of neurodegenerative diseases and support personalized treatment plans. Two primary approaches for brain age prediction have emerged: morphometric feature extraction from MRI scans and deep learning (DL) applied to raw MRI data. However, a systematic comparison of these methods regarding performance, interpretability, and clinical utility has been limited. In this study, we present a comparative evaluation of two pipelines: one using morphometric features from FreeSurfer and the other employing 3D convolutional neural networks (CNNs). Using a multisite neuroimaging dataset, we assessed both model performance and the interpretability of predictions through eXplainable Artificial Intelligence (XAI) methods, applying SHAP to the feature-based pipeline and Grad-CAM and DeepSHAP to the CNN-based pipeline. Our results show comparable performance between the two pipelines in Leave-One-Site-Out (LOSO) validation, achieving state-of-the-art performance on the independent test set ( <math><mrow><mi>M</mi> <mi>A</mi> <mi>E</mi> <mo>=</mo> <mn>3.21</mn></mrow> </math> with DNN and morphometric features and <math><mrow><mi>M</mi> <mi>A</mi> <mi>E</mi> <mo>=</mo> <mn>3.08</mn></mrow> </math> with a DenseNet-121 architecture). SHAP provided the most consistent and interpretable results, while DeepSHAP exhibited greater variability. Further work is needed to assess the clinical utility of Grad-CAM. This study addresses a critical gap by systematically comparing the interpretability of multiple XAI methods across distinct brain age prediction pipelines. Our findings underscore the importance of integrating XAI into clinical practice, offering insights into how XAI outputs vary and their potential utility for clinicians.","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"11 1","pages":"33"},"PeriodicalIF":4.5000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11655902/pdf/","citationCount":"0","resultStr":"{\"title\":\"Explainable brain age prediction: a comparative evaluation of morphometric and deep learning pipelines.\",\"authors\":\"Maria Luigia Natalia De Bonis, Giuseppe Fasano, Angela Lombardi, Carmelo Ardito, Antonio Ferrara, Eugenio Di Sciascio, Tommaso Di Noia\",\"doi\":\"10.1186/s40708-024-00244-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Brain age, a biomarker reflecting brain health relative to chronological age, is increasingly used in neuroimaging to detect early signs of neurodegenerative diseases and support personalized treatment plans. Two primary approaches for brain age prediction have emerged: morphometric feature extraction from MRI scans and deep learning (DL) applied to raw MRI data. However, a systematic comparison of these methods regarding performance, interpretability, and clinical utility has been limited. In this study, we present a comparative evaluation of two pipelines: one using morphometric features from FreeSurfer and the other employing 3D convolutional neural networks (CNNs). Using a multisite neuroimaging dataset, we assessed both model performance and the interpretability of predictions through eXplainable Artificial Intelligence (XAI) methods, applying SHAP to the feature-based pipeline and Grad-CAM and DeepSHAP to the CNN-based pipeline. Our results show comparable performance between the two pipelines in Leave-One-Site-Out (LOSO) validation, achieving state-of-the-art performance on the independent test set ( <math><mrow><mi>M</mi> <mi>A</mi> <mi>E</mi> <mo>=</mo> <mn>3.21</mn></mrow> </math> with DNN and morphometric features and <math><mrow><mi>M</mi> <mi>A</mi> <mi>E</mi> <mo>=</mo> <mn>3.08</mn></mrow> </math> with a DenseNet-121 architecture). SHAP provided the most consistent and interpretable results, while DeepSHAP exhibited greater variability. Further work is needed to assess the clinical utility of Grad-CAM. This study addresses a critical gap by systematically comparing the interpretability of multiple XAI methods across distinct brain age prediction pipelines. Our findings underscore the importance of integrating XAI into clinical practice, offering insights into how XAI outputs vary and their potential utility for clinicians.\",\"PeriodicalId\":37465,\"journal\":{\"name\":\"Brain Informatics\",\"volume\":\"11 1\",\"pages\":\"33\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2024-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11655902/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Brain Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s40708-024-00244-9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40708-024-00244-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

摘要

脑年龄是一种反映相对于实足年龄的大脑健康的生物标志物，越来越多地用于神经影像学，以检测神经退行性疾病的早期迹象，并支持个性化的治疗计划。脑年龄预测的两种主要方法已经出现：从MRI扫描中提取形态特征和应用于原始MRI数据的深度学习（DL）。然而，关于这些方法的性能、可解释性和临床应用的系统比较是有限的。在这项研究中，我们对两个管道进行了比较评估：一个使用FreeSurfer的形态特征，另一个使用3D卷积神经网络（cnn）。使用多站点神经成像数据集，我们通过可解释人工智能（eXplainable Artificial Intelligence， XAI）方法评估了模型性能和预测的可解释性，将SHAP应用于基于特征的管道，将Grad-CAM和DeepSHAP应用于基于cnn的管道。我们的研究结果显示，在LOSO验证中，两个管道之间的性能相当，在独立测试集上实现了最先进的性能（DNN和形态特征的平均分= 3.21，DenseNet-121架构的平均分= 3.08）。SHAP提供了最一致和可解释的结果，而DeepSHAP表现出更大的变异性。需要进一步的工作来评估Grad-CAM的临床应用。本研究通过系统地比较不同脑年龄预测管道中多种XAI方法的可解释性，解决了一个关键的差距。我们的研究结果强调了将XAI整合到临床实践中的重要性，提供了关于XAI输出如何变化及其对临床医生的潜在效用的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Explainable brain age prediction: a comparative evaluation of morphometric and deep learning pipelines.

Brain age, a biomarker reflecting brain health relative to chronological age, is increasingly used in neuroimaging to detect early signs of neurodegenerative diseases and support personalized treatment plans. Two primary approaches for brain age prediction have emerged: morphometric feature extraction from MRI scans and deep learning (DL) applied to raw MRI data. However, a systematic comparison of these methods regarding performance, interpretability, and clinical utility has been limited. In this study, we present a comparative evaluation of two pipelines: one using morphometric features from FreeSurfer and the other employing 3D convolutional neural networks (CNNs). Using a multisite neuroimaging dataset, we assessed both model performance and the interpretability of predictions through eXplainable Artificial Intelligence (XAI) methods, applying SHAP to the feature-based pipeline and Grad-CAM and DeepSHAP to the CNN-based pipeline. Our results show comparable performance between the two pipelines in Leave-One-Site-Out (LOSO) validation, achieving state-of-the-art performance on the independent test set ( $M A E = 3.21$ with DNN and morphometric features and $M A E = 3.08$ with a DenseNet-121 architecture). SHAP provided the most consistent and interpretable results, while DeepSHAP exhibited greater variability. Further work is needed to assess the clinical utility of Grad-CAM. This study addresses a critical gap by systematically comparing the interpretability of multiple XAI methods across distinct brain age prediction pipelines. Our findings underscore the importance of integrating XAI into clinical practice, offering insights into how XAI outputs vary and their potential utility for clinicians.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Brain Informatics Computer Science-Computer Science Applications

CiteScore

9.50

自引率

0.00%

发文量

审稿时长

13 weeks

期刊介绍： Brain Informatics is an international, peer-reviewed, interdisciplinary open-access journal published under the brand SpringerOpen, which provides a unique platform for researchers and practitioners to disseminate original research on computational and informatics technologies related to brain. This journal addresses the computational, cognitive, physiological, biological, physical, ecological and social perspectives of brain informatics. It also welcomes emerging information technologies and advanced neuro-imaging technologies, such as big data analytics and interactive knowledge discovery related to various large-scale brain studies and their applications. This journal will publish high-quality original research papers, brief reports and critical reviews in all theoretical, technological, clinical and interdisciplinary studies that make up the field of brain informatics and its applications in brain-machine intelligence, brain-inspired intelligent systems, mental health and brain disorders, etc. The scope of papers includes the following five tracks: Track 1: Cognitive and Computational Foundations of Brain Science Track 2: Human Information Processing Systems Track 3: Brain Big Data Analytics, Curation and Management Track 4: Informatics Paradigms for Brain and Mental Health Research Track 5: Brain-Machine Intelligence and Brain-Inspired Computing