AlzFormer: Multi-modal framework for Alzheimer’s classification using MRI and graph-embedded demographics guided by adaptive attention gating

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Computerized Medical Imaging and Graphics Pub Date : 2025-08-20 DOI:10.1016/j.compmedimag.2025.102638

Sayyed Shahid Hussain , Xu Degang , Pir Masoom Shah , Hikmat Khan , Adnan Zeb

{"title":"AlzFormer: Multi-modal framework for Alzheimer’s classification using MRI and graph-embedded demographics guided by adaptive attention gating","authors":"Sayyed Shahid Hussain , Xu Degang , Pir Masoom Shah , Hikmat Khan , Adnan Zeb","doi":"10.1016/j.compmedimag.2025.102638","DOIUrl":null,"url":null,"abstract":"<div><div>Alzheimer’s disease (AD) is the most common neurodegenerative progressive disorder and the fifth-leading cause of death in older people. The detection of AD is a very challenging task for clinicians and radiologists due to the complex nature of this disease, thus requiring automatic data-driven machine-learning models to enhance diagnostic accuracy and support expert decision-making. However, machine learning models are hindered by three key limitations, in AD classification:(i) diffuse and subtle structural changes in the brain that make it difficult to capture global pathology (ii) non-uniform alterations across MRI planes, which limit single-view learning and (iii) the lack of deep integration of demographic context, which is often ignored despite its clinical importance. To address these challenges in this paper, we propose a novel multi-modal deep learning framework, named AlzFormer, that dynamically integrates 3D MRI with demographic features represented as knowledge graph embeddings for AD classification. Specifically, (i) to capture global and volumetric features, a 3D CNN is employed; (ii) to model plane-specific information, three parallel 2D CNNs are used for tri-planar processing (axial, coronal, sagittal), combined with a Transformer encoder; and (iii) to incorporate demographic context, we integrate demographic features as knowledge graph embeddings through a novel Adaptive Attention Gating mechanism that balances contributions from both modalities (i.e., MRI and demographics). Comprehensive experiments on two real-world datasets, including generalization tests, ablation studies, and robustness evaluation under noisy conditions, demonstrate that the proposed model provides a robust and effective solution for AD diagnosis. These results suggest strong potential for integration into Clinical Decision Support Systems (CDSS), offering a more interpretable and personalized approach to early Alzheimer’s detection.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"124 ","pages":"Article 102638"},"PeriodicalIF":4.9000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125001478","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Alzheimer’s disease (AD) is the most common neurodegenerative progressive disorder and the fifth-leading cause of death in older people. The detection of AD is a very challenging task for clinicians and radiologists due to the complex nature of this disease, thus requiring automatic data-driven machine-learning models to enhance diagnostic accuracy and support expert decision-making. However, machine learning models are hindered by three key limitations, in AD classification:(i) diffuse and subtle structural changes in the brain that make it difficult to capture global pathology (ii) non-uniform alterations across MRI planes, which limit single-view learning and (iii) the lack of deep integration of demographic context, which is often ignored despite its clinical importance. To address these challenges in this paper, we propose a novel multi-modal deep learning framework, named AlzFormer, that dynamically integrates 3D MRI with demographic features represented as knowledge graph embeddings for AD classification. Specifically, (i) to capture global and volumetric features, a 3D CNN is employed; (ii) to model plane-specific information, three parallel 2D CNNs are used for tri-planar processing (axial, coronal, sagittal), combined with a Transformer encoder; and (iii) to incorporate demographic context, we integrate demographic features as knowledge graph embeddings through a novel Adaptive Attention Gating mechanism that balances contributions from both modalities (i.e., MRI and demographics). Comprehensive experiments on two real-world datasets, including generalization tests, ablation studies, and robustness evaluation under noisy conditions, demonstrate that the proposed model provides a robust and effective solution for AD diagnosis. These results suggest strong potential for integration into Clinical Decision Support Systems (CDSS), offering a more interpretable and personalized approach to early Alzheimer’s detection.

查看原文本刊更多论文

AlzFormer：基于自适应注意力门控的MRI和图形嵌入人口统计学指导下的阿尔茨海默病多模态分类框架

阿尔茨海默病（AD）是最常见的神经退行性进行性疾病，也是老年人死亡的第五大原因。由于这种疾病的复杂性，对临床医生和放射科医生来说，检测AD是一项非常具有挑战性的任务，因此需要自动数据驱动的机器学习模型来提高诊断准确性并支持专家决策。然而，机器学习模型在AD分类中受到三个关键限制的阻碍：(i)大脑中弥漫性和微妙的结构变化使得难以捕捉全局病理；（ii） MRI平面上的不均匀变化限制了单视图学习；（iii）缺乏人口背景的深度整合，尽管它具有临床重要性，但经常被忽视。为了解决这些挑战，我们提出了一个新的多模态深度学习框架，名为AlzFormer，它动态地将3D MRI与表示为知识图嵌入的人口特征集成在一起，用于AD分类。具体来说，(i)为了捕获全局和体积特征，采用了3D CNN；（ii）为了模拟特定平面的信息，使用三个平行的二维cnn进行三平面处理（轴向、冠状、矢状），并结合Transformer编码器；（iii）结合人口背景，我们通过一种新的自适应注意门控机制将人口特征整合为知识图嵌入，该机制平衡了两种模式（即MRI和人口统计学）的贡献。在两个真实数据集上的综合实验，包括泛化测试、消融研究和噪声条件下的鲁棒性评估，表明该模型为AD诊断提供了鲁棒性和有效性的解决方案。这些结果表明整合到临床决策支持系统（CDSS）的强大潜力，为早期阿尔茨海默病的检测提供更可解释和个性化的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computerized Medical Imaging and Graphics 医学-核医学

CiteScore

10.70

自引率

3.50%

发文量

审稿时长

26 days

期刊介绍： The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.