AlzFormer：基于视频的阿尔茨海默病早期诊断时空注意模型

IF 2.8 3区医学 Q2 NEUROSCIENCES

Neuroscience Pub Date : 2025-09-03 DOI:10.1016/j.neuroscience.2025.08.062

Taymaz Akan , Sara Akan , Sait Alp , Christina Raye Ledbetter , Mohammad Alfrad Nobel Bhuiyan , for the Alzheimer’s Disease Neuroimaging Initiative

{"title":"AlzFormer：基于视频的阿尔茨海默病早期诊断时空注意模型","authors":"Taymaz Akan , Sara Akan , Sait Alp , Christina Raye Ledbetter , Mohammad Alfrad Nobel Bhuiyan , for the Alzheimer’s Disease Neuroimaging Initiative","doi":"10.1016/j.neuroscience.2025.08.062","DOIUrl":null,"url":null,"abstract":"<div><div>Early and accurate Alzheimer’s disease (AD) diagnosis is critical for effective intervention, but it is still challenging due to neurodegeneration’s slow and complex progression. Recent studies in brain imaging analysis have highlighted the crucial roles of deep learning techniques in computer-assisted interventions for diagnosing brain diseases. In this study, we propose AlzFormer, a novel deep learning framework based on a space–time attention mechanism, for multiclass classification of AD, MCI, and CN individuals using structural MRI scans. Unlike conventional deep learning models, we used spatiotemporal self-attention to model inter-slice continuity by treating T1-weighted MRI volumes as sequential inputs, where slices correspond to video frames. Our model was fine-tuned and evaluated using 1.5 T MRI scans from the ADNI dataset. To ensure the anatomical consistency of all the MRI data, All MRI volumes were pre-processed with skull stripping and spatial normalization to MNI space. AlzFormer achieved an overall accuracy of 94 % on the test set, with balanced class-wise F1-scores (AD: 0.94, MCI: 0.99, CN: 0.98) and a macro-average AUC of 0.98. We also utilized attention map analysis to identify clinically significant patterns, particularly emphasizing subcortical structures and medial temporal regions implicated in AD. These findings demonstrate the potential of transformer-based architectures for robust and interpretable classification of brain disorders using structural MRI.</div></div>","PeriodicalId":19142,"journal":{"name":"Neuroscience","volume":"585 ","pages":"Pages 133-143"},"PeriodicalIF":2.8000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AlzFormer: Video-based space-time attention model for early diagnosis of Alzheimer’s disease\",\"authors\":\"Taymaz Akan , Sara Akan , Sait Alp , Christina Raye Ledbetter , Mohammad Alfrad Nobel Bhuiyan , for the Alzheimer’s Disease Neuroimaging Initiative\",\"doi\":\"10.1016/j.neuroscience.2025.08.062\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Early and accurate Alzheimer’s disease (AD) diagnosis is critical for effective intervention, but it is still challenging due to neurodegeneration’s slow and complex progression. Recent studies in brain imaging analysis have highlighted the crucial roles of deep learning techniques in computer-assisted interventions for diagnosing brain diseases. In this study, we propose AlzFormer, a novel deep learning framework based on a space–time attention mechanism, for multiclass classification of AD, MCI, and CN individuals using structural MRI scans. Unlike conventional deep learning models, we used spatiotemporal self-attention to model inter-slice continuity by treating T1-weighted MRI volumes as sequential inputs, where slices correspond to video frames. Our model was fine-tuned and evaluated using 1.5 T MRI scans from the ADNI dataset. To ensure the anatomical consistency of all the MRI data, All MRI volumes were pre-processed with skull stripping and spatial normalization to MNI space. AlzFormer achieved an overall accuracy of 94 % on the test set, with balanced class-wise F1-scores (AD: 0.94, MCI: 0.99, CN: 0.98) and a macro-average AUC of 0.98. We also utilized attention map analysis to identify clinically significant patterns, particularly emphasizing subcortical structures and medial temporal regions implicated in AD. These findings demonstrate the potential of transformer-based architectures for robust and interpretable classification of brain disorders using structural MRI.</div></div>\",\"PeriodicalId\":19142,\"journal\":{\"name\":\"Neuroscience\",\"volume\":\"585 \",\"pages\":\"Pages 133-143\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306452225009108\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306452225009108","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

早期和准确的阿尔茨海默病（AD）诊断对于有效的干预至关重要，但由于神经退行性疾病的缓慢和复杂的进展，仍然具有挑战性。最近的脑成像分析研究强调了深度学习技术在诊断脑部疾病的计算机辅助干预中的关键作用。在这项研究中，我们提出了一种基于时空注意机制的新型深度学习框架AlzFormer，用于使用结构MRI扫描对AD， MCI和CN个体进行多类别分类。与传统的深度学习模型不同，我们通过将t1加权MRI体积作为顺序输入（其中切片对应于视频帧），使用时空自注意来模拟片间连续性。我们的模型进行了微调，并使用来自ADNI数据集的1.5 T MRI扫描进行了评估。为了确保所有MRI数据的解剖一致性，所有MRI体积都经过颅骨剥离和空间归一化到MNI空间的预处理。AlzFormer在测试集中实现了94%的总体准确率，具有平衡的分类f1分数（AD: 0.94, MCI: 0.99, CN: 0.98）和宏观平均AUC为0.98。我们还利用注意图分析来确定具有临床意义的模式，特别强调与AD有关的皮层下结构和内侧颞区。这些发现证明了基于变压器的结构在使用结构MRI对大脑疾病进行稳健和可解释的分类方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

AlzFormer: Video-based space-time attention model for early diagnosis of Alzheimer’s disease

Early and accurate Alzheimer’s disease (AD) diagnosis is critical for effective intervention, but it is still challenging due to neurodegeneration’s slow and complex progression. Recent studies in brain imaging analysis have highlighted the crucial roles of deep learning techniques in computer-assisted interventions for diagnosing brain diseases. In this study, we propose AlzFormer, a novel deep learning framework based on a space–time attention mechanism, for multiclass classification of AD, MCI, and CN individuals using structural MRI scans. Unlike conventional deep learning models, we used spatiotemporal self-attention to model inter-slice continuity by treating T1-weighted MRI volumes as sequential inputs, where slices correspond to video frames. Our model was fine-tuned and evaluated using 1.5 T MRI scans from the ADNI dataset. To ensure the anatomical consistency of all the MRI data, All MRI volumes were pre-processed with skull stripping and spatial normalization to MNI space. AlzFormer achieved an overall accuracy of 94 % on the test set, with balanced class-wise F1-scores (AD: 0.94, MCI: 0.99, CN: 0.98) and a macro-average AUC of 0.98. We also utilized attention map analysis to identify clinically significant patterns, particularly emphasizing subcortical structures and medial temporal regions implicated in AD. These findings demonstrate the potential of transformer-based architectures for robust and interpretable classification of brain disorders using structural MRI.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Neuroscience 医学-神经科学

CiteScore

6.20

自引率

0.00%

发文量

394

审稿时长

52 days

期刊介绍： Neuroscience publishes papers describing the results of original research on any aspect of the scientific study of the nervous system. Any paper, however short, will be considered for publication provided that it reports significant, new and carefully confirmed findings with full experimental details.