{"title":"A 3D efficient and essentialized swin transformer network for alzheimer’s disease diagnosis","authors":"Shengchao Huang, Qun Dai","doi":"10.1007/s10489-025-06884-6","DOIUrl":null,"url":null,"abstract":"<div><p>Deep learning methods (e.g., convolutional neural networks, CNNs) have been widely applied to Alzheimer’s disease diagnosis based on structural magnetic resonance imaging (sMRI) data. However, CNN-based methods face significant Limitations in capturing the global feature distribution of the whole brain. Transformer-based models have shown promise in addressing this issue, but they often sacrifice local feature sensitivity. Moreover, the large number of parameters in Transformer-based models results in a strong dependence on large-scale datasets, which is difficult to satisfy in real-world 3D medical imaging scenarios. Through comprehensive consideration, we propose a 3D Efficient and Essentialized Swin Transformer Network (E2STN) to strike a balance between being lightweight and comprehensive feature extraction, thereby boosting Alzheimer’s disease diagnosis performance in 3D dataset scenarios. Specifically, E2STN includes four modules: an Efficient Swin Transformer (EST) module for identifying global structural information and being lightweight to reduce reliance on large-scale datasets, which is a novel task-oriented Transformer architecture; a Focused Feature Enhancement Convolution Unit (FFE-CU) for enhancing lesion details, thereby compensating for the limited perception of fine-grained pathological information by the Transformer; a Disease Risk Map generator (DRMg) for visualizing pathological regions; and an ROI-based classifier for precise categorization. Our proposed method has been validated by two diagnosis tasks (i.e., Alzheimer’s disease diagnosis and mild cognitive impairment conversion prediction) on the ADNI dataset. Compared to several state-of-the-art methods, our model demonstrates superior performance.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 15","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06884-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning methods (e.g., convolutional neural networks, CNNs) have been widely applied to Alzheimer’s disease diagnosis based on structural magnetic resonance imaging (sMRI) data. However, CNN-based methods face significant Limitations in capturing the global feature distribution of the whole brain. Transformer-based models have shown promise in addressing this issue, but they often sacrifice local feature sensitivity. Moreover, the large number of parameters in Transformer-based models results in a strong dependence on large-scale datasets, which is difficult to satisfy in real-world 3D medical imaging scenarios. Through comprehensive consideration, we propose a 3D Efficient and Essentialized Swin Transformer Network (E2STN) to strike a balance between being lightweight and comprehensive feature extraction, thereby boosting Alzheimer’s disease diagnosis performance in 3D dataset scenarios. Specifically, E2STN includes four modules: an Efficient Swin Transformer (EST) module for identifying global structural information and being lightweight to reduce reliance on large-scale datasets, which is a novel task-oriented Transformer architecture; a Focused Feature Enhancement Convolution Unit (FFE-CU) for enhancing lesion details, thereby compensating for the limited perception of fine-grained pathological information by the Transformer; a Disease Risk Map generator (DRMg) for visualizing pathological regions; and an ROI-based classifier for precise categorization. Our proposed method has been validated by two diagnosis tasks (i.e., Alzheimer’s disease diagnosis and mild cognitive impairment conversion prediction) on the ADNI dataset. Compared to several state-of-the-art methods, our model demonstrates superior performance.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.