Kuo Zhang, Zhongyi Hu, Shuzhi Wu, Lei Xiao, Hui Huang
{"title":"Transmednet:变形医学三合一神经病学网络","authors":"Kuo Zhang, Zhongyi Hu, Shuzhi Wu, Lei Xiao, Hui Huang","doi":"10.1002/cpe.70285","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In recent years, Transformers have gradually gained widespread application in the field of computer vision (CV). However, when applied to medical imaging, common slicing strategies often miss key information along the third dimension. To address this, we propose a novel diagnosis model, Transformer Medical Triad Neurology Networks (TransmedNet), designed to better capture 3D brain image features. The technological innovations of this model are primarily manifested in three aspects: Firstly, the model employs a hierarchical mechanism. The bottommost layer partitions the brain, enhancing the comprehension within each brain region. The intermediate layer employs a segmentation moving window mechanism to extract correlations between adjacent windows. The topmost layer utilizes global multi-head self-attention to focus on overall correlations. Secondly, the model adopts a combination of Transformer and convolutional neural network architectures to balance global and local features, enhancing the overall performance of the model. Lastly, the model thoroughly considers the three-dimensional features of brain images by incorporating a three-dimensional multi-head self-attention mechanism, ensuring equal importance is given to each dimension. Our experiments yielded promising results. The classification accuracy reached 99.21% for distinguishing Alzheimer's disease (AD) from cognitively normal (CN) subjects, and 97.46% for distinguishing autism spectrum disorder (ASD) from CN. The results demonstrate that the TransmedNet model enhances the classification performance of brain imaging diseases.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 23-24","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transmednet: Transformer Medical Triad Neurology Networks\",\"authors\":\"Kuo Zhang, Zhongyi Hu, Shuzhi Wu, Lei Xiao, Hui Huang\",\"doi\":\"10.1002/cpe.70285\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>In recent years, Transformers have gradually gained widespread application in the field of computer vision (CV). However, when applied to medical imaging, common slicing strategies often miss key information along the third dimension. To address this, we propose a novel diagnosis model, Transformer Medical Triad Neurology Networks (TransmedNet), designed to better capture 3D brain image features. The technological innovations of this model are primarily manifested in three aspects: Firstly, the model employs a hierarchical mechanism. The bottommost layer partitions the brain, enhancing the comprehension within each brain region. The intermediate layer employs a segmentation moving window mechanism to extract correlations between adjacent windows. The topmost layer utilizes global multi-head self-attention to focus on overall correlations. Secondly, the model adopts a combination of Transformer and convolutional neural network architectures to balance global and local features, enhancing the overall performance of the model. Lastly, the model thoroughly considers the three-dimensional features of brain images by incorporating a three-dimensional multi-head self-attention mechanism, ensuring equal importance is given to each dimension. Our experiments yielded promising results. The classification accuracy reached 99.21% for distinguishing Alzheimer's disease (AD) from cognitively normal (CN) subjects, and 97.46% for distinguishing autism spectrum disorder (ASD) from CN. The results demonstrate that the TransmedNet model enhances the classification performance of brain imaging diseases.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 23-24\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70285\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70285","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
摘要
近年来,变压器在计算机视觉领域逐渐得到了广泛的应用。然而,当应用于医学成像时,常见的切片策略往往会错过沿第三维的关键信息。为了解决这个问题,我们提出了一个新的诊断模型,Transformer Medical Triad Neurology Networks (TransmedNet),旨在更好地捕捉3D脑图像特征。该模型的技术创新主要表现在三个方面:首先,模型采用了层次化机制。最下面的一层划分了大脑,增强了每个大脑区域的理解力。中间层采用分割移动窗口机制提取相邻窗口之间的相关性。最顶层利用全局多头自关注来关注整体相关性。其次,该模型采用Transformer和卷积神经网络相结合的架构来平衡全局和局部特征,提高了模型的整体性能;最后,该模型结合了三维多头自注意机制,充分考虑了脑图像的三维特征,确保每个维度都得到同等的重视。我们的实验产生了有希望的结果。识别阿尔茨海默病(AD)与认知正常(CN)的准确率为99.21%,识别自闭症谱系障碍(ASD)与认知正常(CN)的准确率为97.46%。结果表明,TransmedNet模型提高了脑成像疾病的分类性能。
Transmednet: Transformer Medical Triad Neurology Networks
In recent years, Transformers have gradually gained widespread application in the field of computer vision (CV). However, when applied to medical imaging, common slicing strategies often miss key information along the third dimension. To address this, we propose a novel diagnosis model, Transformer Medical Triad Neurology Networks (TransmedNet), designed to better capture 3D brain image features. The technological innovations of this model are primarily manifested in three aspects: Firstly, the model employs a hierarchical mechanism. The bottommost layer partitions the brain, enhancing the comprehension within each brain region. The intermediate layer employs a segmentation moving window mechanism to extract correlations between adjacent windows. The topmost layer utilizes global multi-head self-attention to focus on overall correlations. Secondly, the model adopts a combination of Transformer and convolutional neural network architectures to balance global and local features, enhancing the overall performance of the model. Lastly, the model thoroughly considers the three-dimensional features of brain images by incorporating a three-dimensional multi-head self-attention mechanism, ensuring equal importance is given to each dimension. Our experiments yielded promising results. The classification accuracy reached 99.21% for distinguishing Alzheimer's disease (AD) from cognitively normal (CN) subjects, and 97.46% for distinguishing autism spectrum disorder (ASD) from CN. The results demonstrate that the TransmedNet model enhances the classification performance of brain imaging diseases.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.