Mir Nafiul Nagib;Rahat Pervez;Afsana Alam Nova;Hadiur Rahman Nabil;Zeyar Aung;M. F. Mridha
{"title":"TuSegNet: A Transformer-Based and Attention-Enhanced Architecture for Brain Tumor Segmentation","authors":"Mir Nafiul Nagib;Rahat Pervez;Afsana Alam Nova;Hadiur Rahman Nabil;Zeyar Aung;M. F. Mridha","doi":"10.1109/OJCS.2025.3569758","DOIUrl":null,"url":null,"abstract":"Brain tumor segmentation is crucial in medical imaging, allowing informed diagnosis and treatment planning. In this study, we propose TuSegNet, a new transformer-based and attention-enhanced architecture for robust brain tumor segmentation. The model combines convolutional layers with transformer blocks for global context awareness, incorporates Atrous Spatial Pyramid Pooling (ASPP) for multi-scale feature extraction, and employs channel attention mechanisms to concentrate on tumor-relevant parts. Evaluated on three datasets—Dataset A, Dataset B, and a combined dataset—TuSegNet achieves state-of-the-art performance with a Dice Similarity Coefficient (DSC) of 0.895, 0.910, and 0.930, respectively, and an Intersection over Union (IoU) of 0.820, 0.835, and 0.860. Ablation studies validate the importance of ASPP and attention mechanisms, while comparative analysis demonstrates outstanding performance over existing SOTA models such as Swin UNet and TransUNet. The proposed methodology improves segmentation accuracy and highlights the importance of hybrid architectures in handling complex medical imaging tasks. These developments underscore the potential of TuSegNet for real-world healthcare applications in brain tumor diagnosis.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"6 ","pages":"750-761"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11002687","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Computer Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11002687/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Brain tumor segmentation is crucial in medical imaging, allowing informed diagnosis and treatment planning. In this study, we propose TuSegNet, a new transformer-based and attention-enhanced architecture for robust brain tumor segmentation. The model combines convolutional layers with transformer blocks for global context awareness, incorporates Atrous Spatial Pyramid Pooling (ASPP) for multi-scale feature extraction, and employs channel attention mechanisms to concentrate on tumor-relevant parts. Evaluated on three datasets—Dataset A, Dataset B, and a combined dataset—TuSegNet achieves state-of-the-art performance with a Dice Similarity Coefficient (DSC) of 0.895, 0.910, and 0.930, respectively, and an Intersection over Union (IoU) of 0.820, 0.835, and 0.860. Ablation studies validate the importance of ASPP and attention mechanisms, while comparative analysis demonstrates outstanding performance over existing SOTA models such as Swin UNet and TransUNet. The proposed methodology improves segmentation accuracy and highlights the importance of hybrid architectures in handling complex medical imaging tasks. These developments underscore the potential of TuSegNet for real-world healthcare applications in brain tumor diagnosis.