CS U-NET: A Medical Image Segmentation Method Integrating Spatial and Contextual Attention Mechanisms Based on U-NET

IF 3 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2025-03-26 DOI:10.1002/ima.70072

Zhang Fanyang, Zhang Fan

{"title":"CS U-NET: A Medical Image Segmentation Method Integrating Spatial and Contextual Attention Mechanisms Based on U-NET","authors":"Zhang Fanyang, Zhang Fan","doi":"10.1002/ima.70072","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Medical image segmentation is a crucial process in medical image analysis, with convolutional neural network (CNN)-based methods achieving notable success in recent years. Among these, U-Net has gained widespread use due to its simple yet effective architecture. However, CNNs still struggle to capture global, long-range semantic information. To address this limitation, we present CS U-NET, a novel method built upon Swin-U-Net, which integrates spatial and contextual attention mechanisms. This hybrid approach combines the strengths of both transformers and U-Net architectures to enhance segmentation performance. In this framework, tokenized image patches are processed through a transformer-based U-shaped encoder-decoder, enabling the learning of both local and global semantic features via skip connections. Our method achieves a Dice Similarity Coefficient of 78.64% and a 95% Hausdorff distance of 21.25 on the Synapse multiorgan segmentation dataset, outperforming Trans-U-Net and other state-of-the-art U-Net variants by 4% and 6%, respectively. The experimental results highlight the significant improvements in prediction accuracy and edge detail preservation provided by our approach.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70072","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Medical image segmentation is a crucial process in medical image analysis, with convolutional neural network (CNN)-based methods achieving notable success in recent years. Among these, U-Net has gained widespread use due to its simple yet effective architecture. However, CNNs still struggle to capture global, long-range semantic information. To address this limitation, we present CS U-NET, a novel method built upon Swin-U-Net, which integrates spatial and contextual attention mechanisms. This hybrid approach combines the strengths of both transformers and U-Net architectures to enhance segmentation performance. In this framework, tokenized image patches are processed through a transformer-based U-shaped encoder-decoder, enabling the learning of both local and global semantic features via skip connections. Our method achieves a Dice Similarity Coefficient of 78.64% and a 95% Hausdorff distance of 21.25 on the Synapse multiorgan segmentation dataset, outperforming Trans-U-Net and other state-of-the-art U-Net variants by 4% and 6%, respectively. The experimental results highlight the significant improvements in prediction accuracy and edge detail preservation provided by our approach.

查看原文本刊更多论文

CS U-NET：一种基于U-NET的融合空间注意机制和上下文注意机制的医学图像分割方法

医学图像分割是医学图像分析中的一个关键环节，近年来基于卷积神经网络（CNN）的医学图像分割方法取得了显著的成功。其中，U-Net以其简单有效的架构得到了广泛的应用。然而，cnn仍然难以捕获全局的、远程的语义信息。为了解决这一限制，我们提出了CS U-NET，这是一种基于swing - U-NET的新方法，它集成了空间和上下文注意机制。这种混合方法结合了变压器和U-Net架构的优势，以提高分割性能。在该框架中，标记化的图像补丁通过基于变压器的u形编码器-解码器进行处理，从而通过跳过连接学习局部和全局语义特征。我们的方法在Synapse多器官分割数据集上实现了78.64%的Dice相似系数和21.25的95% Hausdorff距离，分别比Trans-U-Net和其他最先进的U-Net变体高出4%和6%。实验结果表明，该方法在预测精度和边缘细节保留方面有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.