SAMP-Net: a medical image segmentation network with split attention and multi-layer perceptron.

IF 2.6 4区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Medical & Biological Engineering & Computing Pub Date : 2025-08-01 Epub Date: 2025-03-11 DOI:10.1007/s11517-025-03331-z

Xiaoxuan Ma, Sihan Shan, Dong Sui

{"title":"SAMP-Net: a medical image segmentation network with split attention and multi-layer perceptron.","authors":"Xiaoxuan Ma, Sihan Shan, Dong Sui","doi":"10.1007/s11517-025-03331-z","DOIUrl":null,"url":null,"abstract":"<p><p>Convolutional neural networks (CNNs) have achieved remarkable success in computer vision, particularly in medical image segmentation. U-Net, a prominent architecture, marked a major breakthrough and remains widely used in practice. However, its uniform downsampling strategy and simple stacking of convolutional layers in the encoder limit its ability to capture rich features at multiple depths, reducing its efficiency for rapid image processing. To address these limitations, this paper proposes a novel segmentation network that integrates attention mechanisms with multilayer perceptrons (MLPs). The network is designed to progressively capture and refine features at different levels. At the low-level layers, the primary feature conservation (PFC) block is introduced to preserve essential spatial details and reduce the loss of primary features during downsampling. In the mid-level layers, the compact attention block (CAB) enhances feature interaction through a multi-path attention structure, improving the network's ability to capture diverse semantic information. At the high-level layers, Shift MLP and Tokenized MLP blocks are incorporated. The Shift MLP block shifts feature channels along different axes, allowing for enhanced local feature modeling by focusing on specific regions of the convolutional features. The Tokenized MLP block converts these features into abstract tokens and leverages MLPs to model their representations in the latent space, effectively reducing the number of parameters and computational complexity while improving segmentation performance. The experiments conducted on the colorectal cancer tumor dataset CCI and the public dataset ISIC-2018 demonstrate that the proposed method significantly outperforms U-Net, U-Net++, Swin-U-Net, Attention U-Net, and RA-U-Net in terms of performance, with average improvements of 6.67%, 5.53%, 10.18%, 4.78%, and 3.55%, respectively. The code is available at the following link: https://github.com/QingTianer/SAMP-Net.git.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"2437-2450"},"PeriodicalIF":2.6000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03331-z","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/11 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Convolutional neural networks (CNNs) have achieved remarkable success in computer vision, particularly in medical image segmentation. U-Net, a prominent architecture, marked a major breakthrough and remains widely used in practice. However, its uniform downsampling strategy and simple stacking of convolutional layers in the encoder limit its ability to capture rich features at multiple depths, reducing its efficiency for rapid image processing. To address these limitations, this paper proposes a novel segmentation network that integrates attention mechanisms with multilayer perceptrons (MLPs). The network is designed to progressively capture and refine features at different levels. At the low-level layers, the primary feature conservation (PFC) block is introduced to preserve essential spatial details and reduce the loss of primary features during downsampling. In the mid-level layers, the compact attention block (CAB) enhances feature interaction through a multi-path attention structure, improving the network's ability to capture diverse semantic information. At the high-level layers, Shift MLP and Tokenized MLP blocks are incorporated. The Shift MLP block shifts feature channels along different axes, allowing for enhanced local feature modeling by focusing on specific regions of the convolutional features. The Tokenized MLP block converts these features into abstract tokens and leverages MLPs to model their representations in the latent space, effectively reducing the number of parameters and computational complexity while improving segmentation performance. The experiments conducted on the colorectal cancer tumor dataset CCI and the public dataset ISIC-2018 demonstrate that the proposed method significantly outperforms U-Net, U-Net++, Swin-U-Net, Attention U-Net, and RA-U-Net in terms of performance, with average improvements of 6.67%, 5.53%, 10.18%, 4.78%, and 3.55%, respectively. The code is available at the following link: https://github.com/QingTianer/SAMP-Net.git.

查看原文本刊更多论文

SAMP-Net：一种分离注意和多层感知器的医学图像分割网络。

卷积神经网络（cnn）在计算机视觉领域取得了显著的成功，特别是在医学图像分割方面。U-Net是一种杰出的架构，标志着重大突破，并在实践中得到广泛应用。然而，其均匀的下采样策略和编码器中卷积层的简单堆叠限制了其在多个深度捕获丰富特征的能力，降低了其快速图像处理的效率。为了解决这些限制，本文提出了一种新的分割网络，该网络将注意力机制与多层感知器（mlp）相结合。该网络旨在逐步捕获和细化不同层次的特征。在低层，引入了主要特征守恒（PFC）块，以保留基本的空间细节，减少下采样过程中主要特征的损失。在中间层，紧凑注意块（CAB）通过多路径注意结构增强了特征交互，提高了网络捕获多种语义信息的能力。在高层，Shift MLP和Tokenized MLP块被合并。Shift MLP块沿着不同的轴移动特征通道，通过关注卷积特征的特定区域来增强局部特征建模。Tokenized MLP块将这些特征转换为抽象令牌，并利用MLP在潜在空间中对其表示进行建模，有效地减少了参数数量和计算复杂度，同时提高了分割性能。在结直肠癌肿瘤数据集CCI和公共数据集ISIC-2018上进行的实验表明，该方法的性能显著优于U-Net、U-Net++、swing -U-Net、Attention U-Net和RA-U-Net，平均分别提高6.67%、5.53%、10.18%、4.78%和3.55%。代码可从以下链接获得：https://github.com/QingTianer/SAMP-Net.git。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical & Biological Engineering & Computing 医学-工程：生物医学

CiteScore

6.00

自引率

3.10%

发文量

249

审稿时长

3.5 months

期刊介绍： Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging. MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field. MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).