SAMP-Net: a medical image segmentation network with split attention and multi-layer perceptron.

IF 2.6 4区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Xiaoxuan Ma, Sihan Shan, Dong Sui
{"title":"SAMP-Net: a medical image segmentation network with split attention and multi-layer perceptron.","authors":"Xiaoxuan Ma, Sihan Shan, Dong Sui","doi":"10.1007/s11517-025-03331-z","DOIUrl":null,"url":null,"abstract":"<p><p>Convolutional neural networks (CNNs) have achieved remarkable success in computer vision, particularly in medical image segmentation. U-Net, a prominent architecture, marked a major breakthrough and remains widely used in practice. However, its uniform downsampling strategy and simple stacking of convolutional layers in the encoder limit its ability to capture rich features at multiple depths, reducing its efficiency for rapid image processing. To address these limitations, this paper proposes a novel segmentation network that integrates attention mechanisms with multilayer perceptrons (MLPs). The network is designed to progressively capture and refine features at different levels. At the low-level layers, the primary feature conservation (PFC) block is introduced to preserve essential spatial details and reduce the loss of primary features during downsampling. In the mid-level layers, the compact attention block (CAB) enhances feature interaction through a multi-path attention structure, improving the network's ability to capture diverse semantic information. At the high-level layers, Shift MLP and Tokenized MLP blocks are incorporated. The Shift MLP block shifts feature channels along different axes, allowing for enhanced local feature modeling by focusing on specific regions of the convolutional features. The Tokenized MLP block converts these features into abstract tokens and leverages MLPs to model their representations in the latent space, effectively reducing the number of parameters and computational complexity while improving segmentation performance. The experiments conducted on the colorectal cancer tumor dataset CCI and the public dataset ISIC-2018 demonstrate that the proposed method significantly outperforms U-Net, U-Net++, Swin-U-Net, Attention U-Net, and RA-U-Net in terms of performance, with average improvements of 6.67%, 5.53%, 10.18%, 4.78%, and 3.55%, respectively. The code is available at the following link: https://github.com/QingTianer/SAMP-Net.git.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03331-z","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Convolutional neural networks (CNNs) have achieved remarkable success in computer vision, particularly in medical image segmentation. U-Net, a prominent architecture, marked a major breakthrough and remains widely used in practice. However, its uniform downsampling strategy and simple stacking of convolutional layers in the encoder limit its ability to capture rich features at multiple depths, reducing its efficiency for rapid image processing. To address these limitations, this paper proposes a novel segmentation network that integrates attention mechanisms with multilayer perceptrons (MLPs). The network is designed to progressively capture and refine features at different levels. At the low-level layers, the primary feature conservation (PFC) block is introduced to preserve essential spatial details and reduce the loss of primary features during downsampling. In the mid-level layers, the compact attention block (CAB) enhances feature interaction through a multi-path attention structure, improving the network's ability to capture diverse semantic information. At the high-level layers, Shift MLP and Tokenized MLP blocks are incorporated. The Shift MLP block shifts feature channels along different axes, allowing for enhanced local feature modeling by focusing on specific regions of the convolutional features. The Tokenized MLP block converts these features into abstract tokens and leverages MLPs to model their representations in the latent space, effectively reducing the number of parameters and computational complexity while improving segmentation performance. The experiments conducted on the colorectal cancer tumor dataset CCI and the public dataset ISIC-2018 demonstrate that the proposed method significantly outperforms U-Net, U-Net++, Swin-U-Net, Attention U-Net, and RA-U-Net in terms of performance, with average improvements of 6.67%, 5.53%, 10.18%, 4.78%, and 3.55%, respectively. The code is available at the following link: https://github.com/QingTianer/SAMP-Net.git.

SAMP-Net:一种分离注意和多层感知器的医学图像分割网络。
卷积神经网络(cnn)在计算机视觉领域取得了显著的成功,特别是在医学图像分割方面。U-Net是一种杰出的架构,标志着重大突破,并在实践中得到广泛应用。然而,其均匀的下采样策略和编码器中卷积层的简单堆叠限制了其在多个深度捕获丰富特征的能力,降低了其快速图像处理的效率。为了解决这些限制,本文提出了一种新的分割网络,该网络将注意力机制与多层感知器(mlp)相结合。该网络旨在逐步捕获和细化不同层次的特征。在低层,引入了主要特征守恒(PFC)块,以保留基本的空间细节,减少下采样过程中主要特征的损失。在中间层,紧凑注意块(CAB)通过多路径注意结构增强了特征交互,提高了网络捕获多种语义信息的能力。在高层,Shift MLP和Tokenized MLP块被合并。Shift MLP块沿着不同的轴移动特征通道,通过关注卷积特征的特定区域来增强局部特征建模。Tokenized MLP块将这些特征转换为抽象令牌,并利用MLP在潜在空间中对其表示进行建模,有效地减少了参数数量和计算复杂度,同时提高了分割性能。在结直肠癌肿瘤数据集CCI和公共数据集ISIC-2018上进行的实验表明,该方法的性能显著优于U-Net、U-Net++、swing -U-Net、Attention U-Net和RA-U-Net,平均分别提高6.67%、5.53%、10.18%、4.78%和3.55%。代码可从以下链接获得:https://github.com/QingTianer/SAMP-Net.git。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medical & Biological Engineering & Computing
Medical & Biological Engineering & Computing 医学-工程:生物医学
CiteScore
6.00
自引率
3.10%
发文量
249
审稿时长
3.5 months
期刊介绍: Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging. MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field. MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信