DGEAHorNet: high-order spatial interaction network with dual cross global efficient attention for medical image segmentation.

Impact factor: 2.0 · CAS Tier 4 (Medicine) · JCR Q3, Engineering, Biomedical
Haixin Peng, Xinjun An, Xue Chen, Zhenxiang Chen
Journal: Physical and Engineering Sciences in Medicine
DOI: 10.1007/s13246-025-01583-5
Published: 2025-07-24 (Journal Article)
Citations: 0

Abstract

Medical image segmentation is a complex and challenging task that aims to accurately segment various structures or abnormal regions in medical images. However, obtaining accurate segmentation results is difficult because of the great uncertainty in the shape, location, and scale of the target region. To address these challenges, we propose a high-order spatial interaction framework with dual cross global efficient attention (DGEAHorNet), which employs a neural network architecture based on recursive gated convolution to adequately extract multi-scale contextual information from images. Specifically, a Dual Cross-Attention (DCA) module is added to the skip connections to effectively blend multi-stage encoder features and narrow the semantic gap. In the bottleneck stage, a global channel spatial attention module (GCSAM) is used to extract global image information. To obtain a better feature representation, we feed the output of the GCSAM into a multi-branch dense layer (SENetV2) for excitation. Furthermore, we adopt the Depthwise Over-parameterized Convolutional Layer (DO-Conv) in place of the standard convolutional layers in the input and output stages of our network, and add Efficient Attention (EA) to reduce computational complexity and enhance the model's performance. To evaluate the effectiveness of DGEAHorNet, we conduct comprehensive experiments on four publicly available datasets, achieving Dice similarity coefficients of 0.9320, 0.9337, 0.9312, and 0.7799 on ISIC2018, ISIC2017, CVC-ClinicDB, and HRF, respectively. Our results show that DGEAHorNet performs better than existing advanced methods. The code is publicly available at https://github.com/penghaixin/mymodel .
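The Efficient Attention (EA) module mentioned above refers to the general linear-complexity attention mechanism, which avoids the quadratic cost of dot-product attention by normalizing keys over positions and queries over channels, then aggregating a small global context matrix first. A minimal NumPy sketch of that general mechanism follows (illustrative only, not the authors' exact implementation; function and variable names are assumptions):

```python
import numpy as np

def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def efficient_attention(Q, K, V):
    """Linear-complexity attention.

    Q, K: (n, d_k) queries/keys over n spatial positions.
    V:    (n, d_v) values.
    Dot-product attention costs O(n^2) in n; here we instead
    normalize K over positions and Q over channels, compute the
    (d_k, d_v) global context K^T V first, and apply it to Q,
    giving O(n * d_k * d_v) time and memory.
    """
    K_n = softmax(K, axis=0)   # each key channel sums to 1 over positions
    Q_n = softmax(Q, axis=1)   # each query position sums to 1 over channels
    context = K_n.T @ V        # (d_k, d_v) global context summary
    return Q_n @ context       # (n, d_v) attended output
```

Because both softmaxes produce convex weights, every output row is a convex combination of the value rows, so outputs stay within the per-column range of `V`. In a segmentation network such a module would typically be applied per attention head to flattened feature maps.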

Source journal metrics: CiteScore 8.40 · self-citation rate 4.50% · articles per year: 110