基于多尺度卷积关注的高效双线解码器网络

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biomedical Signal Processing and Control Pub Date : 2025-09-13 DOI:10.1016/j.bspc.2025.108611

Riad Hassan , M. Rubaiyat Hossain Mondal , Sheikh Iqbal Ahamed , Fahad Mostafa , Md Mostafijur Rahman

{"title":"基于多尺度卷积关注的高效双线解码器网络","authors":"Riad Hassan , M. Rubaiyat Hossain Mondal , Sheikh Iqbal Ahamed , Fahad Mostafa , Md Mostafijur Rahman","doi":"10.1016/j.bspc.2025.108611","DOIUrl":null,"url":null,"abstract":"<div><div>Proper segmentation of organs-at-risk is important for radiation therapy, surgical planning, and diagnostic decision-making in medical image analysis. While deep learning-based segmentation architectures have made significant progress, they often fail to balance segmentation accuracy with computational efficiency. Most of the current state-of-the-art (SOTA) methods either prioritize performance at the cost of high computational complexity or compromise accuracy for efficiency. This paper addresses this gap by introducing an efficient dual-line decoder segmentation network (EDLDNet). In addition to noise-free decoder, the proposed method features a noisy decoder, which learns to incorporate structured perturbation at training time for better model robustness, yet at inference time only the noise-free decoder is executed, leading to lower computational cost. Multi-Scale Convolutional Attention Modules (MSCAMs), Attention Gates (AGs), and Up-Convolution Blocks (UCBs) are further utilized to optimize feature representation and boost segmentation performance. By leveraging multi-scale segmentation masks from both decoders, a mutation-based loss function is also utilized to enhance the model’s generalization. The proposed method outperforms SOTA segmentation architectures on four publicly available medical imaging datasets (Synapse, ACDC, SegThor, and LCTSC). EDLDNet achieves SOTA performance with an 84.00% Dice score on the Synapse dataset, surpassing baseline model like UNet by 13.89% in Dice score while significantly reducing Multiply-Accumulate Operations (MACs) by 89.7%. Compared to recent approaches like EMCAD, the proposed EDLDNet not only achieves higher Dice score but also maintains comparable computational efficiency. The outstanding performance across diverse datasets establishes EDLDNet’s outstanding generalization, computational and robustness efficiency. The source code, pre-processed data, and pretrained weights are available at <span><span>https://github.com/riadhassan/EDLDNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108611"},"PeriodicalIF":4.9000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An efficient dual-line decoder network with multi-scale convolutional attention for multi-organ segmentation\",\"authors\":\"Riad Hassan , M. Rubaiyat Hossain Mondal , Sheikh Iqbal Ahamed , Fahad Mostafa , Md Mostafijur Rahman\",\"doi\":\"10.1016/j.bspc.2025.108611\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Proper segmentation of organs-at-risk is important for radiation therapy, surgical planning, and diagnostic decision-making in medical image analysis. While deep learning-based segmentation architectures have made significant progress, they often fail to balance segmentation accuracy with computational efficiency. Most of the current state-of-the-art (SOTA) methods either prioritize performance at the cost of high computational complexity or compromise accuracy for efficiency. This paper addresses this gap by introducing an efficient dual-line decoder segmentation network (EDLDNet). In addition to noise-free decoder, the proposed method features a noisy decoder, which learns to incorporate structured perturbation at training time for better model robustness, yet at inference time only the noise-free decoder is executed, leading to lower computational cost. Multi-Scale Convolutional Attention Modules (MSCAMs), Attention Gates (AGs), and Up-Convolution Blocks (UCBs) are further utilized to optimize feature representation and boost segmentation performance. By leveraging multi-scale segmentation masks from both decoders, a mutation-based loss function is also utilized to enhance the model’s generalization. The proposed method outperforms SOTA segmentation architectures on four publicly available medical imaging datasets (Synapse, ACDC, SegThor, and LCTSC). EDLDNet achieves SOTA performance with an 84.00% Dice score on the Synapse dataset, surpassing baseline model like UNet by 13.89% in Dice score while significantly reducing Multiply-Accumulate Operations (MACs) by 89.7%. Compared to recent approaches like EMCAD, the proposed EDLDNet not only achieves higher Dice score but also maintains comparable computational efficiency. The outstanding performance across diverse datasets establishes EDLDNet’s outstanding generalization, computational and robustness efficiency. The source code, pre-processed data, and pretrained weights are available at <span><span>https://github.com/riadhassan/EDLDNet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108611\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S174680942501122X\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S174680942501122X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

正确分割危险器官对于放射治疗、手术计划和医学影像分析中的诊断决策是重要的。虽然基于深度学习的分割架构取得了重大进展，但它们往往无法平衡分割精度和计算效率。大多数当前最先进的（SOTA）方法要么以高计算复杂性为代价优先考虑性能，要么为了效率而牺牲准确性。本文通过引入一种高效的双线解码器分割网络（EDLDNet）来解决这一问题。除了无噪声解码器之外，该方法还具有一个噪声解码器，该解码器在训练时学习纳入结构化扰动以获得更好的模型鲁棒性，而在推理时仅执行无噪声解码器，从而降低了计算成本。进一步利用多尺度卷积注意模块（MSCAMs）、注意门（AGs）和上卷积块（ucb）来优化特征表示和提高分割性能。通过利用来自两个解码器的多尺度分割掩码，还利用基于突变的损失函数来增强模型的泛化。该方法在四个公开可用的医学成像数据集（Synapse、ACDC、SegThor和LCTSC）上优于SOTA分割架构。EDLDNet在Synapse数据集上的Dice得分为84.00%，达到SOTA性能，比UNet等基准模型的Dice得分高出13.89%，同时显著减少了乘法累积操作（mac） 89.7%。与EMCAD等最新方法相比，提出的EDLDNet不仅获得了更高的Dice分数，而且保持了相当的计算效率。EDLDNet在不同数据集上的出色性能，使其具有出色的泛化、计算和鲁棒性效率。源代码、预处理数据和预训练的权重可从https://github.com/riadhassan/EDLDNet获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An efficient dual-line decoder network with multi-scale convolutional attention for multi-organ segmentation

Proper segmentation of organs-at-risk is important for radiation therapy, surgical planning, and diagnostic decision-making in medical image analysis. While deep learning-based segmentation architectures have made significant progress, they often fail to balance segmentation accuracy with computational efficiency. Most of the current state-of-the-art (SOTA) methods either prioritize performance at the cost of high computational complexity or compromise accuracy for efficiency. This paper addresses this gap by introducing an efficient dual-line decoder segmentation network (EDLDNet). In addition to noise-free decoder, the proposed method features a noisy decoder, which learns to incorporate structured perturbation at training time for better model robustness, yet at inference time only the noise-free decoder is executed, leading to lower computational cost. Multi-Scale Convolutional Attention Modules (MSCAMs), Attention Gates (AGs), and Up-Convolution Blocks (UCBs) are further utilized to optimize feature representation and boost segmentation performance. By leveraging multi-scale segmentation masks from both decoders, a mutation-based loss function is also utilized to enhance the model’s generalization. The proposed method outperforms SOTA segmentation architectures on four publicly available medical imaging datasets (Synapse, ACDC, SegThor, and LCTSC). EDLDNet achieves SOTA performance with an 84.00% Dice score on the Synapse dataset, surpassing baseline model like UNet by 13.89% in Dice score while significantly reducing Multiply-Accumulate Operations (MACs) by 89.7%. Compared to recent approaches like EMCAD, the proposed EDLDNet not only achieves higher Dice score but also maintains comparable computational efficiency. The outstanding performance across diverse datasets establishes EDLDNet’s outstanding generalization, computational and robustness efficiency. The source code, pre-processed data, and pretrained weights are available at https://github.com/riadhassan/EDLDNet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.