Title: A fused attention-based hybrid model for semi-supervised medical image segmentation
Authors: Masum Shah Junayed, Sheida Nabavi
DOI: 10.1016/j.bea.2025.100203
Journal: Biomedical engineering advances, Volume 11, Article 100203
Publication date: 2026-06-01 (Epub 2025-12-22)
URL: https://www.sciencedirect.com/science/article/pii/S2667099225000593
Citations: 0
Abstract
Addressing the significant challenge of extensive data annotation in medical image segmentation, semi-supervised techniques have become an effective alternative for leveraging both labeled and unlabeled data. However, accurately capturing complex local structures and global contextual information remains difficult, often leading to inconsistent segmentation results. To mitigate these limitations, we propose a fused transformer-based hybrid architecture for semi-supervised medical image segmentation. The model integrates a parallel backbone comprising Deformation Convolution Blocks (DCB) and Fused Transformer Blocks (FTB), followed by a segmentation head for precise mask prediction. The DCB enhances spatial adaptability for irregular and artifact-rich regions, while the FTB – with its fused attention mechanism – captures long-range dependencies efficiently. A Ghost Layer Perceptron (GLP) embedded within the transformer further improves computational efficiency without compromising representation quality. In addition, the incorporation of a consistency loss and unsupervised contrastive learning facilitates robust feature discrimination on unlabeled data, improving generalization across modalities. Extensive experiments on four publicly available medical imaging datasets demonstrate that the proposed model achieves comparable or better accuracy than recent state-of-the-art methods, while requiring substantially fewer parameters and lower computational cost, underscoring its practicality for real-world clinical applications.
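The abstract names two unsupervised objectives — a consistency loss and unsupervised contrastive learning — without giving their exact formulations. As a minimal sketch of how such a pair of objectives is commonly implemented, assuming an MSE-based consistency term between predictions on two augmented views and an InfoNCE-style contrastive term over feature embeddings (both assumptions, not the paper's verified design), in NumPy:

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax over class logits."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def consistency_loss(logits_a, logits_b):
    """MSE between class probabilities predicted for two augmented
    views of the same unlabeled image (a common consistency term)."""
    return np.mean((softmax(logits_a) - softmax(logits_b)) ** 2)

def info_nce(anchors, positives, temperature=0.1):
    """InfoNCE-style contrastive loss: row i of `anchors` and row i of
    `positives` form a positive pair; all other rows act as negatives."""
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature            # (N, N) cosine similarities
    idx = np.arange(len(a))                   # positives sit on the diagonal
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(log_prob[idx, idx])
```

In a semi-supervised loop, the supervised segmentation loss on labeled images would typically be summed with weighted versions of these two terms computed on the unlabeled batch; the weighting schedule is not specified in the abstract.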