A co-training approach integrating CNN and Mamba for semi-supervised 3D medical image segmentation

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biomedical Signal Processing and Control Pub Date : 2025-10-04 DOI:10.1016/j.bspc.2025.108670

Yun Jiang , Pengyu Chen , Bingxi Liu, Miaofeng Lu, Longgang Yang, Yuhang Li, Jinliang Su

{"title":"A co-training approach integrating CNN and Mamba for semi-supervised 3D medical image segmentation","authors":"Yun Jiang , Pengyu Chen , Bingxi Liu, Miaofeng Lu, Longgang Yang, Yuhang Li, Jinliang Su","doi":"10.1016/j.bspc.2025.108670","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate 3D medical image segmentation is vital for clinical applications but hindered by the scarcity of expert-annotated data, posing challenges for fully supervised models that require extensive labels. To address this issue, semi-supervised learning (SSL) has emerged as an effective solution by enabling models to learn from limited labeled data and achieve performance comparable to fully supervised learning, thereby significantly reducing the burden of manual annotation. Among various SSL strategies, deep co-training has demonstrated its effectiveness, yet current methods suffer from excessive information sharing between subnetworks or convergence during the training process. In this paper, we propose a semi-supervised 3D medical image segmentation method based on heterogeneous co-training, which combines the local feature perception capability of Convolutional Neural Networks (CNNs) with the efficient sequence modeling ability of the Mamba architecture. This integration enables more robust and complementary feature learning. To further enhance the utilization of unlabeled data, we introduce a Confidence-Aware Consistency(CAC) loss, which enforces consistency between model predictions while maintaining the learning capacity of individual networks. In addition, we propose a TriMix data augmentation strategy and incorporate a feature perturbation mechanism to increase the diversity of training samples and improve generalization. Extensive experiments were performed on three publicly available datasets: BraTS2019, left atrium (LA) and pancreas. The proposed method achieved superior performance compared to existing semi-supervised segmentation approaches, as measured by Dice score and HD95, demonstrating its effectiveness in enhancing the accuracy and robustness of 3D medical image segmentation with limited labeled data.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108670"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425011814","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate 3D medical image segmentation is vital for clinical applications but hindered by the scarcity of expert-annotated data, posing challenges for fully supervised models that require extensive labels. To address this issue, semi-supervised learning (SSL) has emerged as an effective solution by enabling models to learn from limited labeled data and achieve performance comparable to fully supervised learning, thereby significantly reducing the burden of manual annotation. Among various SSL strategies, deep co-training has demonstrated its effectiveness, yet current methods suffer from excessive information sharing between subnetworks or convergence during the training process. In this paper, we propose a semi-supervised 3D medical image segmentation method based on heterogeneous co-training, which combines the local feature perception capability of Convolutional Neural Networks (CNNs) with the efficient sequence modeling ability of the Mamba architecture. This integration enables more robust and complementary feature learning. To further enhance the utilization of unlabeled data, we introduce a Confidence-Aware Consistency(CAC) loss, which enforces consistency between model predictions while maintaining the learning capacity of individual networks. In addition, we propose a TriMix data augmentation strategy and incorporate a feature perturbation mechanism to increase the diversity of training samples and improve generalization. Extensive experiments were performed on three publicly available datasets: BraTS2019, left atrium (LA) and pancreas. The proposed method achieved superior performance compared to existing semi-supervised segmentation approaches, as measured by Dice score and HD95, demonstrating its effectiveness in enhancing the accuracy and robustness of 3D medical image segmentation with limited labeled data.

查看原文本刊更多论文

一种集成CNN和Mamba的半监督三维医学图像分割的协同训练方法

准确的3D医学图像分割对于临床应用至关重要，但由于缺乏专家注释的数据，这对需要大量标签的完全监督模型提出了挑战。为了解决这个问题，半监督学习（SSL）已经成为一种有效的解决方案，它使模型能够从有限的标记数据中学习，并实现与完全监督学习相当的性能，从而大大减少了手动注释的负担。在各种SSL策略中，深度协同训练已经证明了其有效性，但目前的方法存在子网络间信息共享过多或训练过程收敛的问题。本文提出了一种基于异构协同训练的半监督三维医学图像分割方法，该方法将卷积神经网络（cnn）的局部特征感知能力与Mamba架构的高效序列建模能力相结合。这种集成支持更健壮和互补的特性学习。为了进一步提高对未标记数据的利用，我们引入了信心感知一致性（CAC）损失，它在保持单个网络学习能力的同时强制模型预测之间的一致性。此外，我们提出了一种TriMix数据增强策略，并结合特征扰动机制来增加训练样本的多样性和提高泛化。在三个公开可用的数据集上进行了广泛的实验：BraTS2019、左心房（LA）和胰腺。通过Dice评分和HD95来衡量，该方法与现有的半监督分割方法相比取得了更好的性能，证明了其在有限标记数据下提高三维医学图像分割的准确性和鲁棒性的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.