{"title":"A co-training approach integrating CNN and Mamba for semi-supervised 3D medical image segmentation","authors":"Yun Jiang , Pengyu Chen , Bingxi Liu, Miaofeng Lu, Longgang Yang, Yuhang Li, Jinliang Su","doi":"10.1016/j.bspc.2025.108670","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate 3D medical image segmentation is vital for clinical applications but hindered by the scarcity of expert-annotated data, posing challenges for fully supervised models that require extensive labels. To address this issue, semi-supervised learning (SSL) has emerged as an effective solution by enabling models to learn from limited labeled data and achieve performance comparable to fully supervised learning, thereby significantly reducing the burden of manual annotation. Among various SSL strategies, deep co-training has demonstrated its effectiveness, yet current methods suffer from excessive information sharing between subnetworks or convergence during the training process. In this paper, we propose a semi-supervised 3D medical image segmentation method based on heterogeneous co-training, which combines the local feature perception capability of Convolutional Neural Networks (CNNs) with the efficient sequence modeling ability of the Mamba architecture. This integration enables more robust and complementary feature learning. To further enhance the utilization of unlabeled data, we introduce a Confidence-Aware Consistency(CAC) loss, which enforces consistency between model predictions while maintaining the learning capacity of individual networks. In addition, we propose a TriMix data augmentation strategy and incorporate a feature perturbation mechanism to increase the diversity of training samples and improve generalization. Extensive experiments were performed on three publicly available datasets: BraTS2019, left atrium (LA) and pancreas. The proposed method achieved superior performance compared to existing semi-supervised segmentation approaches, as measured by Dice score and HD95, demonstrating its effectiveness in enhancing the accuracy and robustness of 3D medical image segmentation with limited labeled data.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108670"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425011814","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate 3D medical image segmentation is vital for clinical applications but hindered by the scarcity of expert-annotated data, posing challenges for fully supervised models that require extensive labels. To address this issue, semi-supervised learning (SSL) has emerged as an effective solution by enabling models to learn from limited labeled data and achieve performance comparable to fully supervised learning, thereby significantly reducing the burden of manual annotation. Among various SSL strategies, deep co-training has demonstrated its effectiveness, yet current methods suffer from excessive information sharing between subnetworks or convergence during the training process. In this paper, we propose a semi-supervised 3D medical image segmentation method based on heterogeneous co-training, which combines the local feature perception capability of Convolutional Neural Networks (CNNs) with the efficient sequence modeling ability of the Mamba architecture. This integration enables more robust and complementary feature learning. To further enhance the utilization of unlabeled data, we introduce a Confidence-Aware Consistency(CAC) loss, which enforces consistency between model predictions while maintaining the learning capacity of individual networks. In addition, we propose a TriMix data augmentation strategy and incorporate a feature perturbation mechanism to increase the diversity of training samples and improve generalization. Extensive experiments were performed on three publicly available datasets: BraTS2019, left atrium (LA) and pancreas. The proposed method achieved superior performance compared to existing semi-supervised segmentation approaches, as measured by Dice score and HD95, demonstrating its effectiveness in enhancing the accuracy and robustness of 3D medical image segmentation with limited labeled data.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.