Baosheng Zou , Ying Han , Zongguang Zhou , Kang Li , Guotai Wang
{"title":"UTMT:基于不确定性引导Twin Mean Teacher的手术内镜图像半监督分割","authors":"Baosheng Zou , Ying Han , Zongguang Zhou , Kang Li , Guotai Wang","doi":"10.1016/j.bspc.2025.108735","DOIUrl":null,"url":null,"abstract":"<div><div>Semantic segmentation of surgical endoscopic images plays an important role in surgical skill analysis and guidance. Deep learning based on fully supervised learning has achieved remarkable performance in this task, but it relies on a large amount of training images with pixel-level annotations that are time-consuming and difficult to collect. To reduce the annotation cost, we propose a novel semi-supervised segmentation framework based on Uncertainty guided Twin Mean Teacher (UTMT) for surgical endoscopic image segmentation. UTMT has two parallel teacher–student structures to deal with unannotated training images, where each student is supervised by pseudo-labels obtained by not only its mean teacher, but also its fellow student. The combination of mean teacher and fellow student can reduce the inherent bias of a single model and improve the quality of pseudo-labels. In addition, considering that the pseudo-label may be noisy, we propose an uncertainty-based correction method to emphasize high-confidence pseudo-labels obtained from different networks and suppress the unreliable parts for more robust learning. Experimental results on two public surgical endoscopic image datasets demonstrated that UTMT significantly improved the segmentation performance when only 1% or 5% of the training images were labeled, and it outperformed six state-of-the-art semi-supervised segmentation methods. Furthermore, compared with fully supervised learning, UTMT achieved a similar performance while reducing the annotation cost by 90% on the two datasets.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108735"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UTMT: Semi-supervised segmentation of surgical endoscopic images based on Uncertainty guided Twin Mean Teacher\",\"authors\":\"Baosheng Zou , Ying Han , Zongguang Zhou , Kang Li , Guotai Wang\",\"doi\":\"10.1016/j.bspc.2025.108735\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Semantic segmentation of surgical endoscopic images plays an important role in surgical skill analysis and guidance. Deep learning based on fully supervised learning has achieved remarkable performance in this task, but it relies on a large amount of training images with pixel-level annotations that are time-consuming and difficult to collect. To reduce the annotation cost, we propose a novel semi-supervised segmentation framework based on Uncertainty guided Twin Mean Teacher (UTMT) for surgical endoscopic image segmentation. UTMT has two parallel teacher–student structures to deal with unannotated training images, where each student is supervised by pseudo-labels obtained by not only its mean teacher, but also its fellow student. The combination of mean teacher and fellow student can reduce the inherent bias of a single model and improve the quality of pseudo-labels. In addition, considering that the pseudo-label may be noisy, we propose an uncertainty-based correction method to emphasize high-confidence pseudo-labels obtained from different networks and suppress the unreliable parts for more robust learning. Experimental results on two public surgical endoscopic image datasets demonstrated that UTMT significantly improved the segmentation performance when only 1% or 5% of the training images were labeled, and it outperformed six state-of-the-art semi-supervised segmentation methods. Furthermore, compared with fully supervised learning, UTMT achieved a similar performance while reducing the annotation cost by 90% on the two datasets.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108735\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425012467\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425012467","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
UTMT: Semi-supervised segmentation of surgical endoscopic images based on Uncertainty guided Twin Mean Teacher
Semantic segmentation of surgical endoscopic images plays an important role in surgical skill analysis and guidance. Deep learning based on fully supervised learning has achieved remarkable performance in this task, but it relies on a large amount of training images with pixel-level annotations that are time-consuming and difficult to collect. To reduce the annotation cost, we propose a novel semi-supervised segmentation framework based on Uncertainty guided Twin Mean Teacher (UTMT) for surgical endoscopic image segmentation. UTMT has two parallel teacher–student structures to deal with unannotated training images, where each student is supervised by pseudo-labels obtained by not only its mean teacher, but also its fellow student. The combination of mean teacher and fellow student can reduce the inherent bias of a single model and improve the quality of pseudo-labels. In addition, considering that the pseudo-label may be noisy, we propose an uncertainty-based correction method to emphasize high-confidence pseudo-labels obtained from different networks and suppress the unreliable parts for more robust learning. Experimental results on two public surgical endoscopic image datasets demonstrated that UTMT significantly improved the segmentation performance when only 1% or 5% of the training images were labeled, and it outperformed six state-of-the-art semi-supervised segmentation methods. Furthermore, compared with fully supervised learning, UTMT achieved a similar performance while reducing the annotation cost by 90% on the two datasets.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.