TripleNet: Exploiting Complementary Features and Pseudo-Labels for Semi-Supervised Salient Object Detection

IF 13.7

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2025-08-27 DOI:10.1109/TIP.2025.3601334

Liyuan Chen;Ming-Hsuan Yang;Jian Pu;Zhonglong Zheng

{"title":"TripleNet: Exploiting Complementary Features and Pseudo-Labels for Semi-Supervised Salient Object Detection","authors":"Liyuan Chen;Ming-Hsuan Yang;Jian Pu;Zhonglong Zheng","doi":"10.1109/TIP.2025.3601334","DOIUrl":null,"url":null,"abstract":"Due to the limited output categories, semi-supervised salient object detection faces challenges in adapting conventional semi-supervised strategies. To address this limitation, we propose a multi-branch architecture that extracts complementary features from labeled data. Specifically, we introduce TripleNet, a three-branch network architecture designed for contour, content, and holistic saliency prediction. The supervision signals for the contour and content branches are derived by decomposing the limited ground truths. After training on the labeled data, the model produces pseudo-labels for unlabeled images, including contour, content, and salient objects. By leveraging the complementarity between the contour and content branches, we construct coupled pseudo-saliency labels by integrating the pseudo-contour and pseudo-content labels, which differ from the model-inferred pseudo-saliency labels. We further develop an enhanced pseudo-labeling mechanism that generates enhanced pseudo-saliency labels by combining reliable regions from both pseudo-saliency labels. Moreover, we incorporate a partial binary cross-entropy loss function to guide the learning of the saliency branch to focus on effective regions within the enhanced pseudo-saliency labels, which are identified through our adaptive thresholding approach. Extensive experiments demonstrate that the proposed method achieves state-of-the-art performance using only 329 labeled training images.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"5628-5641"},"PeriodicalIF":13.7000,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11142954/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Due to the limited output categories, semi-supervised salient object detection faces challenges in adapting conventional semi-supervised strategies. To address this limitation, we propose a multi-branch architecture that extracts complementary features from labeled data. Specifically, we introduce TripleNet, a three-branch network architecture designed for contour, content, and holistic saliency prediction. The supervision signals for the contour and content branches are derived by decomposing the limited ground truths. After training on the labeled data, the model produces pseudo-labels for unlabeled images, including contour, content, and salient objects. By leveraging the complementarity between the contour and content branches, we construct coupled pseudo-saliency labels by integrating the pseudo-contour and pseudo-content labels, which differ from the model-inferred pseudo-saliency labels. We further develop an enhanced pseudo-labeling mechanism that generates enhanced pseudo-saliency labels by combining reliable regions from both pseudo-saliency labels. Moreover, we incorporate a partial binary cross-entropy loss function to guide the learning of the saliency branch to focus on effective regions within the enhanced pseudo-saliency labels, which are identified through our adaptive thresholding approach. Extensive experiments demonstrate that the proposed method achieves state-of-the-art performance using only 329 labeled training images.

查看原文本刊更多论文

TripleNet：利用互补特征和伪标签进行半监督显著目标检测

由于输出类别有限，半监督显著目标检测在适应传统半监督策略方面面临挑战。为了解决这一限制，我们提出了一种从标记数据中提取互补特征的多分支架构。具体来说，我们介绍了TripleNet，这是一种三分支网络架构，用于轮廓、内容和整体显著性预测。通过对有限的基础真值进行分解，得到轮廓分支和内容分支的监督信号。在对标记的数据进行训练后，模型对未标记的图像产生伪标签，包括轮廓、内容和显著对象。利用轮廓和内容分支之间的互补性，我们通过整合伪轮廓和伪内容标签来构建耦合的伪显著性标签，这与模型推断的伪显著性标签不同。我们进一步开发了一种增强的伪标记机制，通过组合来自两个伪显著性标签的可靠区域来生成增强的伪显著性标签。此外，我们结合了一个部分二元交叉熵损失函数来指导显著性分支的学习，以关注增强的伪显著性标签内的有效区域，这些区域是通过我们的自适应阈值方法识别的。大量的实验表明，该方法仅使用329个标记训练图像就能达到最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量