RGB-D Fusion Through Zero-Shot Fuzzy Membership Learning for Salient Object Detection

IEEE transactions on artificial intelligence Pub Date : 2024-03-12 DOI:10.1109/TAI.2024.3376640

Sudipta Bhuyan;Aupendu Kar;Debashis Sen;Sankha Deb

{"title":"RGB-D Fusion Through Zero-Shot Fuzzy Membership Learning for Salient Object Detection","authors":"Sudipta Bhuyan;Aupendu Kar;Debashis Sen;Sankha Deb","doi":"10.1109/TAI.2024.3376640","DOIUrl":null,"url":null,"abstract":"Significant improvement has been achieved lately in color and depth data-based salient object detection (SOD) on images from varied datasets, which is mainly due to RGB-D fusion using modern machine learning techniques. However, little emphasis has been given recently on performing RGB-D fusion for SOD in the absence of ground truth data for training. This article proposes a zero-shot deep RGB-D fusion approach based on the novel concept of fuzzy membership learning, which does not require any data for training. The constituent salient object maps to be fused are represented using parametric fuzzy membership functions and the optimal parameter values are estimated through our zero-shot fuzzy membership learning (Z-FML) network. The optimal parameter values are used in a fuzzy inference system along with the constituent salient object maps to perform the fusion. A measure called the membership similarity measure (MSM) is proposed, and the Z-FML network is trained using it to devise a loss function that maximizes the similarity between the constituent salient object maps and the fused salient object map. The deduction of MSM and its properties are shown theoretically, and the gradients involved in the training of the Z-FML network are derived. Qualitative and quantitative evaluations using several datasets signify the effectiveness of our RGB-D fusion and our fusion-based RGB-D SOD in comparison with the state-of-the-art. We also empirically demonstrate the advantage of employing the novel MSM for training our Z-FML network.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10471194/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Significant improvement has been achieved lately in color and depth data-based salient object detection (SOD) on images from varied datasets, which is mainly due to RGB-D fusion using modern machine learning techniques. However, little emphasis has been given recently on performing RGB-D fusion for SOD in the absence of ground truth data for training. This article proposes a zero-shot deep RGB-D fusion approach based on the novel concept of fuzzy membership learning, which does not require any data for training. The constituent salient object maps to be fused are represented using parametric fuzzy membership functions and the optimal parameter values are estimated through our zero-shot fuzzy membership learning (Z-FML) network. The optimal parameter values are used in a fuzzy inference system along with the constituent salient object maps to perform the fusion. A measure called the membership similarity measure (MSM) is proposed, and the Z-FML network is trained using it to devise a loss function that maximizes the similarity between the constituent salient object maps and the fused salient object map. The deduction of MSM and its properties are shown theoretically, and the gradients involved in the training of the Z-FML network are derived. Qualitative and quantitative evaluations using several datasets signify the effectiveness of our RGB-D fusion and our fusion-based RGB-D SOD in comparison with the state-of-the-art. We also empirically demonstrate the advantage of employing the novel MSM for training our Z-FML network.

查看原文本刊更多论文

通过零镜头模糊成员学习实现 RGB-D 融合，以检测突出物体

最近，基于色彩和深度数据的突出物体检测（SOD）技术在各种数据集的图像上取得了显著进步，这主要归功于使用现代机器学习技术进行的 RGB-D 融合。然而，近来人们很少关注在没有地面实况数据训练的情况下为 SOD 进行 RGB-D 融合的问题。本文提出了一种基于模糊成员学习新概念的零镜头深度 RGB-D 融合方法，它不需要任何训练数据。要融合的组成突出对象映射使用参数模糊成员函数表示，并通过我们的零镜头模糊成员学习（Z-FML）网络估算最佳参数值。最佳参数值与组成突出对象图一起用于模糊推理系统，以执行融合。我们提出了一种名为 "成员相似性度量（MSM）"的度量方法，并利用它对 Z-FML 网络进行训练，从而设计出一种损失函数，使组成突出对象图与融合突出对象图之间的相似性最大化。从理论上说明了 MSM 的推导及其特性，并推导出了 Z-FML 网络训练所涉及的梯度。使用多个数据集进行的定性和定量评估表明，与最先进的技术相比，我们的 RGB-D 融合技术和基于融合的 RGB-D SOD 技术非常有效。我们还通过经验证明了采用新型 MSM 训练 Z-FML 网络的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on artificial intelligence

CiteScore

7.70

自引率

0.00%

发文量