RGB-D Fusion Through Zero-Shot Fuzzy Membership Learning for Salient Object Detection

Sudipta Bhuyan;Aupendu Kar;Debashis Sen;Sankha Deb
{"title":"RGB-D Fusion Through Zero-Shot Fuzzy Membership Learning for Salient Object Detection","authors":"Sudipta Bhuyan;Aupendu Kar;Debashis Sen;Sankha Deb","doi":"10.1109/TAI.2024.3376640","DOIUrl":null,"url":null,"abstract":"Significant improvement has been achieved lately in color and depth data-based salient object detection (SOD) on images from varied datasets, which is mainly due to RGB-D fusion using modern machine learning techniques. However, little emphasis has been given recently on performing RGB-D fusion for SOD in the absence of ground truth data for training. This article proposes a zero-shot deep RGB-D fusion approach based on the novel concept of fuzzy membership learning, which does not require any data for training. The constituent salient object maps to be fused are represented using parametric fuzzy membership functions and the optimal parameter values are estimated through our zero-shot fuzzy membership learning (Z-FML) network. The optimal parameter values are used in a fuzzy inference system along with the constituent salient object maps to perform the fusion. A measure called the membership similarity measure (MSM) is proposed, and the Z-FML network is trained using it to devise a loss function that maximizes the similarity between the constituent salient object maps and the fused salient object map. The deduction of MSM and its properties are shown theoretically, and the gradients involved in the training of the Z-FML network are derived. Qualitative and quantitative evaluations using several datasets signify the effectiveness of our RGB-D fusion and our fusion-based RGB-D SOD in comparison with the state-of-the-art. We also empirically demonstrate the advantage of employing the novel MSM for training our Z-FML network.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10471194/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Significant improvement has been achieved lately in color and depth data-based salient object detection (SOD) on images from varied datasets, which is mainly due to RGB-D fusion using modern machine learning techniques. However, little emphasis has been given recently on performing RGB-D fusion for SOD in the absence of ground truth data for training. This article proposes a zero-shot deep RGB-D fusion approach based on the novel concept of fuzzy membership learning, which does not require any data for training. The constituent salient object maps to be fused are represented using parametric fuzzy membership functions and the optimal parameter values are estimated through our zero-shot fuzzy membership learning (Z-FML) network. The optimal parameter values are used in a fuzzy inference system along with the constituent salient object maps to perform the fusion. A measure called the membership similarity measure (MSM) is proposed, and the Z-FML network is trained using it to devise a loss function that maximizes the similarity between the constituent salient object maps and the fused salient object map. The deduction of MSM and its properties are shown theoretically, and the gradients involved in the training of the Z-FML network are derived. Qualitative and quantitative evaluations using several datasets signify the effectiveness of our RGB-D fusion and our fusion-based RGB-D SOD in comparison with the state-of-the-art. We also empirically demonstrate the advantage of employing the novel MSM for training our Z-FML network.
通过零镜头模糊成员学习实现 RGB-D 融合,以检测突出物体
最近,基于色彩和深度数据的突出物体检测(SOD)技术在各种数据集的图像上取得了显著进步,这主要归功于使用现代机器学习技术进行的 RGB-D 融合。然而,近来人们很少关注在没有地面实况数据训练的情况下为 SOD 进行 RGB-D 融合的问题。本文提出了一种基于模糊成员学习新概念的零镜头深度 RGB-D 融合方法,它不需要任何训练数据。要融合的组成突出对象映射使用参数模糊成员函数表示,并通过我们的零镜头模糊成员学习(Z-FML)网络估算最佳参数值。最佳参数值与组成突出对象图一起用于模糊推理系统,以执行融合。我们提出了一种名为 "成员相似性度量(MSM)"的度量方法,并利用它对 Z-FML 网络进行训练,从而设计出一种损失函数,使组成突出对象图与融合突出对象图之间的相似性最大化。从理论上说明了 MSM 的推导及其特性,并推导出了 Z-FML 网络训练所涉及的梯度。使用多个数据集进行的定性和定量评估表明,与最先进的技术相比,我们的 RGB-D 融合技术和基于融合的 RGB-D SOD 技术非常有效。我们还通过经验证明了采用新型 MSM 训练 Z-FML 网络的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.70
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信