Sudipta Bhuyan;Aupendu Kar;Debashis Sen;Sankha Deb
{"title":"RGB-D Fusion Through Zero-Shot Fuzzy Membership Learning for Salient Object Detection","authors":"Sudipta Bhuyan;Aupendu Kar;Debashis Sen;Sankha Deb","doi":"10.1109/TAI.2024.3376640","DOIUrl":null,"url":null,"abstract":"Significant improvement has been achieved lately in color and depth data-based salient object detection (SOD) on images from varied datasets, which is mainly due to RGB-D fusion using modern machine learning techniques. However, little emphasis has been given recently on performing RGB-D fusion for SOD in the absence of ground truth data for training. This article proposes a zero-shot deep RGB-D fusion approach based on the novel concept of fuzzy membership learning, which does not require any data for training. The constituent salient object maps to be fused are represented using parametric fuzzy membership functions and the optimal parameter values are estimated through our zero-shot fuzzy membership learning (Z-FML) network. The optimal parameter values are used in a fuzzy inference system along with the constituent salient object maps to perform the fusion. A measure called the membership similarity measure (MSM) is proposed, and the Z-FML network is trained using it to devise a loss function that maximizes the similarity between the constituent salient object maps and the fused salient object map. The deduction of MSM and its properties are shown theoretically, and the gradients involved in the training of the Z-FML network are derived. Qualitative and quantitative evaluations using several datasets signify the effectiveness of our RGB-D fusion and our fusion-based RGB-D SOD in comparison with the state-of-the-art. We also empirically demonstrate the advantage of employing the novel MSM for training our Z-FML network.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10471194/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Significant improvement has been achieved lately in color and depth data-based salient object detection (SOD) on images from varied datasets, which is mainly due to RGB-D fusion using modern machine learning techniques. However, little emphasis has been given recently on performing RGB-D fusion for SOD in the absence of ground truth data for training. This article proposes a zero-shot deep RGB-D fusion approach based on the novel concept of fuzzy membership learning, which does not require any data for training. The constituent salient object maps to be fused are represented using parametric fuzzy membership functions and the optimal parameter values are estimated through our zero-shot fuzzy membership learning (Z-FML) network. The optimal parameter values are used in a fuzzy inference system along with the constituent salient object maps to perform the fusion. A measure called the membership similarity measure (MSM) is proposed, and the Z-FML network is trained using it to devise a loss function that maximizes the similarity between the constituent salient object maps and the fused salient object map. The deduction of MSM and its properties are shown theoretically, and the gradients involved in the training of the Z-FML network are derived. Qualitative and quantitative evaluations using several datasets signify the effectiveness of our RGB-D fusion and our fusion-based RGB-D SOD in comparison with the state-of-the-art. We also empirically demonstrate the advantage of employing the novel MSM for training our Z-FML network.