Select, Supplement and Focus for RGB-D Saliency Detection

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2020-06-01 DOI:10.1109/CVPR42600.2020.00353

Miao Zhang, Weisong Ren, Yongri Piao, Zhengkun Rong, Huchuan Lu

{"title":"Select, Supplement and Focus for RGB-D Saliency Detection","authors":"Miao Zhang, Weisong Ren, Yongri Piao, Zhengkun Rong, Huchuan Lu","doi":"10.1109/CVPR42600.2020.00353","DOIUrl":null,"url":null,"abstract":"Depth data containing a preponderance of discriminative power in location have been proven beneficial for accurate saliency prediction. However, RGB-D saliency detection methods are also negatively influenced by randomly distributed erroneous or missing regions on the depth map or along the object boundaries. This offers the possibility of achieving more effective inference by well designed models. In this paper, we propose a new framework for accurate RGB-D saliency detection taking account of local and global complementarities from two modalities. This is achieved by designing a complimentary interaction model discriminative enough to simultaneously select useful representation from RGB and depth data, and meanwhile to refine the object boundaries. Moreover, we proposed a compensation-aware loss to further process the information not being considered in the complimentary interaction model, leading to improvement of the generalization ability for challenging scenes. Experiments on six public datasets show that our method outperforms18state-of-the-art methods.","PeriodicalId":6715,"journal":{"name":"2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","volume":"27 1","pages":"3469-3478"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"147","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR42600.2020.00353","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 147

Abstract

Depth data containing a preponderance of discriminative power in location have been proven beneficial for accurate saliency prediction. However, RGB-D saliency detection methods are also negatively influenced by randomly distributed erroneous or missing regions on the depth map or along the object boundaries. This offers the possibility of achieving more effective inference by well designed models. In this paper, we propose a new framework for accurate RGB-D saliency detection taking account of local and global complementarities from two modalities. This is achieved by designing a complimentary interaction model discriminative enough to simultaneously select useful representation from RGB and depth data, and meanwhile to refine the object boundaries. Moreover, we proposed a compensation-aware loss to further process the information not being considered in the complimentary interaction model, leading to improvement of the generalization ability for challenging scenes. Experiments on six public datasets show that our method outperforms18state-of-the-art methods.

查看原文本刊更多论文

RGB-D显著性检测的选择、补充和聚焦

包含位置判别能力优势的深度数据已被证明有利于准确的显著性预测。然而，RGB-D显著性检测方法也会受到深度图上或物体边界上随机分布的错误或缺失区域的负面影响。这为通过设计良好的模型实现更有效的推理提供了可能性。在本文中，我们提出了一个精确的RGB-D显著性检测的新框架，考虑了两种模式的局部和全局互补性。这是通过设计一个互补的交互模型来实现的，该模型具有足够的判别性，可以同时从RGB和深度数据中选择有用的表示，同时可以细化对象边界。此外，我们提出了补偿感知损失来进一步处理互补交互模型中未考虑的信息，从而提高了对具有挑战性场景的泛化能力。在六个公共数据集上的实验表明，我们的方法优于18种最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量