Channel Attention Based Iterative Residual Learning for Depth Map Super-Resolution

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2020-06-01 DOI:10.1109/cvpr42600.2020.00567

Xibin Song, Yuchao Dai, Dingfu Zhou, Liu Liu, Wei Li, H. Li, Ruigang Yang

{"title":"Channel Attention Based Iterative Residual Learning for Depth Map Super-Resolution","authors":"Xibin Song, Yuchao Dai, Dingfu Zhou, Liu Liu, Wei Li, H. Li, Ruigang Yang","doi":"10.1109/cvpr42600.2020.00567","DOIUrl":null,"url":null,"abstract":"Despite the remarkable progresses made in deep learning based depth map super-resolution (DSR), how to tackle real-world degradation in low-resolution (LR) depth maps remains a major challenge. Existing DSR model is generally trained and tested on synthetic dataset, which is very different from what would get from a real depth sensor. In this paper, we argue that DSR models trained under this setting are restrictive and not effective in dealing with realworld DSR tasks. We make two contributions in tackling real-world degradation of different depth sensors. First, we propose to classify the generation of LR depth maps into two types: non-linear downsampling with noise and interval downsampling, for which DSR models are learned correspondingly. Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high-frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high-frequency components; 3) A multi-stage fusion module to effectively reexploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss. Extensive experiments on benchmarking datasets demonstrate the superiority of our method over current state-of-the-art DSR methods.","PeriodicalId":6715,"journal":{"name":"2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","volume":"257 1","pages":"5630-5639"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"63","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cvpr42600.2020.00567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 63

Abstract

Despite the remarkable progresses made in deep learning based depth map super-resolution (DSR), how to tackle real-world degradation in low-resolution (LR) depth maps remains a major challenge. Existing DSR model is generally trained and tested on synthetic dataset, which is very different from what would get from a real depth sensor. In this paper, we argue that DSR models trained under this setting are restrictive and not effective in dealing with realworld DSR tasks. We make two contributions in tackling real-world degradation of different depth sensors. First, we propose to classify the generation of LR depth maps into two types: non-linear downsampling with noise and interval downsampling, for which DSR models are learned correspondingly. Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high-frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high-frequency components; 3) A multi-stage fusion module to effectively reexploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss. Extensive experiments on benchmarking datasets demonstrate the superiority of our method over current state-of-the-art DSR methods.

查看原文本刊更多论文

基于通道注意的深度图超分辨率迭代残差学习

尽管在基于深度学习的深度图超分辨率(DSR)方面取得了显著进展，但如何解决低分辨率(LR)深度图在现实世界中的退化问题仍然是一个主要挑战。现有的DSR模型一般都是在合成数据集上进行训练和测试的，这与真实深度传感器得到的结果有很大的差异。在本文中，我们认为在这种设置下训练的DSR模型是限制性的，并且不能有效地处理现实世界的DSR任务。在解决不同深度传感器的实际退化问题方面，我们做出了两个贡献。首先，我们提出将LR深度图的生成分为两种类型:带噪声的非线性下采样和区间下采样，并相应地学习DSR模型。其次，我们提出了一种新的现实DSR框架，该框架由四个模块组成:1)具有深度监督的迭代残差学习模块，以粗到精的方式学习深度图的有效高频分量;2)采用通道注意策略，增强高频成分丰富的通道;3)多阶段融合模块，可有效再利用粗到精过程的结果;4)深度细化模块，通过TGV正则化和输入损失对深度图进行改进。在基准数据集上进行的大量实验表明，我们的方法优于当前最先进的DSR方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量