Can Scale-Consistent Monocular Depth Be Learned in a Self-Supervised Scale-Invariant Manner?

2021 IEEE/CVF International Conference on Computer Vision (ICCV) Pub Date : 2021-10-01 DOI:10.1109/ICCV48922.2021.01249

Lijun Wang, Yifan Wang, Linzhao Wang, Yu-Wei Zhan, Ying Wang, Huchuan Lu

{"title":"Can Scale-Consistent Monocular Depth Be Learned in a Self-Supervised Scale-Invariant Manner?","authors":"Lijun Wang, Yifan Wang, Linzhao Wang, Yu-Wei Zhan, Ying Wang, Huchuan Lu","doi":"10.1109/ICCV48922.2021.01249","DOIUrl":null,"url":null,"abstract":"Geometric constraints are shown to enforce scale consistency and remedy the scale ambiguity issue in self-supervised monocular depth estimation. Meanwhile, scale-invariant losses focus on learning relative depth, leading to accurate relative depth prediction. To combine the best of both worlds, we learn scale-consistent self-supervised depth in a scale-invariant manner. Towards this goal, we present a scale-aware geometric (SAG) loss, which enforces scale consistency through point cloud alignment. Compared to prior arts, SAG loss takes relative scale into consideration during relative motion estimation, enabling more precise alignment and explicit supervision for scale inference. In addition, a novel two-stream architecture for depth estimation is designed, which disentangles scale from depth estimation and allows depth to be learned in a scale-invariant manner. The integration of SAG loss and two-stream network enables more consistent scale inference and more accurate relative depth estimation. Our method achieves state-of-the-art performance under both scale-invariant and scale-dependent evaluation settings.","PeriodicalId":6820,"journal":{"name":"2021 IEEE/CVF International Conference on Computer Vision (ICCV)","volume":"6 1","pages":"12707-12716"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/CVF International Conference on Computer Vision (ICCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV48922.2021.01249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

Abstract

Geometric constraints are shown to enforce scale consistency and remedy the scale ambiguity issue in self-supervised monocular depth estimation. Meanwhile, scale-invariant losses focus on learning relative depth, leading to accurate relative depth prediction. To combine the best of both worlds, we learn scale-consistent self-supervised depth in a scale-invariant manner. Towards this goal, we present a scale-aware geometric (SAG) loss, which enforces scale consistency through point cloud alignment. Compared to prior arts, SAG loss takes relative scale into consideration during relative motion estimation, enabling more precise alignment and explicit supervision for scale inference. In addition, a novel two-stream architecture for depth estimation is designed, which disentangles scale from depth estimation and allows depth to be learned in a scale-invariant manner. The integration of SAG loss and two-stream network enables more consistent scale inference and more accurate relative depth estimation. Our method achieves state-of-the-art performance under both scale-invariant and scale-dependent evaluation settings.

查看原文本刊更多论文

尺度一致的单目深度能否以自监督尺度不变的方式学习?

在自监督单目深度估计中，几何约束可以增强尺度一致性并解决尺度模糊问题。同时，尺度不变损失侧重于学习相对深度，从而实现准确的相对深度预测。为了结合这两个世界的优点，我们以尺度不变的方式学习尺度一致的自监督深度。为了实现这一目标，我们提出了一种尺度感知几何损失(SAG)，它通过点云对齐来增强尺度一致性。与现有技术相比，SAG损失在相对运动估计中考虑了相对尺度，从而实现了更精确的对齐和对尺度推理的明确监督。此外，设计了一种新的深度估计的双流架构，将尺度与深度估计分离开来，并允许深度以尺度不变的方式学习。将SAG损失与两流网络相结合，使尺度推断更加一致，相对深度估计更加准确。我们的方法在尺度不变和尺度相关的评估设置下都实现了最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/CVF International Conference on Computer Vision (ICCV)

自引率

0.00%

发文量