Self-Supervised Depth Estimation Based on the Consistency of Synthetic-real Image Prediction

2023 International Conference on Advanced Robotics and Mechatronics (ICARM) Pub Date : 2023-07-08 DOI:10.1109/ICARM58088.2023.10218857

Wei Tong, Yubing Gao, E. Wu, Limin Zhu

{"title":"Self-Supervised Depth Estimation Based on the Consistency of Synthetic-real Image Prediction","authors":"Wei Tong, Yubing Gao, E. Wu, Limin Zhu","doi":"10.1109/ICARM58088.2023.10218857","DOIUrl":null,"url":null,"abstract":"Learning-based multi-view stereo aims to restore the real scene from multiple images with overlapping areas. The mainstream self-supervised MVS method trains the model based on the assumption that spatial points from different perspectives share the same color information. To further suppress the interference from specular reflection and illumination noise, this work proposes a self-supervised MVS network based on the consistency of synthetic-real image prediction. The network first applies the coarse-to-fine manner to gradually refine the depth map, and the source images are projected to the reference view to generate the synthesized reference image. Then the synthesized image with real source images are re-input into the network to form a cycled network, and the consistency constraint of the prediction results of the two periods is introduced to improve the color anti-interference of the self- supervised MVS model. The comprehensive experiments on the public dataset show that the proposed work can further improve the reconstruction performance of the benchmark model, which verifies the effectiveness of the proposed work.","PeriodicalId":220013,"journal":{"name":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARM58088.2023.10218857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Learning-based multi-view stereo aims to restore the real scene from multiple images with overlapping areas. The mainstream self-supervised MVS method trains the model based on the assumption that spatial points from different perspectives share the same color information. To further suppress the interference from specular reflection and illumination noise, this work proposes a self-supervised MVS network based on the consistency of synthetic-real image prediction. The network first applies the coarse-to-fine manner to gradually refine the depth map, and the source images are projected to the reference view to generate the synthesized reference image. Then the synthesized image with real source images are re-input into the network to form a cycled network, and the consistency constraint of the prediction results of the two periods is introduced to improve the color anti-interference of the self- supervised MVS model. The comprehensive experiments on the public dataset show that the proposed work can further improve the reconstruction performance of the benchmark model, which verifies the effectiveness of the proposed work.

查看原文本刊更多论文

基于合成真实图像预测一致性的自监督深度估计

基于学习的多视点立体视觉旨在从多幅重叠区域的图像中还原真实场景。主流的自监督MVS方法基于不同角度的空间点共享相同颜色信息的假设来训练模型。为了进一步抑制镜面反射和照明噪声的干扰，本文提出了一种基于合成-真实图像预测一致性的自监督MVS网络。该网络首先采用由粗到精的方式对深度图进行逐步细化，并将源图像投影到参考视图中生成合成参考图像。然后将合成图像与真实源图像重新输入到网络中，形成一个循环网络，并引入两个周期预测结果的一致性约束，以提高自监督MVS模型的颜色抗干扰性。在公共数据集上的综合实验表明，本文所提方法可以进一步提高基准模型的重构性能，验证了本文所提方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Advanced Robotics and Mechatronics (ICARM)

自引率

0.00%

发文量