从粗到细:一种基于左右一致性的单目深度估计模型

2019 IEEE 19th International Conference on Communication Technology (ICCT) Pub Date : 2019-10-01 DOI:10.1109/ICCT46805.2019.8947220

Zeyu Lei, Yan Wang, Yufan Xu, Rui Huang

{"title":"从粗到细:一种基于左右一致性的单目深度估计模型","authors":"Zeyu Lei, Yan Wang, Yufan Xu, Rui Huang","doi":"10.1109/ICCT46805.2019.8947220","DOIUrl":null,"url":null,"abstract":"Predicting depth from an image is an essential problem in the area of computer vision and deep learning shows a great potential in this area. However most deep Convolutional Neural Networks are need to train them using vast amount of manually labelled data, which is difficult or even scarcely possible in some special environment. In this paper, we proposed an unsupervised method based on left-right consistence with multi-loss fusion, which can perform single image depth estimation, despite the absence of ground truth data. We treat the issue as an image reconstruction problem by training our network with a combine of SSIM and Huber loss. To achieve estimation the depth from coarse to fine, we estimate a coarse map in the former layer and using bilinear sample to transmit the map to the latter layer to obtain a fine depth map. Our method achieves more accurate result on KITTI driving dataset.","PeriodicalId":306112,"journal":{"name":"2019 IEEE 19th International Conference on Communication Technology (ICCT)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"From Coarse to Fine: A Monocular Depth Estimation Model Based on Left-Right Consistency\",\"authors\":\"Zeyu Lei, Yan Wang, Yufan Xu, Rui Huang\",\"doi\":\"10.1109/ICCT46805.2019.8947220\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predicting depth from an image is an essential problem in the area of computer vision and deep learning shows a great potential in this area. However most deep Convolutional Neural Networks are need to train them using vast amount of manually labelled data, which is difficult or even scarcely possible in some special environment. In this paper, we proposed an unsupervised method based on left-right consistence with multi-loss fusion, which can perform single image depth estimation, despite the absence of ground truth data. We treat the issue as an image reconstruction problem by training our network with a combine of SSIM and Huber loss. To achieve estimation the depth from coarse to fine, we estimate a coarse map in the former layer and using bilinear sample to transmit the map to the latter layer to obtain a fine depth map. Our method achieves more accurate result on KITTI driving dataset.\",\"PeriodicalId\":306112,\"journal\":{\"name\":\"2019 IEEE 19th International Conference on Communication Technology (ICCT)\",\"volume\":\"116 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 19th International Conference on Communication Technology (ICCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCT46805.2019.8947220\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 19th International Conference on Communication Technology (ICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCT46805.2019.8947220","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

从图像中预测深度是计算机视觉领域的一个重要问题，深度学习在这一领域显示出巨大的潜力。然而，大多数深度卷积神经网络需要使用大量人工标记的数据进行训练，这在某些特殊环境下是困难的，甚至是几乎不可能的。在本文中，我们提出了一种基于多损失融合的左右一致性的无监督方法，该方法可以在没有地面真值数据的情况下进行单幅图像深度估计。我们通过结合SSIM和Huber损失训练我们的网络，将该问题视为图像重建问题。为了实现从粗到细的深度估计，我们在前一层估计一个粗图，并使用双线性样本将该图传输到后一层，得到一个精细的深度图。我们的方法在KITTI驾驶数据集上得到了更准确的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

From Coarse to Fine: A Monocular Depth Estimation Model Based on Left-Right Consistency

Predicting depth from an image is an essential problem in the area of computer vision and deep learning shows a great potential in this area. However most deep Convolutional Neural Networks are need to train them using vast amount of manually labelled data, which is difficult or even scarcely possible in some special environment. In this paper, we proposed an unsupervised method based on left-right consistence with multi-loss fusion, which can perform single image depth estimation, despite the absence of ground truth data. We treat the issue as an image reconstruction problem by training our network with a combine of SSIM and Huber loss. To achieve estimation the depth from coarse to fine, we estimate a coarse map in the former layer and using bilinear sample to transmit the map to the latter layer to obtain a fine depth map. Our method achieves more accurate result on KITTI driving dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE 19th International Conference on Communication Technology (ICCT)

自引率

0.00%

发文量