Faster 3D Reconstruction by Fusing 2D Object Detection and Self-Supervised Monocular Depth Estimation

2022 11th International Conference of Information and Communication Technology (ICTech)) Pub Date : 2022-02-01 DOI:10.1109/ICTech55460.2022.00104

Chao Fan, Zhenyu Yin, Xinxin Huang, Mingshi Li, Xiaohui Wang, Hui Li

{"title":"Faster 3D Reconstruction by Fusing 2D Object Detection and Self-Supervised Monocular Depth Estimation","authors":"Chao Fan, Zhenyu Yin, Xinxin Huang, Mingshi Li, Xiaohui Wang, Hui Li","doi":"10.1109/ICTech55460.2022.00104","DOIUrl":null,"url":null,"abstract":"In recent years, real-time 3D reconstruction has gained popularity among many researchers due to its increasing applications. 3D reconstruction methods based on depth estimation have a large number of invalid areas for reconstruction in autonomous driving. To further improve the real-time performance of 3D reconstruction, we propose a novel approach to reduce the consumption of computational resources by extracting significant regions of depth maps. First, binary masks that extract significant regions are generated by a 2D object detection model. Then, we design the TRDE module to extract target regions in generated depth maps. Finally, qualitative and quantitative results on KITTI dataset demonstrate that our approach can perform depth maps optimization and reduce computational resources consumed during the 3D reconstruction. As a result, our method achieves faster 3D reconstruction by fusing 2D object detection and self-supervised monocular depth estimation.","PeriodicalId":290836,"journal":{"name":"2022 11th International Conference of Information and Communication Technology (ICTech))","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference of Information and Communication Technology (ICTech))","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTech55460.2022.00104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, real-time 3D reconstruction has gained popularity among many researchers due to its increasing applications. 3D reconstruction methods based on depth estimation have a large number of invalid areas for reconstruction in autonomous driving. To further improve the real-time performance of 3D reconstruction, we propose a novel approach to reduce the consumption of computational resources by extracting significant regions of depth maps. First, binary masks that extract significant regions are generated by a 2D object detection model. Then, we design the TRDE module to extract target regions in generated depth maps. Finally, qualitative and quantitative results on KITTI dataset demonstrate that our approach can perform depth maps optimization and reduce computational resources consumed during the 3D reconstruction. As a result, our method achieves faster 3D reconstruction by fusing 2D object detection and self-supervised monocular depth estimation.

查看原文本刊更多论文

融合二维目标检测和自监督单目深度估计的快速三维重建

近年来，实时三维重建由于其越来越多的应用而受到许多研究人员的欢迎。基于深度估计的三维重建方法在自动驾驶中存在大量重构无效区域。为了进一步提高三维重建的实时性，我们提出了一种新的方法，通过提取深度图的重要区域来减少计算资源的消耗。首先，通过二维目标检测模型生成提取重要区域的二值掩码。然后，设计TRDE模块，在生成的深度图中提取目标区域。最后，在KITTI数据集上的定性和定量结果表明，我们的方法可以进行深度图优化，减少三维重建过程中的计算资源消耗。因此，我们的方法通过融合二维目标检测和自监督单目深度估计实现了更快的三维重建。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 11th International Conference of Information and Communication Technology (ICTech))

自引率

0.00%

发文量