Faster 3D Reconstruction by Fusing 2D Object Detection and Self-Supervised Monocular Depth Estimation

Chao Fan, Zhenyu Yin, Xinxin Huang, Mingshi Li, Xiaohui Wang, Hui Li
{"title":"Faster 3D Reconstruction by Fusing 2D Object Detection and Self-Supervised Monocular Depth Estimation","authors":"Chao Fan, Zhenyu Yin, Xinxin Huang, Mingshi Li, Xiaohui Wang, Hui Li","doi":"10.1109/ICTech55460.2022.00104","DOIUrl":null,"url":null,"abstract":"In recent years, real-time 3D reconstruction has gained popularity among many researchers due to its increasing applications. 3D reconstruction methods based on depth estimation have a large number of invalid areas for reconstruction in autonomous driving. To further improve the real-time performance of 3D reconstruction, we propose a novel approach to reduce the consumption of computational resources by extracting significant regions of depth maps. First, binary masks that extract significant regions are generated by a 2D object detection model. Then, we design the TRDE module to extract target regions in generated depth maps. Finally, qualitative and quantitative results on KITTI dataset demonstrate that our approach can perform depth maps optimization and reduce computational resources consumed during the 3D reconstruction. As a result, our method achieves faster 3D reconstruction by fusing 2D object detection and self-supervised monocular depth estimation.","PeriodicalId":290836,"journal":{"name":"2022 11th International Conference of Information and Communication Technology (ICTech))","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference of Information and Communication Technology (ICTech))","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTech55460.2022.00104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, real-time 3D reconstruction has gained popularity among many researchers due to its increasing applications. 3D reconstruction methods based on depth estimation have a large number of invalid areas for reconstruction in autonomous driving. To further improve the real-time performance of 3D reconstruction, we propose a novel approach to reduce the consumption of computational resources by extracting significant regions of depth maps. First, binary masks that extract significant regions are generated by a 2D object detection model. Then, we design the TRDE module to extract target regions in generated depth maps. Finally, qualitative and quantitative results on KITTI dataset demonstrate that our approach can perform depth maps optimization and reduce computational resources consumed during the 3D reconstruction. As a result, our method achieves faster 3D reconstruction by fusing 2D object detection and self-supervised monocular depth estimation.
融合二维目标检测和自监督单目深度估计的快速三维重建
近年来,实时三维重建由于其越来越多的应用而受到许多研究人员的欢迎。基于深度估计的三维重建方法在自动驾驶中存在大量重构无效区域。为了进一步提高三维重建的实时性,我们提出了一种新的方法,通过提取深度图的重要区域来减少计算资源的消耗。首先,通过二维目标检测模型生成提取重要区域的二值掩码。然后,设计TRDE模块,在生成的深度图中提取目标区域。最后,在KITTI数据集上的定性和定量结果表明,我们的方法可以进行深度图优化,减少三维重建过程中的计算资源消耗。因此,我们的方法通过融合二维目标检测和自监督单目深度估计实现了更快的三维重建。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信