A 3D Grid Mapping System Based on Depth Prediction from a Monocular Camera

2020 12th International Conference on Advanced Computational Intelligence (ICACI) Pub Date : 2020-08-01 DOI:10.1109/ICACI49185.2020.9177824

Peifeng Yan, Yuqing Lan, Shaowu Yang

{"title":"A 3D Grid Mapping System Based on Depth Prediction from a Monocular Camera","authors":"Peifeng Yan, Yuqing Lan, Shaowu Yang","doi":"10.1109/ICACI49185.2020.9177824","DOIUrl":null,"url":null,"abstract":"In complex unknown 3D environments, an accurate 3D volumetric representation of the environment is important for an intelligent robot. Simultaneous Localization and Mapping (SLAM) is considered as a fundamental direction in this area. RGB-D information is very important in traditional SLAM methods. The depth information obtained by sensors like some RGB-D cameras has limits in precision and accuracy. High-precision sensors like lasers and radars are often very expensive. Efficient algorithms should be adopted into those traditional SLAM system. They can not only improve the system efficiency in poorly-equipped conditions but also reduce the resources consumption of robots. To tackle the trade-off between performance and cost, this paper proposes a system producing a 3D grid map that can be used for navigation with a monocular camera and IMU of small size. Our system uses a deep neural network to predict the depth information of a monocular image and utilize the dynamic frame hopping strategy to make a smoother prediction result. Furthermore, we complete a 3D grid map directly used for navigation. The whole grid mapping process occupies little computation and storage resource at the same time. We adopt the octree structure and a keyframe method in the process of the 3D grid mapping to reduce resource consumption. Experiments in a real-world environment show that our approach achieves good results in depth prediction and can well update the 3D grid map for navigation.","PeriodicalId":137804,"journal":{"name":"2020 12th International Conference on Advanced Computational Intelligence (ICACI)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 12th International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACI49185.2020.9177824","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In complex unknown 3D environments, an accurate 3D volumetric representation of the environment is important for an intelligent robot. Simultaneous Localization and Mapping (SLAM) is considered as a fundamental direction in this area. RGB-D information is very important in traditional SLAM methods. The depth information obtained by sensors like some RGB-D cameras has limits in precision and accuracy. High-precision sensors like lasers and radars are often very expensive. Efficient algorithms should be adopted into those traditional SLAM system. They can not only improve the system efficiency in poorly-equipped conditions but also reduce the resources consumption of robots. To tackle the trade-off between performance and cost, this paper proposes a system producing a 3D grid map that can be used for navigation with a monocular camera and IMU of small size. Our system uses a deep neural network to predict the depth information of a monocular image and utilize the dynamic frame hopping strategy to make a smoother prediction result. Furthermore, we complete a 3D grid map directly used for navigation. The whole grid mapping process occupies little computation and storage resource at the same time. We adopt the octree structure and a keyframe method in the process of the 3D grid mapping to reduce resource consumption. Experiments in a real-world environment show that our approach achieves good results in depth prediction and can well update the 3D grid map for navigation.

查看原文本刊更多论文

基于单目相机深度预测的三维网格映射系统

在复杂的未知三维环境中，对环境进行精确的三维体表示对智能机器人至关重要。同时定位与制图(SLAM)被认为是该领域的一个基本方向。在传统的SLAM方法中，RGB-D信息非常重要。像一些RGB-D相机这样的传感器所获得的深度信息在精度和准确性上是有限的。像激光和雷达这样的高精度传感器通常非常昂贵。传统的SLAM系统需要采用高效的算法。它们不仅可以在装备差的条件下提高系统效率，还可以减少机器人的资源消耗。为了解决性能和成本之间的权衡问题，本文提出了一种生成3D网格地图的系统，该系统可用于单目相机和小尺寸IMU的导航。该系统采用深度神经网络对单眼图像的深度信息进行预测，并利用动态跳帧策略使预测结果更加平滑。此外，我们完成了一个3D网格地图直接用于导航。整个网格映射过程同时占用很少的计算和存储资源。在三维网格映射过程中，我们采用八叉树结构和关键帧方法来减少资源消耗。在现实环境中的实验表明，该方法在深度预测方面取得了较好的效果，并且可以很好地更新三维网格地图用于导航。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 12th International Conference on Advanced Computational Intelligence (ICACI)

自引率

0.00%

发文量