A 3D Grid Mapping System Based on Depth Prediction from a Monocular Camera

Peifeng Yan, Yuqing Lan, Shaowu Yang
{"title":"A 3D Grid Mapping System Based on Depth Prediction from a Monocular Camera","authors":"Peifeng Yan, Yuqing Lan, Shaowu Yang","doi":"10.1109/ICACI49185.2020.9177824","DOIUrl":null,"url":null,"abstract":"In complex unknown 3D environments, an accurate 3D volumetric representation of the environment is important for an intelligent robot. Simultaneous Localization and Mapping (SLAM) is considered as a fundamental direction in this area. RGB-D information is very important in traditional SLAM methods. The depth information obtained by sensors like some RGB-D cameras has limits in precision and accuracy. High-precision sensors like lasers and radars are often very expensive. Efficient algorithms should be adopted into those traditional SLAM system. They can not only improve the system efficiency in poorly-equipped conditions but also reduce the resources consumption of robots. To tackle the trade-off between performance and cost, this paper proposes a system producing a 3D grid map that can be used for navigation with a monocular camera and IMU of small size. Our system uses a deep neural network to predict the depth information of a monocular image and utilize the dynamic frame hopping strategy to make a smoother prediction result. Furthermore, we complete a 3D grid map directly used for navigation. The whole grid mapping process occupies little computation and storage resource at the same time. We adopt the octree structure and a keyframe method in the process of the 3D grid mapping to reduce resource consumption. Experiments in a real-world environment show that our approach achieves good results in depth prediction and can well update the 3D grid map for navigation.","PeriodicalId":137804,"journal":{"name":"2020 12th International Conference on Advanced Computational Intelligence (ICACI)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 12th International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACI49185.2020.9177824","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In complex unknown 3D environments, an accurate 3D volumetric representation of the environment is important for an intelligent robot. Simultaneous Localization and Mapping (SLAM) is considered as a fundamental direction in this area. RGB-D information is very important in traditional SLAM methods. The depth information obtained by sensors like some RGB-D cameras has limits in precision and accuracy. High-precision sensors like lasers and radars are often very expensive. Efficient algorithms should be adopted into those traditional SLAM system. They can not only improve the system efficiency in poorly-equipped conditions but also reduce the resources consumption of robots. To tackle the trade-off between performance and cost, this paper proposes a system producing a 3D grid map that can be used for navigation with a monocular camera and IMU of small size. Our system uses a deep neural network to predict the depth information of a monocular image and utilize the dynamic frame hopping strategy to make a smoother prediction result. Furthermore, we complete a 3D grid map directly used for navigation. The whole grid mapping process occupies little computation and storage resource at the same time. We adopt the octree structure and a keyframe method in the process of the 3D grid mapping to reduce resource consumption. Experiments in a real-world environment show that our approach achieves good results in depth prediction and can well update the 3D grid map for navigation.
基于单目相机深度预测的三维网格映射系统
在复杂的未知三维环境中,对环境进行精确的三维体表示对智能机器人至关重要。同时定位与制图(SLAM)被认为是该领域的一个基本方向。在传统的SLAM方法中,RGB-D信息非常重要。像一些RGB-D相机这样的传感器所获得的深度信息在精度和准确性上是有限的。像激光和雷达这样的高精度传感器通常非常昂贵。传统的SLAM系统需要采用高效的算法。它们不仅可以在装备差的条件下提高系统效率,还可以减少机器人的资源消耗。为了解决性能和成本之间的权衡问题,本文提出了一种生成3D网格地图的系统,该系统可用于单目相机和小尺寸IMU的导航。该系统采用深度神经网络对单眼图像的深度信息进行预测,并利用动态跳帧策略使预测结果更加平滑。此外,我们完成了一个3D网格地图直接用于导航。整个网格映射过程同时占用很少的计算和存储资源。在三维网格映射过程中,我们采用八叉树结构和关键帧方法来减少资源消耗。在现实环境中的实验表明,该方法在深度预测方面取得了较好的效果,并且可以很好地更新三维网格地图用于导航。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信