Multiple Dynamic Object Tracking for Visual SLAM

Fuxin Liu Hubei, Yanduo Zhang, Xun Li
{"title":"Multiple Dynamic Object Tracking for Visual SLAM","authors":"Fuxin Liu Hubei, Yanduo Zhang, Xun Li","doi":"10.1109/ICRCV55858.2022.9953172","DOIUrl":null,"url":null,"abstract":"The assumption based on scene rigidity has been accepted widely in visual SLAM framework. However, the supposition limits the development of SLAM algorithm in the real world. Especially for automatic driving, many complicate cases involved in it, which demands our SLAM system that provides accurate position robustly and perceives the surrounding environment reliably. Therefore, in this paper, we propose a novel visual SLAM front-end module, which uses instance segmentation and dense optical flow estimation to ensure the efficient separation of static background and dynamic targets. For potential moving objects, we take advantage of Unscented Kalman Filter (UKF) to track moving targets and update the according moving state. In light of scale inconsistency in the camera pose estimation, we recover the scene structure and obtain the scale factor in the key frame by the depth estimation network. At the end, we integrate the estimated camera pose and dynamic object tracking into a unified visual odometry. In the process of trajectory optimization, we adopt the sliding window mechanism to acquire the spatio-temporal information of the dynamic object. The experiment results show that the tracking of dynamic objects not only can provide rich clues for surroundings understanding, but also help the tracking of camera pose, and then improve the robustness of the SLAM system in dynamic environment.","PeriodicalId":399667,"journal":{"name":"2022 4th International Conference on Robotics and Computer Vision (ICRCV)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Robotics and Computer Vision (ICRCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCV55858.2022.9953172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The assumption based on scene rigidity has been accepted widely in visual SLAM framework. However, the supposition limits the development of SLAM algorithm in the real world. Especially for automatic driving, many complicate cases involved in it, which demands our SLAM system that provides accurate position robustly and perceives the surrounding environment reliably. Therefore, in this paper, we propose a novel visual SLAM front-end module, which uses instance segmentation and dense optical flow estimation to ensure the efficient separation of static background and dynamic targets. For potential moving objects, we take advantage of Unscented Kalman Filter (UKF) to track moving targets and update the according moving state. In light of scale inconsistency in the camera pose estimation, we recover the scene structure and obtain the scale factor in the key frame by the depth estimation network. At the end, we integrate the estimated camera pose and dynamic object tracking into a unified visual odometry. In the process of trajectory optimization, we adopt the sliding window mechanism to acquire the spatio-temporal information of the dynamic object. The experiment results show that the tracking of dynamic objects not only can provide rich clues for surroundings understanding, but also help the tracking of camera pose, and then improve the robustness of the SLAM system in dynamic environment.
基于视觉SLAM的多动态目标跟踪
基于场景刚性的假设在视觉SLAM框架中被广泛接受。然而,这种假设限制了SLAM算法在现实世界中的发展。特别是在自动驾驶中,涉及到许多复杂的情况,这就要求我们的SLAM系统能够鲁棒地提供准确的位置并可靠地感知周围环境。因此,在本文中,我们提出了一种新的视觉SLAM前端模块,该模块采用实例分割和密集光流估计来保证静态背景和动态目标的有效分离。对于潜在的运动目标,我们利用无气味卡尔曼滤波(UKF)来跟踪运动目标并根据运动状态进行更新。针对摄像机姿态估计中存在尺度不一致的问题,利用深度估计网络恢复场景结构,获得关键帧中的尺度因子。最后,我们将估计的相机姿态和动态目标跟踪整合到一个统一的视觉里程计中。在轨迹优化过程中,采用滑动窗口机制获取动态目标的时空信息。实验结果表明,动态目标的跟踪不仅可以为周围环境的理解提供丰富的线索,而且有助于相机姿态的跟踪,从而提高SLAM系统在动态环境中的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信