Less is more: An effective method to extract object features for visual dynamic SLAM

IF 3.4 2区工程技术 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Displays Pub Date : 2025-09-23 DOI:10.1016/j.displa.2025.103224

Jianbo Zhang , Liang Yuan , Teng Ran , Jun Jia , Shuo Yang , Long Tang

{"title":"Less is more: An effective method to extract object features for visual dynamic SLAM","authors":"Jianbo Zhang , Liang Yuan , Teng Ran , Jun Jia , Shuo Yang , Long Tang","doi":"10.1016/j.displa.2025.103224","DOIUrl":null,"url":null,"abstract":"<div><div>Visual Simultaneous Localization and Mapping (VSLAM) is an essential foundation in augmented reality (AR) and mobile robotics. Dynamic scenes in the real world are a main challenge for VSLAM because it contravenes the fundamental assumptions based on static environments. Joint pose optimization with dynamic object modeling and camera pose estimation is a novel approach. However, it is challenging to model the motion of both the camera and the dynamic object when they are moving simultaneously. In this paper, we propose an efficient feature extraction approach for modeling dynamic object motion. We describe the object comprehensively through a more optimal feature selection strategy, which improves the performance of object tracking and pose estimation. The proposed approach combines image gradients and feature point clustering on dynamic objects. In the back-end optimization stage, we introduce rigid constraints on the dynamic object to optimize the poses using the graph model and obtain a high accuracy. The experimental results on the KITTI datasets demonstrate that the performance of the proposed approach is efficient and accurate.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"91 ","pages":"Article 103224"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225002616","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

Visual Simultaneous Localization and Mapping (VSLAM) is an essential foundation in augmented reality (AR) and mobile robotics. Dynamic scenes in the real world are a main challenge for VSLAM because it contravenes the fundamental assumptions based on static environments. Joint pose optimization with dynamic object modeling and camera pose estimation is a novel approach. However, it is challenging to model the motion of both the camera and the dynamic object when they are moving simultaneously. In this paper, we propose an efficient feature extraction approach for modeling dynamic object motion. We describe the object comprehensively through a more optimal feature selection strategy, which improves the performance of object tracking and pose estimation. The proposed approach combines image gradients and feature point clustering on dynamic objects. In the back-end optimization stage, we introduce rigid constraints on the dynamic object to optimize the poses using the graph model and obtain a high accuracy. The experimental results on the KITTI datasets demonstrate that the performance of the proposed approach is efficient and accurate.

查看原文本刊更多论文

少即是多：一种有效的视觉动态SLAM目标特征提取方法

视觉同步定位与制图（VSLAM）是增强现实（AR）和移动机器人技术的重要基础。现实世界中的动态场景是VSLAM面临的主要挑战，因为它违背了基于静态环境的基本假设。结合动态目标建模和相机姿态估计的关节姿态优化是一种新颖的方法。然而，当相机和动态物体同时运动时，对它们的运动建模是一项挑战。在本文中，我们提出了一种有效的特征提取方法来建模动态物体运动。通过更优的特征选择策略对目标进行更全面的描述，提高了目标跟踪和姿态估计的性能。该方法结合了动态目标上的图像梯度和特征点聚类。在后端优化阶段，引入动态对象的刚性约束，利用图模型优化姿态，获得了较高的优化精度。在KITTI数据集上的实验结果表明了该方法的有效性和准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Displays 工程技术-工程：电子与电气

CiteScore

4.60

自引率

25.60%

发文量

138

审稿时长

92 days

期刊介绍： Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface. Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.