FPEVO: Fused point-edge visual odometry for low-structured and low-textured scenes

IF 3.1 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of Visual Communication and Image Representation Pub Date : 2025-10-08 DOI:10.1016/j.jvcir.2025.104599

Dylan Brown , Hans Grobler , Johan Pieter de Villiers

{"title":"FPEVO: Fused point-edge visual odometry for low-structured and low-textured scenes","authors":"Dylan Brown , Hans Grobler , Johan Pieter de Villiers","doi":"10.1016/j.jvcir.2025.104599","DOIUrl":null,"url":null,"abstract":"<div><div>Visual odometry is an essential component of vision-based robotic navigation systems. A primary limitation of existing visual odometry solutions is their inability to achieve satisfactory performance in both high- and low-textured regions. In this paper, a robust RGB-D visual odometry method is proposed that fuses point and edge features. By combining the descriptiveness of feature points with the structure provided by edge data, a method that is robust to low-textured scenes is developed. Edge features are first detected and grouped based on the Gestalt principles of continuity and proximity. Edge groups are then associated between the current and previous frames using point features in the vicinity of the edges. Pose estimation is thereafter performed by first matching points between associated edge groups, filtering these points based on structural constraints imposed by the edges, and estimating the motion of the agent. Compared to state-of-the-art alternatives, such as REVO, MSC-VO, DROID-VO and SplaTAM on the TUM RGB-D, ICL-NUIM and Tartan-Air datasets, the resulting method reduces the root mean square absolute trajectory error, and translational and rotational relative pose errors by up to 58%, 75%, and 82%, respectively. This indicates that our method is not only more accurate than current approaches, but also more consistent, especially in low-structured and low-textured environments.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"112 ","pages":"Article 104599"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325002135","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Visual odometry is an essential component of vision-based robotic navigation systems. A primary limitation of existing visual odometry solutions is their inability to achieve satisfactory performance in both high- and low-textured regions. In this paper, a robust RGB-D visual odometry method is proposed that fuses point and edge features. By combining the descriptiveness of feature points with the structure provided by edge data, a method that is robust to low-textured scenes is developed. Edge features are first detected and grouped based on the Gestalt principles of continuity and proximity. Edge groups are then associated between the current and previous frames using point features in the vicinity of the edges. Pose estimation is thereafter performed by first matching points between associated edge groups, filtering these points based on structural constraints imposed by the edges, and estimating the motion of the agent. Compared to state-of-the-art alternatives, such as REVO, MSC-VO, DROID-VO and SplaTAM on the TUM RGB-D, ICL-NUIM and Tartan-Air datasets, the resulting method reduces the root mean square absolute trajectory error, and translational and rotational relative pose errors by up to 58%, 75%, and 82%, respectively. This indicates that our method is not only more accurate than current approaches, but also more consistent, especially in low-structured and low-textured environments.

查看原文本刊更多论文

FPEVO：用于低结构和低纹理场景的融合点边缘视觉里程计

视觉里程计是基于视觉的机器人导航系统的重要组成部分。现有视觉里程计解决方案的主要限制是它们无法在高纹理和低纹理区域实现令人满意的性能。本文提出了一种融合点和边缘特征的鲁棒RGB-D视觉里程计方法。将特征点的描述性与边缘数据提供的结构相结合，提出了一种对低纹理场景具有鲁棒性的方法。首先根据格式塔的连续性和接近性原则对边缘特征进行检测和分组。然后使用边缘附近的点特征将边缘组关联到当前帧和以前的帧之间。然后通过首先在相关边缘组之间匹配点来进行姿态估计，根据边缘施加的结构约束过滤这些点，并估计代理的运动。与TUM RGB-D、ICL-NUIM和Tartan-Air数据集上的REVO、MSC-VO、DROID-VO和SplaTAM等最先进的替代方法相比，该方法将绝对轨迹误差的均方根、平移和旋转相对姿态误差分别降低了58%、75%和82%。这表明我们的方法不仅比目前的方法更准确，而且更一致，特别是在低结构和低纹理环境中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Visual Communication and Image Representation 工程技术-计算机：软件工程

CiteScore

5.40

自引率

11.50%

发文量

188

审稿时长

9.9 months

期刊介绍： The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.