Dylan Brown , Hans Grobler , Johan Pieter de Villiers
{"title":"FPEVO: Fused point-edge visual odometry for low-structured and low-textured scenes","authors":"Dylan Brown , Hans Grobler , Johan Pieter de Villiers","doi":"10.1016/j.jvcir.2025.104599","DOIUrl":null,"url":null,"abstract":"<div><div>Visual odometry is an essential component of vision-based robotic navigation systems. A primary limitation of existing visual odometry solutions is their inability to achieve satisfactory performance in both high- and low-textured regions. In this paper, a robust RGB-D visual odometry method is proposed that fuses point and edge features. By combining the descriptiveness of feature points with the structure provided by edge data, a method that is robust to low-textured scenes is developed. Edge features are first detected and grouped based on the Gestalt principles of continuity and proximity. Edge groups are then associated between the current and previous frames using point features in the vicinity of the edges. Pose estimation is thereafter performed by first matching points between associated edge groups, filtering these points based on structural constraints imposed by the edges, and estimating the motion of the agent. Compared to state-of-the-art alternatives, such as REVO, MSC-VO, DROID-VO and SplaTAM on the TUM RGB-D, ICL-NUIM and Tartan-Air datasets, the resulting method reduces the root mean square absolute trajectory error, and translational and rotational relative pose errors by up to 58%, 75%, and 82%, respectively. This indicates that our method is not only more accurate than current approaches, but also more consistent, especially in low-structured and low-textured environments.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"112 ","pages":"Article 104599"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325002135","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Visual odometry is an essential component of vision-based robotic navigation systems. A primary limitation of existing visual odometry solutions is their inability to achieve satisfactory performance in both high- and low-textured regions. In this paper, a robust RGB-D visual odometry method is proposed that fuses point and edge features. By combining the descriptiveness of feature points with the structure provided by edge data, a method that is robust to low-textured scenes is developed. Edge features are first detected and grouped based on the Gestalt principles of continuity and proximity. Edge groups are then associated between the current and previous frames using point features in the vicinity of the edges. Pose estimation is thereafter performed by first matching points between associated edge groups, filtering these points based on structural constraints imposed by the edges, and estimating the motion of the agent. Compared to state-of-the-art alternatives, such as REVO, MSC-VO, DROID-VO and SplaTAM on the TUM RGB-D, ICL-NUIM and Tartan-Air datasets, the resulting method reduces the root mean square absolute trajectory error, and translational and rotational relative pose errors by up to 58%, 75%, and 82%, respectively. This indicates that our method is not only more accurate than current approaches, but also more consistent, especially in low-structured and low-textured environments.
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.