Chenyang Zhang;Shuo Gu;Xiao Li;Jianghua Deng;Sheng Jin
{"title":"VID-SLAM: A New Visual Inertial SLAM Algorithm Coupling an RGB-D Camera and IMU Based on Adaptive Point and Line Features","authors":"Chenyang Zhang;Shuo Gu;Xiao Li;Jianghua Deng;Sheng Jin","doi":"10.1109/JSEN.2024.3486556","DOIUrl":null,"url":null,"abstract":"The visual-inertial simultaneous localization and mapping (VI-SLAM), which integrates data from monocular or stereo cameras, has garnered significant attention and development. The RGB-D camera, capable of capturing both color and depth images simultaneously, can perceive a comprehensive view of the surroundings. To fully leverage two types of measurement information from the RGB-D camera and inertial measurement unit (IMU) sensor for accurate pose estimation, we propose a new VI-SLAM algorithm, VID-SLAM, that effectively couples the RGB-D camera with the IMU. In our proposal, we first develop an adaptive point feature detection approach that rapidly detects and tracks sufficient point features. This approach uses adaptive nonmaximum suppression and the KD-Tree algorithm to ensure a homogeneous distribution of point features. Second, we incorporate line features into the pose estimation module of the simultaneous localization and mapping (SLAM) algorithm. By screening line features based on the geometric properties of vanishing points, we ensure that the detected lines align with the edges of scene objects as early as possible. Beyond the 2-D reprojection error of line features, we introduce a new error term that leverages the geometric constraints of plane normal vectors formed by matched line features and the optical center of the RGB-D camera; furthermore, we estimate the pose of the RGB-D camera by loosely coupling point-line visual features with IMU preintegration measurements. In the back end of VID-SLAM, we tightly couple the point-line feature error model with the IMU preintegration to jointly optimize the camera pose. Extensive qualitative and quantitative comparisons demonstrate that our VID-SLAM algorithm achieves robust performance and comparable accuracy.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"24 24","pages":"41548-41562"},"PeriodicalIF":4.3000,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10740612/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The visual-inertial simultaneous localization and mapping (VI-SLAM), which integrates data from monocular or stereo cameras, has garnered significant attention and development. The RGB-D camera, capable of capturing both color and depth images simultaneously, can perceive a comprehensive view of the surroundings. To fully leverage two types of measurement information from the RGB-D camera and inertial measurement unit (IMU) sensor for accurate pose estimation, we propose a new VI-SLAM algorithm, VID-SLAM, that effectively couples the RGB-D camera with the IMU. In our proposal, we first develop an adaptive point feature detection approach that rapidly detects and tracks sufficient point features. This approach uses adaptive nonmaximum suppression and the KD-Tree algorithm to ensure a homogeneous distribution of point features. Second, we incorporate line features into the pose estimation module of the simultaneous localization and mapping (SLAM) algorithm. By screening line features based on the geometric properties of vanishing points, we ensure that the detected lines align with the edges of scene objects as early as possible. Beyond the 2-D reprojection error of line features, we introduce a new error term that leverages the geometric constraints of plane normal vectors formed by matched line features and the optical center of the RGB-D camera; furthermore, we estimate the pose of the RGB-D camera by loosely coupling point-line visual features with IMU preintegration measurements. In the back end of VID-SLAM, we tightly couple the point-line feature error model with the IMU preintegration to jointly optimize the camera pose. Extensive qualitative and quantitative comparisons demonstrate that our VID-SLAM algorithm achieves robust performance and comparable accuracy.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice