{"title":"STSLAM: Robust visual SLAM in dynamic scenes via image segmentation and instance tracking","authors":"Yiwei Xiu , Xiao Liang , Guodong Chen","doi":"10.1016/j.robot.2025.105150","DOIUrl":null,"url":null,"abstract":"<div><div>Although visual simultaneous localization and mapping (SLAM) has made significant progress in localization accuracy, its robustness can be further improved. The primary reason for this is the insufficient modeling of dynamic instances, which leads to tracking failures for current SLAM methods in dynamic scenes. Furthermore, the lack of semantic information is also a problem in the traditional visual SLAM field. To solve these problems, this paper proposes a visual SLAM algorithm called <em>segmentation and tracking SLAM</em> (STSLAM). We apply image segmentation and instance tracking to visual SLAM. The image segmentation and instance tracking task is achieved through a video panoptic segmentation algorithm. By integrating the learning-based algorithm into the SLAM system, STSLAM not only achieves motion estimation for each dynamic instance but also introduces novel factors for factor graph construction to constrain these dynamic instances. Meanwhile, we use the learning-based algorithm to assign semantics to the map and build a panoptic point cloud map. Finally, ablation studies and comparative experiments are conducted on the KITTI, TUM RGB-D and Bonn RGB-D Dynamic dataset, which verify the effectiveness of the STSLAM method and achieve state-of-the-art performance.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"194 ","pages":"Article 105150"},"PeriodicalIF":5.2000,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889025002477","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Although visual simultaneous localization and mapping (SLAM) has made significant progress in localization accuracy, its robustness can be further improved. The primary reason for this is the insufficient modeling of dynamic instances, which leads to tracking failures for current SLAM methods in dynamic scenes. Furthermore, the lack of semantic information is also a problem in the traditional visual SLAM field. To solve these problems, this paper proposes a visual SLAM algorithm called segmentation and tracking SLAM (STSLAM). We apply image segmentation and instance tracking to visual SLAM. The image segmentation and instance tracking task is achieved through a video panoptic segmentation algorithm. By integrating the learning-based algorithm into the SLAM system, STSLAM not only achieves motion estimation for each dynamic instance but also introduces novel factors for factor graph construction to constrain these dynamic instances. Meanwhile, we use the learning-based algorithm to assign semantics to the map and build a panoptic point cloud map. Finally, ablation studies and comparative experiments are conducted on the KITTI, TUM RGB-D and Bonn RGB-D Dynamic dataset, which verify the effectiveness of the STSLAM method and achieve state-of-the-art performance.
期刊介绍:
Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems.
Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.