{"title":"MS-SLAM:采用滑动窗口地图稀疏化技术的内存高效视觉 SLAM","authors":"Xiaoyu Zhang, Jinhu Dong, Yin Zhang, Yun‐Hui Liu","doi":"10.1002/rob.22431","DOIUrl":null,"url":null,"abstract":"<jats:label/>While most visual SLAM systems traditionally prioritize accuracy or speed, the associated memory consumption would also become a concern for robots working in large‐scale environments, primarily due to the perpetual preservation of increasing number of redundant map points. Although these redundant map points are initially constructed to ensure robust frame tracking, they contribute little once the robot moves to other locations and are primarily kept for potential loop closure. After continuous optimization, these map points are accurate and actually not all of them are essential for loop closure. Therefore, this paper proposes MS‐SLAM, a memory‐efficient visual SLAM system with map sparsification aimed at selecting only parts of useful map points to keep in the global map. In MS‐SLAM, all local map points are temporarily kept to ensure robust frame tracking and further optimization, while redundant nonlocal map points are removed through the proposed novel sliding window map sparsification, which is efficient and running concurrently with original SLAM tracking. The loop closure still operates well with the selected useful map points. Through exhaustive experiments across various scenes in both public and self‐collected data sets, MS‐SLAM has demonstrated comparable accuracy with the state‐of‐the‐art visual SLAM while significantly reducing memory consumption by over 70% in large‐scale scenes. This facilitates the scalability of visual SLAM in large‐scale environments, making it a promising solution for real‐world applications. We will release our codes at <jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" xlink:href=\"https://github.com/fishmarch/MS-SLAM\">https://github.com/fishmarch/MS-SLAM</jats:ext-link>.","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"54 1","pages":""},"PeriodicalIF":4.2000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MS‐SLAM: Memory‐Efficient Visual SLAM With Sliding Window Map Sparsification\",\"authors\":\"Xiaoyu Zhang, Jinhu Dong, Yin Zhang, Yun‐Hui Liu\",\"doi\":\"10.1002/rob.22431\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<jats:label/>While most visual SLAM systems traditionally prioritize accuracy or speed, the associated memory consumption would also become a concern for robots working in large‐scale environments, primarily due to the perpetual preservation of increasing number of redundant map points. Although these redundant map points are initially constructed to ensure robust frame tracking, they contribute little once the robot moves to other locations and are primarily kept for potential loop closure. After continuous optimization, these map points are accurate and actually not all of them are essential for loop closure. Therefore, this paper proposes MS‐SLAM, a memory‐efficient visual SLAM system with map sparsification aimed at selecting only parts of useful map points to keep in the global map. In MS‐SLAM, all local map points are temporarily kept to ensure robust frame tracking and further optimization, while redundant nonlocal map points are removed through the proposed novel sliding window map sparsification, which is efficient and running concurrently with original SLAM tracking. The loop closure still operates well with the selected useful map points. Through exhaustive experiments across various scenes in both public and self‐collected data sets, MS‐SLAM has demonstrated comparable accuracy with the state‐of‐the‐art visual SLAM while significantly reducing memory consumption by over 70% in large‐scale scenes. This facilitates the scalability of visual SLAM in large‐scale environments, making it a promising solution for real‐world applications. We will release our codes at <jats:ext-link xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\" xlink:href=\\\"https://github.com/fishmarch/MS-SLAM\\\">https://github.com/fishmarch/MS-SLAM</jats:ext-link>.\",\"PeriodicalId\":192,\"journal\":{\"name\":\"Journal of Field Robotics\",\"volume\":\"54 1\",\"pages\":\"\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Field Robotics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1002/rob.22431\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1002/rob.22431","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0
摘要
虽然大多数视觉 SLAM 系统传统上都优先考虑精度或速度,但对于在大规模环境中工作的机器人来说,相关的内存消耗也是一个令人担忧的问题,这主要是由于需要永久保存越来越多的冗余地图点。虽然这些冗余地图点最初是为了确保稳健的帧跟踪而构建的,但一旦机器人移动到其他位置,这些冗余地图点的作用就微乎其微了,主要是为了潜在的闭环而保留。经过不断优化后,这些地图点就会变得精确,实际上并非所有地图点都是闭环所必需的。因此,本文提出了 MS-SLAM,一种具有地图稀疏化功能的内存高效视觉 SLAM 系统,旨在只选择部分有用的地图点保留在全局地图中。在 MS-SLAM 中,所有局部地图点都被暂时保留,以确保帧跟踪的稳健性和进一步优化,而多余的非局部地图点则通过所提出的新型滑动窗口地图稀疏化技术被移除。在选定有用地图点的情况下,环路闭合仍然运行良好。通过在公共数据集和自收集数据集的各种场景中进行详尽的实验,MS-SLAM 的精确度与最先进的视觉 SLAM 不相上下,同时在大规模场景中显著减少了 70% 以上的内存消耗。这促进了视觉 SLAM 在大规模环境中的可扩展性,使其成为现实世界应用中一个前景广阔的解决方案。我们将在 https://github.com/fishmarch/MS-SLAM 发布我们的代码。
MS‐SLAM: Memory‐Efficient Visual SLAM With Sliding Window Map Sparsification
While most visual SLAM systems traditionally prioritize accuracy or speed, the associated memory consumption would also become a concern for robots working in large‐scale environments, primarily due to the perpetual preservation of increasing number of redundant map points. Although these redundant map points are initially constructed to ensure robust frame tracking, they contribute little once the robot moves to other locations and are primarily kept for potential loop closure. After continuous optimization, these map points are accurate and actually not all of them are essential for loop closure. Therefore, this paper proposes MS‐SLAM, a memory‐efficient visual SLAM system with map sparsification aimed at selecting only parts of useful map points to keep in the global map. In MS‐SLAM, all local map points are temporarily kept to ensure robust frame tracking and further optimization, while redundant nonlocal map points are removed through the proposed novel sliding window map sparsification, which is efficient and running concurrently with original SLAM tracking. The loop closure still operates well with the selected useful map points. Through exhaustive experiments across various scenes in both public and self‐collected data sets, MS‐SLAM has demonstrated comparable accuracy with the state‐of‐the‐art visual SLAM while significantly reducing memory consumption by over 70% in large‐scale scenes. This facilitates the scalability of visual SLAM in large‐scale environments, making it a promising solution for real‐world applications. We will release our codes at https://github.com/fishmarch/MS-SLAM.
期刊介绍:
The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments.
The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.