Yuhang Ming;Di Ma;Weichen Dai;Han Yang;Rui Fan;Guofeng Zhang;Wanzeng Kong
{"title":"SLC$^{2}$-SLAM:使用NeRF SLAM共享潜在代码的语义引导循环闭包","authors":"Yuhang Ming;Di Ma;Weichen Dai;Han Yang;Rui Fan;Guofeng Zhang;Wanzeng Kong","doi":"10.1109/LRA.2025.3553352","DOIUrl":null,"url":null,"abstract":"Targeting the notorious cumulative drift errors in NeRF SLAM, we propose a Semantic-guided Loop Closure using Shared Latent Code, dubbed SLC<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>-SLAM. We argue that latent codes stored in many NeRF SLAM systems are not fully exploited, as they are only used for better reconstruction. In this letter, we propose a simple yet effective way to detect potential loops using the same latent codes as local features. To further improve the loop detection performance, we use the semantic information, which are also decoded from the same latent codes to guide the aggregation of local features. Finally, with the potential loops detected, we close them with a graph optimization followed by bundle adjustment to refine both the estimated poses and the reconstructed scene. To evaluate the performance of our SLC<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>-SLAM, we conduct extensive experiments on Replica and ScanNet datasets. Our proposed semantic-guided loop closure significantly outperforms the pre-trained NetVLAD and ORB combined with Bag-of-Words, which are used in all the other NeRF SLAM with loop closure. As a result, our SLC<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>-SLAM also demonstrated better tracking and reconstruction performance, especially in larger scenes with more loops, like ScanNet.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"10 5","pages":"4978-4985"},"PeriodicalIF":4.6000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SLC$^{2}$-SLAM: Semantic-Guided Loop Closure Using Shared Latent Code for NeRF SLAM\",\"authors\":\"Yuhang Ming;Di Ma;Weichen Dai;Han Yang;Rui Fan;Guofeng Zhang;Wanzeng Kong\",\"doi\":\"10.1109/LRA.2025.3553352\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Targeting the notorious cumulative drift errors in NeRF SLAM, we propose a Semantic-guided Loop Closure using Shared Latent Code, dubbed SLC<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>-SLAM. We argue that latent codes stored in many NeRF SLAM systems are not fully exploited, as they are only used for better reconstruction. In this letter, we propose a simple yet effective way to detect potential loops using the same latent codes as local features. To further improve the loop detection performance, we use the semantic information, which are also decoded from the same latent codes to guide the aggregation of local features. Finally, with the potential loops detected, we close them with a graph optimization followed by bundle adjustment to refine both the estimated poses and the reconstructed scene. To evaluate the performance of our SLC<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>-SLAM, we conduct extensive experiments on Replica and ScanNet datasets. Our proposed semantic-guided loop closure significantly outperforms the pre-trained NetVLAD and ORB combined with Bag-of-Words, which are used in all the other NeRF SLAM with loop closure. As a result, our SLC<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>-SLAM also demonstrated better tracking and reconstruction performance, especially in larger scenes with more loops, like ScanNet.\",\"PeriodicalId\":13241,\"journal\":{\"name\":\"IEEE Robotics and Automation Letters\",\"volume\":\"10 5\",\"pages\":\"4978-4985\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Robotics and Automation Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10935649/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Robotics and Automation Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10935649/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0
摘要
针对 NeRF SLAM 中臭名昭著的累积漂移误差,我们提出了一种使用共享潜码的语义引导环路闭合(Semantic-guided Loop Closure)方法,称为 SLC$^{2}$-SLAM。我们认为,许多 NeRF SLAM 系统中存储的潜码并没有得到充分利用,因为它们只是用于更好地重建。在这封信中,我们提出了一种简单而有效的方法,利用相同的潜码作为局部特征来检测潜在的环路。为了进一步提高环路检测性能,我们使用了语义信息,这些信息也是从相同的潜码中解码出来的,用于指导局部特征的聚合。最后,在检测到潜在环路后,我们通过图优化将其关闭,然后进行捆绑调整,以完善估计姿势和重建场景。为了评估 SLC$^{2}$-SLAM 的性能,我们在 Replica 和 ScanNet 数据集上进行了大量实验。我们提出的以语义为导向的环路闭合大大优于预先训练的 NetVLAD 和结合词袋的 ORB,所有其他带有环路闭合的 NeRF SLAM 都使用了这两种方法。因此,我们的 SLC$^{2}$-SLAM 也表现出了更好的跟踪和重建性能,尤其是在 ScanNet 等具有更多回路的大型场景中。
SLC$^{2}$-SLAM: Semantic-Guided Loop Closure Using Shared Latent Code for NeRF SLAM
Targeting the notorious cumulative drift errors in NeRF SLAM, we propose a Semantic-guided Loop Closure using Shared Latent Code, dubbed SLC$^{2}$-SLAM. We argue that latent codes stored in many NeRF SLAM systems are not fully exploited, as they are only used for better reconstruction. In this letter, we propose a simple yet effective way to detect potential loops using the same latent codes as local features. To further improve the loop detection performance, we use the semantic information, which are also decoded from the same latent codes to guide the aggregation of local features. Finally, with the potential loops detected, we close them with a graph optimization followed by bundle adjustment to refine both the estimated poses and the reconstructed scene. To evaluate the performance of our SLC$^{2}$-SLAM, we conduct extensive experiments on Replica and ScanNet datasets. Our proposed semantic-guided loop closure significantly outperforms the pre-trained NetVLAD and ORB combined with Bag-of-Words, which are used in all the other NeRF SLAM with loop closure. As a result, our SLC$^{2}$-SLAM also demonstrated better tracking and reconstruction performance, especially in larger scenes with more loops, like ScanNet.
期刊介绍:
The scope of this journal is to publish peer-reviewed articles that provide a timely and concise account of innovative research ideas and application results, reporting significant theoretical findings and application case studies in areas of robotics and automation.