基于Top-K优化和语义描述符的SLAM算法改进。

IF 3.9 2区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Scientific Reports Pub Date : 2025-03-10 DOI:10.1038/s41598-025-90968-3

Yang Jiang, Yao Wu, Bin Zhao

{"title":"基于Top-K优化和语义描述符的SLAM算法改进。","authors":"Yang Jiang, Yao Wu, Bin Zhao","doi":"10.1038/s41598-025-90968-3","DOIUrl":null,"url":null,"abstract":"To address the computational challenges faced by edge devices using deep learning to process LiDAR point cloud data, this paper proposes a SLAM algorithm incorporating Top-K optimization to generate semantic descriptors and global semantic map for laser data efficiently. This approach aims to reduce computational complexity while enhancing processing speed. The algorithm extracts semantic information from LiDAR data, constructs two-dimensional semantic descriptors, and improves the robot's semantic understanding of its surrounding environment. In the loop closure detection phase, the algorithm identifies loop candidates by calculating the geometric and semantic similarities of the descriptors. It utilizes front-end odometry to stitch together subgraphs from these loop candidates, thereby detecting true loop closures. Finally, true loop closures add constraints in the factor graph, facilitating pose optimization. Experimental results show that this descriptor can match more loop closures without affecting accuracy. The algorithm enhances the pose estimation accuracy of the robot and generates global point cloud maps rich in semantic information. Under the influence of the Top-K strategy, the average inference time is reduced by 10.7%, and the memory usage decreases by 19.5% compared with before in the Network Inference module. This Top-K strategy significantly conserves computational resources for optimizing edge-device deep learning algorithms, particularly when processing LiDAR point cloud data. Additionally, it effectively reduces the computational load in practical applications while maintaining inference accuracy and efficiency.","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"8280"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11894159/pdf/","citationCount":"0","resultStr":"{\"title\":\"Enhancing SLAM algorithm with Top-K optimization and semantic descriptors.\",\"authors\":\"Yang Jiang, Yao Wu, Bin Zhao\",\"doi\":\"10.1038/s41598-025-90968-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To address the computational challenges faced by edge devices using deep learning to process LiDAR point cloud data, this paper proposes a SLAM algorithm incorporating Top-K optimization to generate semantic descriptors and global semantic map for laser data efficiently. This approach aims to reduce computational complexity while enhancing processing speed. The algorithm extracts semantic information from LiDAR data, constructs two-dimensional semantic descriptors, and improves the robot's semantic understanding of its surrounding environment. In the loop closure detection phase, the algorithm identifies loop candidates by calculating the geometric and semantic similarities of the descriptors. It utilizes front-end odometry to stitch together subgraphs from these loop candidates, thereby detecting true loop closures. Finally, true loop closures add constraints in the factor graph, facilitating pose optimization. Experimental results show that this descriptor can match more loop closures without affecting accuracy. The algorithm enhances the pose estimation accuracy of the robot and generates global point cloud maps rich in semantic information. Under the influence of the Top-K strategy, the average inference time is reduced by 10.7%, and the memory usage decreases by 19.5% compared with before in the Network Inference module. This Top-K strategy significantly conserves computational resources for optimizing edge-device deep learning algorithms, particularly when processing LiDAR point cloud data. Additionally, it effectively reduces the computational load in practical applications while maintaining inference accuracy and efficiency.\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"8280\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11894159/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-90968-3\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-90968-3","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

为了解决边缘设备使用深度学习处理LiDAR点云数据所面临的计算挑战，本文提出了一种结合Top-K优化的SLAM算法，用于高效地生成激光数据的语义描述符和全局语义图。该方法旨在降低计算复杂度，同时提高处理速度。该算法从激光雷达数据中提取语义信息，构建二维语义描述符，提高机器人对周围环境的语义理解。在循环闭合检测阶段，该算法通过计算描述符的几何和语义相似性来识别候选循环。它利用前端里程计将这些候选环路的子图拼接在一起，从而检测真正的环路闭合。最后，真循环闭包在因子图中添加了约束，便于姿态优化。实验结果表明，该描述符可以在不影响精度的情况下匹配更多的循环闭包。该算法提高了机器人姿态估计的精度，生成了具有丰富语义信息的全局点云图。在Top-K策略的影响下，网络推理模块的平均推理时间比以前减少了10.7%，内存占用比以前减少了19.5%。这种Top-K策略大大节省了优化边缘设备深度学习算法的计算资源，特别是在处理激光雷达点云数据时。在实际应用中有效地降低了计算量，同时保持了推理的准确性和效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Enhancing SLAM algorithm with Top-K optimization and semantic descriptors.

To address the computational challenges faced by edge devices using deep learning to process LiDAR point cloud data, this paper proposes a SLAM algorithm incorporating Top-K optimization to generate semantic descriptors and global semantic map for laser data efficiently. This approach aims to reduce computational complexity while enhancing processing speed. The algorithm extracts semantic information from LiDAR data, constructs two-dimensional semantic descriptors, and improves the robot's semantic understanding of its surrounding environment. In the loop closure detection phase, the algorithm identifies loop candidates by calculating the geometric and semantic similarities of the descriptors. It utilizes front-end odometry to stitch together subgraphs from these loop candidates, thereby detecting true loop closures. Finally, true loop closures add constraints in the factor graph, facilitating pose optimization. Experimental results show that this descriptor can match more loop closures without affecting accuracy. The algorithm enhances the pose estimation accuracy of the robot and generates global point cloud maps rich in semantic information. Under the influence of the Top-K strategy, the average inference time is reduced by 10.7%, and the memory usage decreases by 19.5% compared with before in the Network Inference module. This Top-K strategy significantly conserves computational resources for optimizing edge-device deep learning algorithms, particularly when processing LiDAR point cloud data. Additionally, it effectively reduces the computational load in practical applications while maintaining inference accuracy and efficiency.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Scientific Reports Natural Science Disciplines-

CiteScore

7.50

自引率

4.30%

发文量

19567

审稿时长

3.9 months

期刊介绍： We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.