使用检索-触发-重新排序范式增强激光雷达在具有挑战性的环境中的位置识别

IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL
Zhenghua Zhang , Zhihua Xu , Hu Liu , Xuan Wang , Qipeng Li , Xiaoxiang Cao , Guoliang Chen
{"title":"使用检索-触发-重新排序范式增强激光雷达在具有挑战性的环境中的位置识别","authors":"Zhenghua Zhang ,&nbsp;Zhihua Xu ,&nbsp;Hu Liu ,&nbsp;Xuan Wang ,&nbsp;Qipeng Li ,&nbsp;Xiaoxiang Cao ,&nbsp;Guoliang Chen","doi":"10.1016/j.isprsjprs.2025.07.036","DOIUrl":null,"url":null,"abstract":"<div><div>LiDAR place recognition (LPR) plays a critical role in simultaneous localization and mapping (SLAM) and autonomous driving systems. However, the common separation between offline database construction and online localization introduces challenges such as rotational variance, sensor discrepancies, and long-term environmental changes. Existing methods relying on fixed-length global descriptors often struggle in such scenarios due to their inherently limited capacity to encode comprehensive environmental information. To address these challenges, we propose RTR-Net, a novel framework based on Retrieval-Trigger-Reranking paradigm, to enhance LPR performance in challenging environments. The framework operates in three phases: (1) Retrieval, where a lightweight backbone generates global descriptors and local regional features for initial candidate selection; (2) Trigger, a training-free module that assesses spatial consistency between query and candidates to activate reranking only when necessary; and (3) Reranking, which refines rankings by fusing local features, global descriptors, and spatial consistency scores via spatial and channel attention mechanisms. Additionally, a regional sampling method is proposed to mitigate field-of-view (FoV) discrepancies across heterogeneous LiDAR sensors. Comprehensive evaluations on four large-scale datasets (Oxford RobotCar, NUS Inhouse, HeLiPR, MulRan) demonstrate that RTR-Net not only achieves state-of-the-art results but also stands out as a versatile, plug-and-play module. It is compatible with existing LPR methods—whether region-based or sparse voxelization-based—enhancing their localization accuracy in challenging conditions without requiring structural modifications or retraining. Further experiments on heterogeneous LPR and long-term environmental variations validate RTR-Net’s robustness, achieving leading performance across sensor types and temporal shifts. The proposed regional sampling method effectively alleviates FoV disparities, demonstrating broad applicability within current LPR frameworks.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"228 ","pages":"Pages 613-629"},"PeriodicalIF":12.2000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing LiDAR place recognition in challenging environments using Retrieval-Trigger-Reranking paradigm\",\"authors\":\"Zhenghua Zhang ,&nbsp;Zhihua Xu ,&nbsp;Hu Liu ,&nbsp;Xuan Wang ,&nbsp;Qipeng Li ,&nbsp;Xiaoxiang Cao ,&nbsp;Guoliang Chen\",\"doi\":\"10.1016/j.isprsjprs.2025.07.036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>LiDAR place recognition (LPR) plays a critical role in simultaneous localization and mapping (SLAM) and autonomous driving systems. However, the common separation between offline database construction and online localization introduces challenges such as rotational variance, sensor discrepancies, and long-term environmental changes. Existing methods relying on fixed-length global descriptors often struggle in such scenarios due to their inherently limited capacity to encode comprehensive environmental information. To address these challenges, we propose RTR-Net, a novel framework based on Retrieval-Trigger-Reranking paradigm, to enhance LPR performance in challenging environments. The framework operates in three phases: (1) Retrieval, where a lightweight backbone generates global descriptors and local regional features for initial candidate selection; (2) Trigger, a training-free module that assesses spatial consistency between query and candidates to activate reranking only when necessary; and (3) Reranking, which refines rankings by fusing local features, global descriptors, and spatial consistency scores via spatial and channel attention mechanisms. Additionally, a regional sampling method is proposed to mitigate field-of-view (FoV) discrepancies across heterogeneous LiDAR sensors. Comprehensive evaluations on four large-scale datasets (Oxford RobotCar, NUS Inhouse, HeLiPR, MulRan) demonstrate that RTR-Net not only achieves state-of-the-art results but also stands out as a versatile, plug-and-play module. It is compatible with existing LPR methods—whether region-based or sparse voxelization-based—enhancing their localization accuracy in challenging conditions without requiring structural modifications or retraining. Further experiments on heterogeneous LPR and long-term environmental variations validate RTR-Net’s robustness, achieving leading performance across sensor types and temporal shifts. The proposed regional sampling method effectively alleviates FoV disparities, demonstrating broad applicability within current LPR frameworks.</div></div>\",\"PeriodicalId\":50269,\"journal\":{\"name\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"volume\":\"228 \",\"pages\":\"Pages 613-629\"},\"PeriodicalIF\":12.2000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0924271625003065\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOGRAPHY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625003065","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

摘要

激光雷达位置识别(LPR)在同步定位和地图绘制(SLAM)和自动驾驶系统中起着至关重要的作用。然而,离线数据库构建和在线本地化之间的常见分离引入了诸如旋转方差、传感器差异和长期环境变化等挑战。现有的方法依赖于固定长度的全局描述符,由于其编码综合环境信息的能力固有的局限性,常常在这种情况下挣扎。为了应对这些挑战,我们提出了RTR-Net,一个基于检索-触发-重新排序范式的新框架,以提高LPR在具有挑战性的环境中的性能。该框架分为三个阶段:(1)检索,其中轻量级主干生成全局描述符和局部区域特征,用于初始候选选择;(2)触发器,一个无需培训的模块,评估查询和候选之间的空间一致性,仅在必要时激活重新排序;(3)重新排序,通过空间和通道注意机制融合局部特征、全局描述符和空间一致性评分来改进排名。此外,提出了一种区域采样方法来缓解异构激光雷达传感器之间的视场差异。对四个大型数据集(Oxford RobotCar, NUS Inhouse, HeLiPR, MulRan)的综合评估表明,RTR-Net不仅达到了最先进的结果,而且作为一个通用的即插即用模块脱颖而出。它与现有的LPR方法兼容——无论是基于区域的还是基于稀疏体化的——在具有挑战性的条件下提高定位精度,而不需要结构修改或再培训。对异构LPR和长期环境变化的进一步实验验证了RTR-Net的鲁棒性,在传感器类型和时间变化方面取得了领先的性能。提出的区域采样方法有效地缓解了视场差异,在现有的LPR框架中具有广泛的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Enhancing LiDAR place recognition in challenging environments using Retrieval-Trigger-Reranking paradigm
LiDAR place recognition (LPR) plays a critical role in simultaneous localization and mapping (SLAM) and autonomous driving systems. However, the common separation between offline database construction and online localization introduces challenges such as rotational variance, sensor discrepancies, and long-term environmental changes. Existing methods relying on fixed-length global descriptors often struggle in such scenarios due to their inherently limited capacity to encode comprehensive environmental information. To address these challenges, we propose RTR-Net, a novel framework based on Retrieval-Trigger-Reranking paradigm, to enhance LPR performance in challenging environments. The framework operates in three phases: (1) Retrieval, where a lightweight backbone generates global descriptors and local regional features for initial candidate selection; (2) Trigger, a training-free module that assesses spatial consistency between query and candidates to activate reranking only when necessary; and (3) Reranking, which refines rankings by fusing local features, global descriptors, and spatial consistency scores via spatial and channel attention mechanisms. Additionally, a regional sampling method is proposed to mitigate field-of-view (FoV) discrepancies across heterogeneous LiDAR sensors. Comprehensive evaluations on four large-scale datasets (Oxford RobotCar, NUS Inhouse, HeLiPR, MulRan) demonstrate that RTR-Net not only achieves state-of-the-art results but also stands out as a versatile, plug-and-play module. It is compatible with existing LPR methods—whether region-based or sparse voxelization-based—enhancing their localization accuracy in challenging conditions without requiring structural modifications or retraining. Further experiments on heterogeneous LPR and long-term environmental variations validate RTR-Net’s robustness, achieving leading performance across sensor types and temporal shifts. The proposed regional sampling method effectively alleviates FoV disparities, demonstrating broad applicability within current LPR frameworks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ISPRS Journal of Photogrammetry and Remote Sensing
ISPRS Journal of Photogrammetry and Remote Sensing 工程技术-成像科学与照相技术
CiteScore
21.00
自引率
6.30%
发文量
273
审稿时长
40 days
期刊介绍: The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive. P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields. In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信