{"title":"用于视觉位置识别的Patch-to-Region图搜索","authors":"Weiliang Zuo, Liguo Liu, Yizhe Li, Yanqing Shen, Fuhua Xiang, Jingmin Xin, Nanning Zheng","doi":"10.1016/j.patcog.2025.111673","DOIUrl":null,"url":null,"abstract":"<div><div>Visual Place Recognition (VPR) is a task to estimate the target location based on visual information in changing scenarios, which usually uses a two-stage strategy of global retrieval and reranking. Existing reranking methods in VPR establish a single correspondence between the query image and the candidate images for reranking, which almost overlooks the neighbor correspondences in retrieved candidate images that can help to enhance reranking. In this paper, we propose a <strong>P</strong>atch-to-<strong>R</strong>egion <strong>G</strong>raph <strong>S</strong>earch (PRGS) method to enhance reranking using neighbor correspondences in candidate images. Firstly, considering that searching for neighbor correspondences relies on important features, we design a <strong>P</strong>atch-to-<strong>R</strong>egion (PR) module, which aggregates patch level features into region level features for highlighting important features. Secondly, to estimate the candidate image reranking score using the neighbor correspondences, we design a <strong>G</strong>raph <strong>S</strong>earch (GS) module, which establishes the neighbor correspondences among all candidates and query images in graph space. What is more, PRGS integrates well with both CNN and transformer backbone. We achieve competitive performance on several benchmarks, offering a 64% improvement in matching time and approximately 59% reduction in FLOPs compared to state-of-the-art methods. The code is released at <span><span>https://github.com/LKELN/PRGS</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"166 ","pages":"Article 111673"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PRGS: Patch-to-Region Graph Search for Visual Place Recognition\",\"authors\":\"Weiliang Zuo, Liguo Liu, Yizhe Li, Yanqing Shen, Fuhua Xiang, Jingmin Xin, Nanning Zheng\",\"doi\":\"10.1016/j.patcog.2025.111673\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Visual Place Recognition (VPR) is a task to estimate the target location based on visual information in changing scenarios, which usually uses a two-stage strategy of global retrieval and reranking. Existing reranking methods in VPR establish a single correspondence between the query image and the candidate images for reranking, which almost overlooks the neighbor correspondences in retrieved candidate images that can help to enhance reranking. In this paper, we propose a <strong>P</strong>atch-to-<strong>R</strong>egion <strong>G</strong>raph <strong>S</strong>earch (PRGS) method to enhance reranking using neighbor correspondences in candidate images. Firstly, considering that searching for neighbor correspondences relies on important features, we design a <strong>P</strong>atch-to-<strong>R</strong>egion (PR) module, which aggregates patch level features into region level features for highlighting important features. Secondly, to estimate the candidate image reranking score using the neighbor correspondences, we design a <strong>G</strong>raph <strong>S</strong>earch (GS) module, which establishes the neighbor correspondences among all candidates and query images in graph space. What is more, PRGS integrates well with both CNN and transformer backbone. We achieve competitive performance on several benchmarks, offering a 64% improvement in matching time and approximately 59% reduction in FLOPs compared to state-of-the-art methods. The code is released at <span><span>https://github.com/LKELN/PRGS</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"166 \",\"pages\":\"Article 111673\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325003334\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325003334","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
PRGS: Patch-to-Region Graph Search for Visual Place Recognition
Visual Place Recognition (VPR) is a task to estimate the target location based on visual information in changing scenarios, which usually uses a two-stage strategy of global retrieval and reranking. Existing reranking methods in VPR establish a single correspondence between the query image and the candidate images for reranking, which almost overlooks the neighbor correspondences in retrieved candidate images that can help to enhance reranking. In this paper, we propose a Patch-to-Region Graph Search (PRGS) method to enhance reranking using neighbor correspondences in candidate images. Firstly, considering that searching for neighbor correspondences relies on important features, we design a Patch-to-Region (PR) module, which aggregates patch level features into region level features for highlighting important features. Secondly, to estimate the candidate image reranking score using the neighbor correspondences, we design a Graph Search (GS) module, which establishes the neighbor correspondences among all candidates and query images in graph space. What is more, PRGS integrates well with both CNN and transformer backbone. We achieve competitive performance on several benchmarks, offering a 64% improvement in matching time and approximately 59% reduction in FLOPs compared to state-of-the-art methods. The code is released at https://github.com/LKELN/PRGS.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.