Deben Lu , Wendong Xiao , Teng Ran , Liang Yuan , Jingchuan Wang
{"title":"GND-APR:基于图神经扩散的自动驾驶绝对姿态回归器","authors":"Deben Lu , Wendong Xiao , Teng Ran , Liang Yuan , Jingchuan Wang","doi":"10.1016/j.patrec.2025.05.009","DOIUrl":null,"url":null,"abstract":"<div><div>Visual relocalization consists of estimating the camera translation and rotation in known scenarios. However, the lack of robust features occurs in pose regressors under self-driving environments containing perturbations (such as weather changes, seasons, illumination, and dynamic objects), leading to inaccurate poses. In this paper, we propose GND-APR to cope with the aforementioned issue via <strong>G</strong>raph dynamic attention <strong>N</strong>eural <strong>D</strong>iffusion and shared memory unit. Specifically, we propose graph dynamic attention neural diffusion for feature map and vector interaction, which graph dynamic attention facilitates better information interaction between feature correlations and enhances feature representation. Meanwhile, we enhance temporal feature fusion through a shared memory unit, it allows the network to record essential and robust features as an implicit defense against perturbations. Our model decreases the mean translation and rotation errors by 54% and 15%, respectively, on the 4Seasons dataset. Experiments on two challenging self-driving datasets demonstrate the superiority of our approach over other state-of-the-art methods.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"196 ","pages":"Pages 93-99"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GND-APR: Absolute pose regressor with graph neural diffusion for self-driving\",\"authors\":\"Deben Lu , Wendong Xiao , Teng Ran , Liang Yuan , Jingchuan Wang\",\"doi\":\"10.1016/j.patrec.2025.05.009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Visual relocalization consists of estimating the camera translation and rotation in known scenarios. However, the lack of robust features occurs in pose regressors under self-driving environments containing perturbations (such as weather changes, seasons, illumination, and dynamic objects), leading to inaccurate poses. In this paper, we propose GND-APR to cope with the aforementioned issue via <strong>G</strong>raph dynamic attention <strong>N</strong>eural <strong>D</strong>iffusion and shared memory unit. Specifically, we propose graph dynamic attention neural diffusion for feature map and vector interaction, which graph dynamic attention facilitates better information interaction between feature correlations and enhances feature representation. Meanwhile, we enhance temporal feature fusion through a shared memory unit, it allows the network to record essential and robust features as an implicit defense against perturbations. Our model decreases the mean translation and rotation errors by 54% and 15%, respectively, on the 4Seasons dataset. Experiments on two challenging self-driving datasets demonstrate the superiority of our approach over other state-of-the-art methods.</div></div>\",\"PeriodicalId\":54638,\"journal\":{\"name\":\"Pattern Recognition Letters\",\"volume\":\"196 \",\"pages\":\"Pages 93-99\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167865525001989\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525001989","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
GND-APR: Absolute pose regressor with graph neural diffusion for self-driving
Visual relocalization consists of estimating the camera translation and rotation in known scenarios. However, the lack of robust features occurs in pose regressors under self-driving environments containing perturbations (such as weather changes, seasons, illumination, and dynamic objects), leading to inaccurate poses. In this paper, we propose GND-APR to cope with the aforementioned issue via Graph dynamic attention Neural Diffusion and shared memory unit. Specifically, we propose graph dynamic attention neural diffusion for feature map and vector interaction, which graph dynamic attention facilitates better information interaction between feature correlations and enhances feature representation. Meanwhile, we enhance temporal feature fusion through a shared memory unit, it allows the network to record essential and robust features as an implicit defense against perturbations. Our model decreases the mean translation and rotation errors by 54% and 15%, respectively, on the 4Seasons dataset. Experiments on two challenging self-driving datasets demonstrate the superiority of our approach over other state-of-the-art methods.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.