Jiwei Nie;Dingyu Xue;Feng Pan;Shuai Cheng;Wei Liu;Jun Hu;Zuotao Ning
{"title":"MIXVPR++:利用层次区域特征混合器和自适应 Gabor 纹理融合器增强视觉场所识别能力","authors":"Jiwei Nie;Dingyu Xue;Feng Pan;Shuai Cheng;Wei Liu;Jun Hu;Zuotao Ning","doi":"10.1109/LRA.2024.3511416","DOIUrl":null,"url":null,"abstract":"Visual Place Recognition (VPR) is crucial for various computer vision and robotics applications. Traditional VPR techniques relying on handcrafted features, have been enhanced by using Convolutional Neural Networks (CNNs). Recently, MixVPR has set new benchmarks in VPR by using advanced feature aggregation techniques. However, MixVPR's full-image feature mixing approach can lead to the ignoring of critical local detail information and regional saliency information in large-scale images. To overcome this, we propose MIXVPR++, which integrates an Adaptive Gabor Texture Fuser with a Learnable Gabor Filter for enriching semantic context with texture details information and a Hierarchical-Region Feature-Mixer for better spatial hierarchy capture regional saliency information, thereby enhancing robustness and accuracy. Extensive experiments demonstrate that MIXVPR++ outperforms state-of-the-art methods across most challenging benchmarks. Despite its impressive performance, MIXVPR++ shows limitations in handling severe viewpoint changes, indicating an area for future improvement.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"10 1","pages":"580-587"},"PeriodicalIF":4.6000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MIXVPR++: Enhanced Visual Place Recognition With Hierarchical-Region Feature-Mixer and Adaptive Gabor Texture Fuser\",\"authors\":\"Jiwei Nie;Dingyu Xue;Feng Pan;Shuai Cheng;Wei Liu;Jun Hu;Zuotao Ning\",\"doi\":\"10.1109/LRA.2024.3511416\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Visual Place Recognition (VPR) is crucial for various computer vision and robotics applications. Traditional VPR techniques relying on handcrafted features, have been enhanced by using Convolutional Neural Networks (CNNs). Recently, MixVPR has set new benchmarks in VPR by using advanced feature aggregation techniques. However, MixVPR's full-image feature mixing approach can lead to the ignoring of critical local detail information and regional saliency information in large-scale images. To overcome this, we propose MIXVPR++, which integrates an Adaptive Gabor Texture Fuser with a Learnable Gabor Filter for enriching semantic context with texture details information and a Hierarchical-Region Feature-Mixer for better spatial hierarchy capture regional saliency information, thereby enhancing robustness and accuracy. Extensive experiments demonstrate that MIXVPR++ outperforms state-of-the-art methods across most challenging benchmarks. Despite its impressive performance, MIXVPR++ shows limitations in handling severe viewpoint changes, indicating an area for future improvement.\",\"PeriodicalId\":13241,\"journal\":{\"name\":\"IEEE Robotics and Automation Letters\",\"volume\":\"10 1\",\"pages\":\"580-587\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Robotics and Automation Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10777538/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Robotics and Automation Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10777538/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
MIXVPR++: Enhanced Visual Place Recognition With Hierarchical-Region Feature-Mixer and Adaptive Gabor Texture Fuser
Visual Place Recognition (VPR) is crucial for various computer vision and robotics applications. Traditional VPR techniques relying on handcrafted features, have been enhanced by using Convolutional Neural Networks (CNNs). Recently, MixVPR has set new benchmarks in VPR by using advanced feature aggregation techniques. However, MixVPR's full-image feature mixing approach can lead to the ignoring of critical local detail information and regional saliency information in large-scale images. To overcome this, we propose MIXVPR++, which integrates an Adaptive Gabor Texture Fuser with a Learnable Gabor Filter for enriching semantic context with texture details information and a Hierarchical-Region Feature-Mixer for better spatial hierarchy capture regional saliency information, thereby enhancing robustness and accuracy. Extensive experiments demonstrate that MIXVPR++ outperforms state-of-the-art methods across most challenging benchmarks. Despite its impressive performance, MIXVPR++ shows limitations in handling severe viewpoint changes, indicating an area for future improvement.
期刊介绍:
The scope of this journal is to publish peer-reviewed articles that provide a timely and concise account of innovative research ideas and application results, reporting significant theoretical findings and application case studies in areas of robotics and automation.