Zongbao Liang, Xing Liu, Bo Chen, YunFei Yuan, Yang Song, Haifei Jiang
{"title":"UAV-based cross-view geo-localization fusion spatial attention mechanism and Netvlad","authors":"Zongbao Liang, Xing Liu, Bo Chen, YunFei Yuan, Yang Song, Haifei Jiang","doi":"10.1109/icsai53574.2021.9664015","DOIUrl":null,"url":null,"abstract":"The purpose of cross-view image geo-localization is to retrieve the same geographic target from images with different views acquired from different platforms. Facing the great differences in the appearance of images from different viewpoints, we propose an image retrieval method (spatial-Netvlad-siamese net, SNSnet) that fuses the trainable local aggregation descriptor vector (Netvlad) and the spatial attention mechanism. SNSnet can simultaneously process two different viewpoint images, and achieve image retrieval and matching by increasing the distance between mismatched image pairs and decreasing the distance between matched image pairs. We conducted experiments on mutual retrieval between satellite viewpoint images and UAV viewpoint images and achieved good results.","PeriodicalId":131284,"journal":{"name":"2021 7th International Conference on Systems and Informatics (ICSAI)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icsai53574.2021.9664015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The purpose of cross-view image geo-localization is to retrieve the same geographic target from images with different views acquired from different platforms. Facing the great differences in the appearance of images from different viewpoints, we propose an image retrieval method (spatial-Netvlad-siamese net, SNSnet) that fuses the trainable local aggregation descriptor vector (Netvlad) and the spatial attention mechanism. SNSnet can simultaneously process two different viewpoint images, and achieve image retrieval and matching by increasing the distance between mismatched image pairs and decreasing the distance between matched image pairs. We conducted experiments on mutual retrieval between satellite viewpoint images and UAV viewpoint images and achieved good results.