Zongbao Liang, Xing Liu, Bo Chen, YunFei Yuan, Yang Song, Haifei Jiang
{"title":"基于无人机的交叉视角地理定位融合空间注意机制与Netvlad","authors":"Zongbao Liang, Xing Liu, Bo Chen, YunFei Yuan, Yang Song, Haifei Jiang","doi":"10.1109/icsai53574.2021.9664015","DOIUrl":null,"url":null,"abstract":"The purpose of cross-view image geo-localization is to retrieve the same geographic target from images with different views acquired from different platforms. Facing the great differences in the appearance of images from different viewpoints, we propose an image retrieval method (spatial-Netvlad-siamese net, SNSnet) that fuses the trainable local aggregation descriptor vector (Netvlad) and the spatial attention mechanism. SNSnet can simultaneously process two different viewpoint images, and achieve image retrieval and matching by increasing the distance between mismatched image pairs and decreasing the distance between matched image pairs. We conducted experiments on mutual retrieval between satellite viewpoint images and UAV viewpoint images and achieved good results.","PeriodicalId":131284,"journal":{"name":"2021 7th International Conference on Systems and Informatics (ICSAI)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UAV-based cross-view geo-localization fusion spatial attention mechanism and Netvlad\",\"authors\":\"Zongbao Liang, Xing Liu, Bo Chen, YunFei Yuan, Yang Song, Haifei Jiang\",\"doi\":\"10.1109/icsai53574.2021.9664015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The purpose of cross-view image geo-localization is to retrieve the same geographic target from images with different views acquired from different platforms. Facing the great differences in the appearance of images from different viewpoints, we propose an image retrieval method (spatial-Netvlad-siamese net, SNSnet) that fuses the trainable local aggregation descriptor vector (Netvlad) and the spatial attention mechanism. SNSnet can simultaneously process two different viewpoint images, and achieve image retrieval and matching by increasing the distance between mismatched image pairs and decreasing the distance between matched image pairs. We conducted experiments on mutual retrieval between satellite viewpoint images and UAV viewpoint images and achieved good results.\",\"PeriodicalId\":131284,\"journal\":{\"name\":\"2021 7th International Conference on Systems and Informatics (ICSAI)\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th International Conference on Systems and Informatics (ICSAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icsai53574.2021.9664015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icsai53574.2021.9664015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
UAV-based cross-view geo-localization fusion spatial attention mechanism and Netvlad
The purpose of cross-view image geo-localization is to retrieve the same geographic target from images with different views acquired from different platforms. Facing the great differences in the appearance of images from different viewpoints, we propose an image retrieval method (spatial-Netvlad-siamese net, SNSnet) that fuses the trainable local aggregation descriptor vector (Netvlad) and the spatial attention mechanism. SNSnet can simultaneously process two different viewpoint images, and achieve image retrieval and matching by increasing the distance between mismatched image pairs and decreasing the distance between matched image pairs. We conducted experiments on mutual retrieval between satellite viewpoint images and UAV viewpoint images and achieved good results.