基于测地线的多尺度地空地理定位图像匹配网络

A. A. Rasna, C. Mohan
{"title":"基于测地线的多尺度地空地理定位图像匹配网络","authors":"A. A. Rasna, C. Mohan","doi":"10.1109/AERO55745.2023.10115935","DOIUrl":null,"url":null,"abstract":"Airport surveillance activities using remote sensing images are challenging due to object variations largely affecting the geo-localization and object detection/segmentation tasks. Furthermore, the problem of localization is even larger due to scale variations. Traditionally image-based geo-referencing is accomplished by superimposing ground positioning system (GPS) location to the queried image. It is also observed both the query and the geo-tagged reference images are taken from the same ground view or aerial height in the case of remote sensing images. In our research, we intend to revisit the scale effect on object variability, by introducing the concept of geodesic representations along with image-matching networks. The architecture pipeline introduces a data processing layer wherein objects are geo-referenced to generate the metadata information. This metadata consists of three-dimensional data including the orientation information of the object. A regression task is added to the training set which leverages the metadata information. We use the gradient weighted class activation maps (Grad-CAM) to generate the activation maps and selection based on high threshold values for the pixel. The orientations and the locations are further calculated using the geodesic representations. The baseline architecture for local feature extraction uses a simple Siamese network with a ResNet backbone network. A NetVLAD layer is used to generate the global features. We also introduce a Geospatial attention network (GsAN) to aid in enhanced localization of objects. The dataset used for experiments consisted of CVUSA and our custom dataset providing airport runway views for different scales and arbitrary orientations. The performance evaluations focused on recall as a retrieval metric and comparing various loss functions. The performance metrics indicate a higher accuracy rate.","PeriodicalId":344285,"journal":{"name":"2023 IEEE Aerospace Conference","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Geodesic Based Image Matching Network for the Multi-scale Ground to Aerial Geo-localization\",\"authors\":\"A. A. Rasna, C. Mohan\",\"doi\":\"10.1109/AERO55745.2023.10115935\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Airport surveillance activities using remote sensing images are challenging due to object variations largely affecting the geo-localization and object detection/segmentation tasks. Furthermore, the problem of localization is even larger due to scale variations. Traditionally image-based geo-referencing is accomplished by superimposing ground positioning system (GPS) location to the queried image. It is also observed both the query and the geo-tagged reference images are taken from the same ground view or aerial height in the case of remote sensing images. In our research, we intend to revisit the scale effect on object variability, by introducing the concept of geodesic representations along with image-matching networks. The architecture pipeline introduces a data processing layer wherein objects are geo-referenced to generate the metadata information. This metadata consists of three-dimensional data including the orientation information of the object. A regression task is added to the training set which leverages the metadata information. We use the gradient weighted class activation maps (Grad-CAM) to generate the activation maps and selection based on high threshold values for the pixel. The orientations and the locations are further calculated using the geodesic representations. The baseline architecture for local feature extraction uses a simple Siamese network with a ResNet backbone network. A NetVLAD layer is used to generate the global features. We also introduce a Geospatial attention network (GsAN) to aid in enhanced localization of objects. The dataset used for experiments consisted of CVUSA and our custom dataset providing airport runway views for different scales and arbitrary orientations. The performance evaluations focused on recall as a retrieval metric and comparing various loss functions. The performance metrics indicate a higher accuracy rate.\",\"PeriodicalId\":344285,\"journal\":{\"name\":\"2023 IEEE Aerospace Conference\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE Aerospace Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AERO55745.2023.10115935\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Aerospace Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AERO55745.2023.10115935","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

使用遥感图像的机场监视活动具有挑战性,因为物体的变化在很大程度上影响了地理定位和物体检测/分割任务。此外,由于规模的变化,本地化问题甚至更大。传统的基于图像的地理参考是通过将地面定位系统(GPS)的位置叠加到查询的图像中来完成的。还观察到,查询和地理标记参考图像在遥感图像的情况下都是从相同的地面视图或空中高度拍摄的。在我们的研究中,我们打算通过引入测地线表示和图像匹配网络的概念来重新审视物体可变性的尺度效应。体系结构管道引入了一个数据处理层,其中对象被地理引用以生成元数据信息。该元数据由三维数据组成,包括对象的方向信息。将回归任务添加到利用元数据信息的训练集中。我们使用梯度加权类激活图(Grad-CAM)来生成基于高阈值像素的激活图和选择。使用测地线表示进一步计算方向和位置。本地特征提取的基线架构使用一个简单的带有ResNet骨干网的Siamese网络。NetVLAD层用于生成全局特征。我们还引入了一个地理空间注意力网络(GsAN)来帮助增强物体的定位。用于实验的数据集包括CVUSA和我们的自定义数据集,提供不同规模和任意方向的机场跑道视图。性能评估侧重于召回作为检索度量和比较各种损失函数。性能指标表明准确率更高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Geodesic Based Image Matching Network for the Multi-scale Ground to Aerial Geo-localization
Airport surveillance activities using remote sensing images are challenging due to object variations largely affecting the geo-localization and object detection/segmentation tasks. Furthermore, the problem of localization is even larger due to scale variations. Traditionally image-based geo-referencing is accomplished by superimposing ground positioning system (GPS) location to the queried image. It is also observed both the query and the geo-tagged reference images are taken from the same ground view or aerial height in the case of remote sensing images. In our research, we intend to revisit the scale effect on object variability, by introducing the concept of geodesic representations along with image-matching networks. The architecture pipeline introduces a data processing layer wherein objects are geo-referenced to generate the metadata information. This metadata consists of three-dimensional data including the orientation information of the object. A regression task is added to the training set which leverages the metadata information. We use the gradient weighted class activation maps (Grad-CAM) to generate the activation maps and selection based on high threshold values for the pixel. The orientations and the locations are further calculated using the geodesic representations. The baseline architecture for local feature extraction uses a simple Siamese network with a ResNet backbone network. A NetVLAD layer is used to generate the global features. We also introduce a Geospatial attention network (GsAN) to aid in enhanced localization of objects. The dataset used for experiments consisted of CVUSA and our custom dataset providing airport runway views for different scales and arbitrary orientations. The performance evaluations focused on recall as a retrieval metric and comparing various loss functions. The performance metrics indicate a higher accuracy rate.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信