基于多尺度注意力自适应网络的遥感图像目标检测

Qixi Tan, W. Xie, Haojin Tang, Yanshan Li
{"title":"基于多尺度注意力自适应网络的遥感图像目标检测","authors":"Qixi Tan, W. Xie, Haojin Tang, Yanshan Li","doi":"10.1109/ICICSP55539.2022.10050627","DOIUrl":null,"url":null,"abstract":"Remote sensing images (RSI) have a large range of variations in the aspect of inter- and intra-class size variability across objects. As a key technology in the field of RSI processing, RSI object detection has been widely applied. Multilevel features fusion network is commonly used to improve the performance of object detection. However, the existing multilevel feature fusion networks for RSI lack the ability to combine global information. Aiming at this problem, A multi-scale attention adaptive network (MA2Net) is proposed to object detection in RSI. The main contributions of this paper are twofold. Firstly, a multi-scale attention adaptive network is designed to adaptively integrate the multilevel features. This network is composed of integrating (IG) block, channel self-attention (CS) block, and adaptive fusion (AF) block. Specifically, IG is designed to transform the multi-level features into an intermediate size. The CS block is an embedded gaussian self-attention module used to model the relationship between the feature channels. AF is developed to learn the multilevel expression of self-attention features to obtain multi-scale feature maps. Secondly, to achieve a balance between multi-task and higher accuracy, a feature align head is utilized to correctly locate and classify objects. The experimental results on DIOR show that our network can achieve higher detection accuracy than the state-of-the-art RSI object detector.","PeriodicalId":281095,"journal":{"name":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-scale Attention Adaptive Network for Object Detection in Remote Sensing Images\",\"authors\":\"Qixi Tan, W. Xie, Haojin Tang, Yanshan Li\",\"doi\":\"10.1109/ICICSP55539.2022.10050627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Remote sensing images (RSI) have a large range of variations in the aspect of inter- and intra-class size variability across objects. As a key technology in the field of RSI processing, RSI object detection has been widely applied. Multilevel features fusion network is commonly used to improve the performance of object detection. However, the existing multilevel feature fusion networks for RSI lack the ability to combine global information. Aiming at this problem, A multi-scale attention adaptive network (MA2Net) is proposed to object detection in RSI. The main contributions of this paper are twofold. Firstly, a multi-scale attention adaptive network is designed to adaptively integrate the multilevel features. This network is composed of integrating (IG) block, channel self-attention (CS) block, and adaptive fusion (AF) block. Specifically, IG is designed to transform the multi-level features into an intermediate size. The CS block is an embedded gaussian self-attention module used to model the relationship between the feature channels. AF is developed to learn the multilevel expression of self-attention features to obtain multi-scale feature maps. Secondly, to achieve a balance between multi-task and higher accuracy, a feature align head is utilized to correctly locate and classify objects. The experimental results on DIOR show that our network can achieve higher detection accuracy than the state-of-the-art RSI object detector.\",\"PeriodicalId\":281095,\"journal\":{\"name\":\"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)\",\"volume\":\"84 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICSP55539.2022.10050627\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICSP55539.2022.10050627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

遥感图像(RSI)在不同对象的类间和类内大小变异性方面具有很大的变化范围。作为RSI处理领域的一项关键技术,RSI目标检测得到了广泛的应用。多层次特征融合网络是提高目标检测性能的常用方法。然而,现有的RSI多层特征融合网络缺乏整合全局信息的能力。针对这一问题,提出了一种多尺度注意力自适应网络(MA2Net)用于RSI中的目标检测。本文的主要贡献有两个方面。首先,设计了一个多尺度注意力自适应网络,自适应整合多层次特征;该网络由集成(IG)块、信道自关注(CS)块和自适应融合(AF)块组成。具体来说,IG旨在将多层次特征转换为中等大小。CS块是一个嵌入式高斯自注意模块,用于对特征通道之间的关系进行建模。为了学习自注意特征的多层次表达,获得多尺度特征图,开发了自动识别。其次,为了在多任务和更高精度之间取得平衡,利用特征对齐头对目标进行正确定位和分类。DIOR上的实验结果表明,我们的网络可以达到比目前最先进的RSI目标检测器更高的检测精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-scale Attention Adaptive Network for Object Detection in Remote Sensing Images
Remote sensing images (RSI) have a large range of variations in the aspect of inter- and intra-class size variability across objects. As a key technology in the field of RSI processing, RSI object detection has been widely applied. Multilevel features fusion network is commonly used to improve the performance of object detection. However, the existing multilevel feature fusion networks for RSI lack the ability to combine global information. Aiming at this problem, A multi-scale attention adaptive network (MA2Net) is proposed to object detection in RSI. The main contributions of this paper are twofold. Firstly, a multi-scale attention adaptive network is designed to adaptively integrate the multilevel features. This network is composed of integrating (IG) block, channel self-attention (CS) block, and adaptive fusion (AF) block. Specifically, IG is designed to transform the multi-level features into an intermediate size. The CS block is an embedded gaussian self-attention module used to model the relationship between the feature channels. AF is developed to learn the multilevel expression of self-attention features to obtain multi-scale feature maps. Secondly, to achieve a balance between multi-task and higher accuracy, a feature align head is utilized to correctly locate and classify objects. The experimental results on DIOR show that our network can achieve higher detection accuracy than the state-of-the-art RSI object detector.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信