A Novel UAV Aerial Vehicle Detection Method Based on Attention Mechanism and Multi-scale Feature Cross Fusion

Zhigang Hou, Jin Yan, Bo Yang, Zhiming Ding
{"title":"A Novel UAV Aerial Vehicle Detection Method Based on Attention Mechanism and Multi-scale Feature Cross Fusion","authors":"Zhigang Hou, Jin Yan, Bo Yang, Zhiming Ding","doi":"10.1145/3460268.3460276","DOIUrl":null,"url":null,"abstract":"With the rapid development of artificial intelligence science, more and more researchers try to use deep learning to train neural networks and have achieved great success in object detection. Vehicle detection based on UAV image is a special field of object detection. Due to the low resolution of the vehicle object, complex background, and less image information, it is challenging to extract robust visual and spatial features from the depth network and accurately locate the object in complex scenes. In this paper, combining the characteristics of vehicles in aerial images, we design a novel feature pyramid network called channel-spatial attention fused feature pyramid network (CSF-FPN) with Faster R-CNN as the basic framework. In CSF-FPN, a hybrid attention mechanism and feature cross-fusion module are introduced, so that feature maps can be generated with enhanced spatial and channel interdependence to extract richer semantic information. After our CSF-FPN is integrated into the Faster R-CNN network, the detection performance of small objects is greatly improved. The experimental results based on the VEDIA Dataset showed that the proposed framework could effectively detect the vehicle in large scene azimuth. Compared with the existing advanced methods, mAP and F1-score are improved.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3460268.3460276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

With the rapid development of artificial intelligence science, more and more researchers try to use deep learning to train neural networks and have achieved great success in object detection. Vehicle detection based on UAV image is a special field of object detection. Due to the low resolution of the vehicle object, complex background, and less image information, it is challenging to extract robust visual and spatial features from the depth network and accurately locate the object in complex scenes. In this paper, combining the characteristics of vehicles in aerial images, we design a novel feature pyramid network called channel-spatial attention fused feature pyramid network (CSF-FPN) with Faster R-CNN as the basic framework. In CSF-FPN, a hybrid attention mechanism and feature cross-fusion module are introduced, so that feature maps can be generated with enhanced spatial and channel interdependence to extract richer semantic information. After our CSF-FPN is integrated into the Faster R-CNN network, the detection performance of small objects is greatly improved. The experimental results based on the VEDIA Dataset showed that the proposed framework could effectively detect the vehicle in large scene azimuth. Compared with the existing advanced methods, mAP and F1-score are improved.
一种基于注意机制和多尺度特征交叉融合的无人机飞行器检测方法
随着人工智能科学的快速发展,越来越多的研究者尝试使用深度学习来训练神经网络,并在目标检测方面取得了巨大的成功。基于无人机图像的车辆检测是目标检测的一个特殊领域。由于车辆目标分辨率低,背景复杂,图像信息少,从深度网络中提取鲁棒的视觉和空间特征并准确定位复杂场景中的目标是一项挑战。本文结合航空图像中车辆的特点,以Faster R-CNN为基本框架,设计了一种新型的通道-空间注意力融合特征金字塔网络(channel-spatial attention fused feature pyramid network, CSF-FPN)。在CSF-FPN中,引入了混合注意机制和特征交叉融合模块,增强了空间依赖性和通道依赖性,从而生成了更丰富的语义信息。我们的CSF-FPN集成到Faster R-CNN网络后,对小物体的检测性能大大提高。基于VEDIA数据集的实验结果表明,该框架可以有效地检测大场景方位角下的车辆。与现有的先进方法相比,mAP和f1评分得到了提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信