A Novel UAV Aerial Vehicle Detection Method Based on Attention Mechanism and Multi-scale Feature Cross Fusion

Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering Pub Date : 2021-01-15 DOI:10.1145/3460268.3460276

Zhigang Hou, Jin Yan, Bo Yang, Zhiming Ding

{"title":"A Novel UAV Aerial Vehicle Detection Method Based on Attention Mechanism and Multi-scale Feature Cross Fusion","authors":"Zhigang Hou, Jin Yan, Bo Yang, Zhiming Ding","doi":"10.1145/3460268.3460276","DOIUrl":null,"url":null,"abstract":"With the rapid development of artificial intelligence science, more and more researchers try to use deep learning to train neural networks and have achieved great success in object detection. Vehicle detection based on UAV image is a special field of object detection. Due to the low resolution of the vehicle object, complex background, and less image information, it is challenging to extract robust visual and spatial features from the depth network and accurately locate the object in complex scenes. In this paper, combining the characteristics of vehicles in aerial images, we design a novel feature pyramid network called channel-spatial attention fused feature pyramid network (CSF-FPN) with Faster R-CNN as the basic framework. In CSF-FPN, a hybrid attention mechanism and feature cross-fusion module are introduced, so that feature maps can be generated with enhanced spatial and channel interdependence to extract richer semantic information. After our CSF-FPN is integrated into the Faster R-CNN network, the detection performance of small objects is greatly improved. The experimental results based on the VEDIA Dataset showed that the proposed framework could effectively detect the vehicle in large scene azimuth. Compared with the existing advanced methods, mAP and F1-score are improved.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3460268.3460276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

With the rapid development of artificial intelligence science, more and more researchers try to use deep learning to train neural networks and have achieved great success in object detection. Vehicle detection based on UAV image is a special field of object detection. Due to the low resolution of the vehicle object, complex background, and less image information, it is challenging to extract robust visual and spatial features from the depth network and accurately locate the object in complex scenes. In this paper, combining the characteristics of vehicles in aerial images, we design a novel feature pyramid network called channel-spatial attention fused feature pyramid network (CSF-FPN) with Faster R-CNN as the basic framework. In CSF-FPN, a hybrid attention mechanism and feature cross-fusion module are introduced, so that feature maps can be generated with enhanced spatial and channel interdependence to extract richer semantic information. After our CSF-FPN is integrated into the Faster R-CNN network, the detection performance of small objects is greatly improved. The experimental results based on the VEDIA Dataset showed that the proposed framework could effectively detect the vehicle in large scene azimuth. Compared with the existing advanced methods, mAP and F1-score are improved.

查看原文本刊更多论文

一种基于注意机制和多尺度特征交叉融合的无人机飞行器检测方法

随着人工智能科学的快速发展，越来越多的研究者尝试使用深度学习来训练神经网络，并在目标检测方面取得了巨大的成功。基于无人机图像的车辆检测是目标检测的一个特殊领域。由于车辆目标分辨率低，背景复杂，图像信息少，从深度网络中提取鲁棒的视觉和空间特征并准确定位复杂场景中的目标是一项挑战。本文结合航空图像中车辆的特点，以Faster R-CNN为基本框架，设计了一种新型的通道-空间注意力融合特征金字塔网络(channel-spatial attention fused feature pyramid network, CSF-FPN)。在CSF-FPN中，引入了混合注意机制和特征交叉融合模块，增强了空间依赖性和通道依赖性，从而生成了更丰富的语义信息。我们的CSF-FPN集成到Faster R-CNN网络后，对小物体的检测性能大大提高。基于VEDIA数据集的实验结果表明，该框架可以有效地检测大场景方位角下的车辆。与现有的先进方法相比，mAP和f1评分得到了提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering

自引率

0.00%

发文量