SOD-YOLO: Small Object Detection Network for UAV Aerial Images

IF 1 4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC
Zhiqian He, Lijie Cao
{"title":"SOD-YOLO: Small Object Detection Network for UAV Aerial Images","authors":"Zhiqian He,&nbsp;Lijie Cao","doi":"10.1002/tee.24195","DOIUrl":null,"url":null,"abstract":"<p>With the rapid development of the UAV industry, object detection using UAV has become a research hotspot. However, most current object detection models based on deep learning have large parameter counts and are difficult to deploy on embedded devices with limited memory and computational power. To address this problem, a Small Object Detection network for UAV aerial images SOD-YOLO based on YOLOv8 is proposed, which can meet the application requirements of resource-constrained devices while ensuring the detection accuracy. First, cross-domain fusion attention (CDFA) mechanism is proposed to build the C2f-Attention module in this paper, which is embedded in the backbone network in order to improve the extraction capability of key object features. Meanwhile, the AIFI_LSPE feature fusion module with improved RT-DETR and the IoU-aware query selection mechanism are added to the path aggregation network to improve the accuracy of multi-scale object detection. In addition, in order to balance the sample size ratio and improve the robustness of the network model, we make a new UAV image dataset named VisDrone2019 Extended Edition (VDEE) using images from the VisDrone2019 and UAVDT public datasets. Finally, Shape-IoU is used as a loss function to reduce the difference between the object GT frame and the detection frame. Experiments show that SOD-YOLO has a [email protected] of 42.8% in the VDEE dataset, which is increased by 5.1% over YOLOv8. In the VisDrone2019 dataset [email protected] is 39.2%, an improvement of 5.8% over YOLOv8. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.</p>","PeriodicalId":13435,"journal":{"name":"IEEJ Transactions on Electrical and Electronic Engineering","volume":"20 3","pages":"431-439"},"PeriodicalIF":1.0000,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEJ Transactions on Electrical and Electronic Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/tee.24195","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

With the rapid development of the UAV industry, object detection using UAV has become a research hotspot. However, most current object detection models based on deep learning have large parameter counts and are difficult to deploy on embedded devices with limited memory and computational power. To address this problem, a Small Object Detection network for UAV aerial images SOD-YOLO based on YOLOv8 is proposed, which can meet the application requirements of resource-constrained devices while ensuring the detection accuracy. First, cross-domain fusion attention (CDFA) mechanism is proposed to build the C2f-Attention module in this paper, which is embedded in the backbone network in order to improve the extraction capability of key object features. Meanwhile, the AIFI_LSPE feature fusion module with improved RT-DETR and the IoU-aware query selection mechanism are added to the path aggregation network to improve the accuracy of multi-scale object detection. In addition, in order to balance the sample size ratio and improve the robustness of the network model, we make a new UAV image dataset named VisDrone2019 Extended Edition (VDEE) using images from the VisDrone2019 and UAVDT public datasets. Finally, Shape-IoU is used as a loss function to reduce the difference between the object GT frame and the detection frame. Experiments show that SOD-YOLO has a [email protected] of 42.8% in the VDEE dataset, which is increased by 5.1% over YOLOv8. In the VisDrone2019 dataset [email protected] is 39.2%, an improvement of 5.8% over YOLOv8. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.

求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEJ Transactions on Electrical and Electronic Engineering
IEEJ Transactions on Electrical and Electronic Engineering 工程技术-工程:电子与电气
CiteScore
2.70
自引率
10.00%
发文量
199
审稿时长
4.3 months
期刊介绍: IEEJ Transactions on Electrical and Electronic Engineering (hereinafter called TEEE ) publishes 6 times per year as an official journal of the Institute of Electrical Engineers of Japan (hereinafter "IEEJ"). This peer-reviewed journal contains original research papers and review articles on the most important and latest technological advances in core areas of Electrical and Electronic Engineering and in related disciplines. The journal also publishes short communications reporting on the results of the latest research activities TEEE ) aims to provide a new forum for IEEJ members in Japan as well as fellow researchers in Electrical and Electronic Engineering from around the world to exchange ideas and research findings.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信