Multi-branch stacking remote sensing image target detection based on YOLOv5

IF 3.7 3区 地球科学 Q2 ENVIRONMENTAL SCIENCES
Luxuan Bian, Bo Li, Jue Wang, Zijun Gao
{"title":"Multi-branch stacking remote sensing image target detection based on YOLOv5","authors":"Luxuan Bian,&nbsp;Bo Li,&nbsp;Jue Wang,&nbsp;Zijun Gao","doi":"10.1016/j.ejrs.2023.11.006","DOIUrl":null,"url":null,"abstract":"<div><p>Optical remote sensing is crucial in land management, maritime safety, and rescue operations. Currently, high resolution target detection faces the problems including feature loss, false detection, and limited network robustness. To tackle the aforementioned issues, this study introduces a novel MBS-NET model, which is built upon the YOLOv5 structure. The proposed model presents a multi-branch stacking module structure to precisely capture deep target feature information. By introducing a dual-channel attention mechanism module (EGCA) at the model's neck, the proposed method ensures discriminative feature acquisition in crowded objects and vast spatial scenes. Prior to network training, introduce an improved data augmentation strategy to accommodate the multi-scale and directional variations in remote sensing objectives for model detection. Experimental results on the large-scale public DIOR dataset demonstrate that the MBS-NET model introduced in this paper displays exceptional performance and remarkable interpretability in remote sensing scenarios at a large scale. MBS-NET model outperforms YOLOv5 and YOLOv7 models by increasing the accuracy by 5% and 2% respectively. In addition, the recall rate and <span><math><msub><mi>F</mi><mn>1</mn></msub></math></span> Score index of MBS-NET model is superior to those of other methods, resulting in significant improvement of detection accuracy and robustness in large scenes.</p></div>","PeriodicalId":48539,"journal":{"name":"Egyptian Journal of Remote Sensing and Space Sciences","volume":"26 4","pages":"Pages 999-1008"},"PeriodicalIF":3.7000,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1110982323000959/pdfft?md5=d0d8bf89a9ff290918cc230989cc390b&pid=1-s2.0-S1110982323000959-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Journal of Remote Sensing and Space Sciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110982323000959","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Optical remote sensing is crucial in land management, maritime safety, and rescue operations. Currently, high resolution target detection faces the problems including feature loss, false detection, and limited network robustness. To tackle the aforementioned issues, this study introduces a novel MBS-NET model, which is built upon the YOLOv5 structure. The proposed model presents a multi-branch stacking module structure to precisely capture deep target feature information. By introducing a dual-channel attention mechanism module (EGCA) at the model's neck, the proposed method ensures discriminative feature acquisition in crowded objects and vast spatial scenes. Prior to network training, introduce an improved data augmentation strategy to accommodate the multi-scale and directional variations in remote sensing objectives for model detection. Experimental results on the large-scale public DIOR dataset demonstrate that the MBS-NET model introduced in this paper displays exceptional performance and remarkable interpretability in remote sensing scenarios at a large scale. MBS-NET model outperforms YOLOv5 and YOLOv7 models by increasing the accuracy by 5% and 2% respectively. In addition, the recall rate and F1 Score index of MBS-NET model is superior to those of other methods, resulting in significant improvement of detection accuracy and robustness in large scenes.

基于YOLOv5的多分支叠加遥感图像目标检测
光学遥感在陆地管理、海上安全和救援行动中至关重要。目前,高分辨率目标检测面临着特征丢失、误检、网络鲁棒性受限等问题。为了解决上述问题,本研究引入了一种基于YOLOv5结构的新型MBS-NET模型。该模型采用多分支叠加模块结构,能够准确捕获目标深度特征信息。该方法通过在模型颈部引入双通道注意机制模块(EGCA),保证了在拥挤对象和广阔空间场景下的判别性特征获取。在网络训练之前,引入一种改进的数据增强策略,以适应模型检测中遥感目标的多尺度和方向变化。在大型公开DIOR数据集上的实验结果表明,本文引入的MBS-NET模型在大尺度遥感场景下表现出优异的性能和良好的可解释性。MBS-NET模型比YOLOv5和YOLOv7模型的准确率分别提高了5%和2%。此外,MBS-NET模型的召回率和F1 Score指标优于其他方法,在大场景下检测精度和鲁棒性显著提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.10
自引率
0.00%
发文量
85
审稿时长
48 weeks
期刊介绍: The Egyptian Journal of Remote Sensing and Space Sciences (EJRS) encompasses a comprehensive range of topics within Remote Sensing, Geographic Information Systems (GIS), planetary geology, and space technology development, including theories, applications, and modeling. EJRS aims to disseminate high-quality, peer-reviewed research focusing on the advancement of remote sensing and GIS technologies and their practical applications for effective planning, sustainable development, and environmental resource conservation. The journal particularly welcomes innovative papers with broad scientific appeal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信