{"title":"Multi-branch stacking remote sensing image target detection based on YOLOv5","authors":"Luxuan Bian, Bo Li, Jue Wang, Zijun Gao","doi":"10.1016/j.ejrs.2023.11.006","DOIUrl":null,"url":null,"abstract":"<div><p>Optical remote sensing is crucial in land management, maritime safety, and rescue operations. Currently, high resolution target detection faces the problems including feature loss, false detection, and limited network robustness. To tackle the aforementioned issues, this study introduces a novel MBS-NET model, which is built upon the YOLOv5 structure. The proposed model presents a multi-branch stacking module structure to precisely capture deep target feature information. By introducing a dual-channel attention mechanism module (EGCA) at the model's neck, the proposed method ensures discriminative feature acquisition in crowded objects and vast spatial scenes. Prior to network training, introduce an improved data augmentation strategy to accommodate the multi-scale and directional variations in remote sensing objectives for model detection. Experimental results on the large-scale public DIOR dataset demonstrate that the MBS-NET model introduced in this paper displays exceptional performance and remarkable interpretability in remote sensing scenarios at a large scale. MBS-NET model outperforms YOLOv5 and YOLOv7 models by increasing the accuracy by 5% and 2% respectively. In addition, the recall rate and <span><math><msub><mi>F</mi><mn>1</mn></msub></math></span> Score index of MBS-NET model is superior to those of other methods, resulting in significant improvement of detection accuracy and robustness in large scenes.</p></div>","PeriodicalId":48539,"journal":{"name":"Egyptian Journal of Remote Sensing and Space Sciences","volume":"26 4","pages":"Pages 999-1008"},"PeriodicalIF":3.7000,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1110982323000959/pdfft?md5=d0d8bf89a9ff290918cc230989cc390b&pid=1-s2.0-S1110982323000959-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Journal of Remote Sensing and Space Sciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110982323000959","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Optical remote sensing is crucial in land management, maritime safety, and rescue operations. Currently, high resolution target detection faces the problems including feature loss, false detection, and limited network robustness. To tackle the aforementioned issues, this study introduces a novel MBS-NET model, which is built upon the YOLOv5 structure. The proposed model presents a multi-branch stacking module structure to precisely capture deep target feature information. By introducing a dual-channel attention mechanism module (EGCA) at the model's neck, the proposed method ensures discriminative feature acquisition in crowded objects and vast spatial scenes. Prior to network training, introduce an improved data augmentation strategy to accommodate the multi-scale and directional variations in remote sensing objectives for model detection. Experimental results on the large-scale public DIOR dataset demonstrate that the MBS-NET model introduced in this paper displays exceptional performance and remarkable interpretability in remote sensing scenarios at a large scale. MBS-NET model outperforms YOLOv5 and YOLOv7 models by increasing the accuracy by 5% and 2% respectively. In addition, the recall rate and Score index of MBS-NET model is superior to those of other methods, resulting in significant improvement of detection accuracy and robustness in large scenes.
期刊介绍:
The Egyptian Journal of Remote Sensing and Space Sciences (EJRS) encompasses a comprehensive range of topics within Remote Sensing, Geographic Information Systems (GIS), planetary geology, and space technology development, including theories, applications, and modeling. EJRS aims to disseminate high-quality, peer-reviewed research focusing on the advancement of remote sensing and GIS technologies and their practical applications for effective planning, sustainable development, and environmental resource conservation. The journal particularly welcomes innovative papers with broad scientific appeal.