Research on Object Detection in Complex Scenarios Based on ASA-YOLOv5

IF 1 4区工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEJ Transactions on Electrical and Electronic Engineering Pub Date : 2024-12-03 DOI:10.1002/tee.24249

Shoujun Lin, Lixia Deng, Hongyu Zhang, Lingyun Bi, Jinshun Dong, Dapeng Wan, Haiying Liu, Lida Liu

{"title":"Research on Object Detection in Complex Scenarios Based on ASA-YOLOv5","authors":"Shoujun Lin, Lixia Deng, Hongyu Zhang, Lingyun Bi, Jinshun Dong, Dapeng Wan, Haiying Liu, Lida Liu","doi":"10.1002/tee.24249","DOIUrl":null,"url":null,"abstract":"<p>The applications of target detection in complex scenarios cover a wide range of fields, such as pedestrian and vehicle detection in self-driving cars, face recognition and abnormal behavior detection in security monitoring systems, hazardous materials safety detection in public transportation, and so on. These applications demonstrate the importance and the prospect of wide application of target detection techniques in solving practical problems in complex scenarios. However, in these real scenes, there are often problems such as mutual occlusion and scale change. Therefore, how to accurately identify the target in the real complex scenarios has become a big problem to be solved. In order to solve the above problem, the paper proposes a novel algorithm, Adaptive Self-Attention-YOLOv5 (ASA-YOLOv5), which is built upon the YOLOv5s algorithm and demonstrates effectiveness for target identification in complex scenarios. First, the paper implements a fusion mechanism between the trunk and neck networks, enabling the fusion of features across different levels through upsampling and downsampling. This fusion process mitigates detection errors caused by feature loss. Second, the Shuffle Attention mechanism is introduced before upsampling and downsampling to suppress noise and amplify essential semantic information, further enhancing target identification accuracy. Lastly, the Adaptively Spatial Feature Fusion (ASFF) module and Receptive Field Blocks (RFBs) module are added in the head network, and it can improve feature scale invariance and expand the receptive field. The ability of the model to detect the target in the complex scene is improved effectively. Experimental results indicate a notable improvement in the model's mean Average Precision (mAP) by 2.1% on the COCO dataset and 0.7% on the SIXray dataset. The proposed ASA-YOLOv5 algorithm can enhance the effectiveness for target detection in complex scenarios, and it can be widely used in real-world settings. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.</p>","PeriodicalId":13435,"journal":{"name":"IEEJ Transactions on Electrical and Electronic Engineering","volume":"20 6","pages":"876-885"},"PeriodicalIF":1.0000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEJ Transactions on Electrical and Electronic Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/tee.24249","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

The applications of target detection in complex scenarios cover a wide range of fields, such as pedestrian and vehicle detection in self-driving cars, face recognition and abnormal behavior detection in security monitoring systems, hazardous materials safety detection in public transportation, and so on. These applications demonstrate the importance and the prospect of wide application of target detection techniques in solving practical problems in complex scenarios. However, in these real scenes, there are often problems such as mutual occlusion and scale change. Therefore, how to accurately identify the target in the real complex scenarios has become a big problem to be solved. In order to solve the above problem, the paper proposes a novel algorithm, Adaptive Self-Attention-YOLOv5 (ASA-YOLOv5), which is built upon the YOLOv5s algorithm and demonstrates effectiveness for target identification in complex scenarios. First, the paper implements a fusion mechanism between the trunk and neck networks, enabling the fusion of features across different levels through upsampling and downsampling. This fusion process mitigates detection errors caused by feature loss. Second, the Shuffle Attention mechanism is introduced before upsampling and downsampling to suppress noise and amplify essential semantic information, further enhancing target identification accuracy. Lastly, the Adaptively Spatial Feature Fusion (ASFF) module and Receptive Field Blocks (RFBs) module are added in the head network, and it can improve feature scale invariance and expand the receptive field. The ability of the model to detect the target in the complex scene is improved effectively. Experimental results indicate a notable improvement in the model's mean Average Precision (mAP) by 2.1% on the COCO dataset and 0.7% on the SIXray dataset. The proposed ASA-YOLOv5 algorithm can enhance the effectiveness for target detection in complex scenarios, and it can be widely used in real-world settings. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.

查看原文本刊更多论文

基于ASA-YOLOv5的复杂场景目标检测研究

复杂场景下的目标检测应用领域非常广泛，如自动驾驶汽车中的行人和车辆检测、安防监控系统中的人脸识别和异常行为检测、公共交通中的危险品安全检测等。这些应用表明了目标检测技术在解决复杂场景中的实际问题方面的重要性和广泛应用的前景。但在这些真实场景中，往往存在相互遮挡、尺度变化等问题。因此，如何在真实的复杂场景中准确识别目标就成为一个亟待解决的大问题。为了解决上述问题，本文提出了一种基于YOLOv5s算法的自适应自我注意- yolov5 （ASA-YOLOv5）算法，该算法对复杂场景下的目标识别具有一定的有效性。首先，本文实现了主干网络和颈部网络之间的融合机制，通过上采样和下采样实现了不同层次特征的融合。这种融合过程减轻了由于特征丢失而导致的检测错误。其次，在上采样和下采样之前引入Shuffle注意机制，抑制噪声，放大必要的语义信息，进一步提高目标识别精度。最后，在头部网络中加入自适应空间特征融合（adaptive Spatial Feature Fusion， ASFF）模块和接收野块（Receptive Field Blocks, RFBs）模块，提高了特征尺度不变性，扩展了接收野。有效地提高了模型在复杂场景中检测目标的能力。实验结果表明，该模型在COCO数据集上的平均精度（mAP）提高了2.1%，在SIXray数据集上提高了0.7%。本文提出的ASA-YOLOv5算法可以提高复杂场景下目标检测的有效性，可广泛应用于现实环境。©2024日本电气工程师协会和Wiley期刊有限责任公司。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEJ Transactions on Electrical and Electronic Engineering 工程技术-工程：电子与电气

CiteScore

2.70

自引率

10.00%

发文量

199

审稿时长

4.3 months

期刊介绍： IEEJ Transactions on Electrical and Electronic Engineering (hereinafter called TEEE ) publishes 6 times per year as an official journal of the Institute of Electrical Engineers of Japan (hereinafter "IEEJ"). This peer-reviewed journal contains original research papers and review articles on the most important and latest technological advances in core areas of Electrical and Electronic Engineering and in related disciplines. The journal also publishes short communications reporting on the results of the latest research activities TEEE ) aims to provide a new forum for IEEJ members in Japan as well as fellow researchers in Electrical and Electronic Engineering from around the world to exchange ideas and research findings.