An Improved YOLOv7 Tiny Algorithm for Vehicle and Pedestrian Detection with Occlusion in Autonomous Driving

IF 1.6 4区计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC

Chinese Journal of Electronics Pub Date : 2025-01-01 DOI:10.23919/cje.2023.00.256

Jian Su;Fang Wang;Wei Zhuang

{"title":"An Improved YOLOv7 Tiny Algorithm for Vehicle and Pedestrian Detection with Occlusion in Autonomous Driving","authors":"Jian Su;Fang Wang;Wei Zhuang","doi":"10.23919/cje.2023.00.256","DOIUrl":null,"url":null,"abstract":"Future transportation is advancing in the direction of intelligent transportation systems, where an essential part is vehicle and pedestrian detection. Due to the complex urban traffic environment, vehicles and pedestrians in road monitoring have different forms of occlusion problems, resulting in the missed detection of objects. We design an improved you only look once version 7 (YOLOv7) tiny algorithm for vehicle and pedestrian detection under occlusion, with the following four main improvements. In order to locate the object more accurately, <tex>$1 \\times 1$</tex> convolution and identity connection are added to the <tex>$3 \\times 3$</tex> convolution, and convolution reparameterization is used to enhance the inference speed of the network model. In view of the complex road background and more interference, the coordinate attention was added to the connection part of backbone and neck to enhance the network's capacity to detect the object and lessen interference from other targets. At the same time, before being sent to the detection head, global attention mechanism is added to improve the accuracy of model detection by capturing three-dimensional features. Considering the issue of imbalanced training samples, we propose focal complete intersection over union (CIOU) loss instead of CIOU loss to become the bounding box regression loss, so that the regression process attention to high-quality anchor boxes. Experiments show that the improved YOLOv7 tiny algorithm achieves 82.2% map @ 0.5 in pattern analysis, statistical modelling and computational learning visual object classes dataset, which is 2.8% higher than before the improvement. The performance of map @ 0.5:0.95 is 5.2% better than the previous improvement. The proposed improved algorithm can availably to detect partial occlusion objects.","PeriodicalId":50701,"journal":{"name":"Chinese Journal of Electronics","volume":"34 1","pages":"282-294"},"PeriodicalIF":1.6000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10891995","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Journal of Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10891995/","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Future transportation is advancing in the direction of intelligent transportation systems, where an essential part is vehicle and pedestrian detection. Due to the complex urban traffic environment, vehicles and pedestrians in road monitoring have different forms of occlusion problems, resulting in the missed detection of objects. We design an improved you only look once version 7 (YOLOv7) tiny algorithm for vehicle and pedestrian detection under occlusion, with the following four main improvements. In order to locate the object more accurately, $1 \times 1$ convolution and identity connection are added to the $3 \times 3$ convolution, and convolution reparameterization is used to enhance the inference speed of the network model. In view of the complex road background and more interference, the coordinate attention was added to the connection part of backbone and neck to enhance the network's capacity to detect the object and lessen interference from other targets. At the same time, before being sent to the detection head, global attention mechanism is added to improve the accuracy of model detection by capturing three-dimensional features. Considering the issue of imbalanced training samples, we propose focal complete intersection over union (CIOU) loss instead of CIOU loss to become the bounding box regression loss, so that the regression process attention to high-quality anchor boxes. Experiments show that the improved YOLOv7 tiny algorithm achieves 82.2% map @ 0.5 in pattern analysis, statistical modelling and computational learning visual object classes dataset, which is 2.8% higher than before the improvement. The performance of map @ 0.5:0.95 is 5.2% better than the previous improvement. The proposed improved algorithm can availably to detect partial occlusion objects.

查看原文本刊更多论文

基于改进YOLOv7微小算法的自动驾驶车辆与行人遮挡检测

未来的交通正朝着智能交通系统的方向发展，其中车辆和行人的检测是必不可少的一部分。由于城市交通环境的复杂性，道路监测中的车辆和行人存在不同形式的遮挡问题，从而导致物体的漏检。我们设计了一个改进的你只看一次版本7 （YOLOv7）微小算法，用于遮挡下的车辆和行人检测，主要有以下四个改进。为了更准确地定位目标，在$3 \ × 3$卷积中加入$1 \ × 1$卷积和身份连接，并使用卷积重参数化来提高网络模型的推理速度。针对道路背景复杂、干扰较多的特点，在主干道和颈部的连接部分增加坐标关注，增强网络对目标的检测能力，减少其他目标的干扰。同时，在发送到检测头之前，增加了全局关注机制，通过捕捉三维特征来提高模型检测的精度。考虑到训练样本不平衡的问题，我们提出将CIOU （focal complete intersection over union）损失代替CIOU损失作为边界盒回归损失，使回归过程关注高质量的锚盒。实验表明，改进后的YOLOv7 tiny算法在模式分析、统计建模和计算学习视觉对象类数据集上的map @ 0.5达到82.2%，比改进前提高了2.8%。map @ 0.5:0.95的性能比之前的改进提高了5.2%。改进算法可以有效地检测局部遮挡目标。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Chinese Journal of Electronics 工程技术-工程：电子与电气

CiteScore

3.70

自引率

16.70%

发文量

342

审稿时长

12.0 months

期刊介绍： CJE focuses on the emerging fields of electronics, publishing innovative and transformative research papers. Most of the papers published in CJE are from universities and research institutes, presenting their innovative research results. Both theoretical and practical contributions are encouraged, and original research papers reporting novel solutions to the hot topics in electronics are strongly recommended.