{"title":"An Improved YOLOv7 Tiny Algorithm for Vehicle and Pedestrian Detection with Occlusion in Autonomous Driving","authors":"Jian Su;Fang Wang;Wei Zhuang","doi":"10.23919/cje.2023.00.256","DOIUrl":null,"url":null,"abstract":"Future transportation is advancing in the direction of intelligent transportation systems, where an essential part is vehicle and pedestrian detection. Due to the complex urban traffic environment, vehicles and pedestrians in road monitoring have different forms of occlusion problems, resulting in the missed detection of objects. We design an improved you only look once version 7 (YOLOv7) tiny algorithm for vehicle and pedestrian detection under occlusion, with the following four main improvements. In order to locate the object more accurately, <tex>$1 \\times 1$</tex> convolution and identity connection are added to the <tex>$3 \\times 3$</tex> convolution, and convolution reparameterization is used to enhance the inference speed of the network model. In view of the complex road background and more interference, the coordinate attention was added to the connection part of backbone and neck to enhance the network's capacity to detect the object and lessen interference from other targets. At the same time, before being sent to the detection head, global attention mechanism is added to improve the accuracy of model detection by capturing three-dimensional features. Considering the issue of imbalanced training samples, we propose focal complete intersection over union (CIOU) loss instead of CIOU loss to become the bounding box regression loss, so that the regression process attention to high-quality anchor boxes. Experiments show that the improved YOLOv7 tiny algorithm achieves 82.2% map @ 0.5 in pattern analysis, statistical modelling and computational learning visual object classes dataset, which is 2.8% higher than before the improvement. The performance of map @ 0.5:0.95 is 5.2% better than the previous improvement. The proposed improved algorithm can availably to detect partial occlusion objects.","PeriodicalId":50701,"journal":{"name":"Chinese Journal of Electronics","volume":"34 1","pages":"282-294"},"PeriodicalIF":1.6000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10891995","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Journal of Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10891995/","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Future transportation is advancing in the direction of intelligent transportation systems, where an essential part is vehicle and pedestrian detection. Due to the complex urban traffic environment, vehicles and pedestrians in road monitoring have different forms of occlusion problems, resulting in the missed detection of objects. We design an improved you only look once version 7 (YOLOv7) tiny algorithm for vehicle and pedestrian detection under occlusion, with the following four main improvements. In order to locate the object more accurately, $1 \times 1$ convolution and identity connection are added to the $3 \times 3$ convolution, and convolution reparameterization is used to enhance the inference speed of the network model. In view of the complex road background and more interference, the coordinate attention was added to the connection part of backbone and neck to enhance the network's capacity to detect the object and lessen interference from other targets. At the same time, before being sent to the detection head, global attention mechanism is added to improve the accuracy of model detection by capturing three-dimensional features. Considering the issue of imbalanced training samples, we propose focal complete intersection over union (CIOU) loss instead of CIOU loss to become the bounding box regression loss, so that the regression process attention to high-quality anchor boxes. Experiments show that the improved YOLOv7 tiny algorithm achieves 82.2% map @ 0.5 in pattern analysis, statistical modelling and computational learning visual object classes dataset, which is 2.8% higher than before the improvement. The performance of map @ 0.5:0.95 is 5.2% better than the previous improvement. The proposed improved algorithm can availably to detect partial occlusion objects.
期刊介绍:
CJE focuses on the emerging fields of electronics, publishing innovative and transformative research papers. Most of the papers published in CJE are from universities and research institutes, presenting their innovative research results. Both theoretical and practical contributions are encouraged, and original research papers reporting novel solutions to the hot topics in electronics are strongly recommended.