求助PDF
{"title":"Research on Pedestrian Detection Algorithm in Industrial Scene Based on Improved YOLOv7-Tiny","authors":"Ling Wang, Junxu Bai, Peng Wang, Yane Bai","doi":"10.1002/tee.24075","DOIUrl":null,"url":null,"abstract":"<p>YOLOv7 is one of the most effective algorithms for one-stage detectors. However, when it is applied to pedestrian detection tasks in the industrial scene, it is still challenging for complex environments and multi-scale changes of pedestrians. This paper proposes a new pedestrian detector for the industrial scene based on improved YOLOv7-tiny and named as GP-YOLO. First, the neck of YOLOv7-tiny is replaced by RepGFPN structure, make full use of multi-scale features to enhance the detection accuracy of objects with large-scale changes. Second, a new g<sup>n</sup>conv branch is added to the feature fusion module, and the high-order spatial interaction capability is introduced to further enhance the target detection accuracy. Finally, a lightweight method based on PModule is proposed, on this basis, a PConv bottleneck is designed to reduce the FLOPs and enhance the feature extraction. Experiments on a self-made Industrial Pedestrian Data set show that before lightweight, the proposed algorithm achieves a 3.2% improvement in [email protected]:0.95 and a 3.7% improvement in Recall compared to the baseline YOLOv7-tiny. After lightweight GP-YOLO, compared to non-lightweight, parameters and FLOPs are decreased by 26% and 23%, respectively, the [email protected]:0.95 is decreased by only 1.1% and the Recall is decreased by only 1.3%, which remains at a high level. Compared with baseline YOLOv7-tiny, the lightweight GP-YOLO has similar parameters and FLOPs, but the [email protected]:0.95 is increased by 2.1%, and the Recall is increased by 2.4%. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.</p>","PeriodicalId":13435,"journal":{"name":"IEEJ Transactions on Electrical and Electronic Engineering","volume":"19 7","pages":"1203-1215"},"PeriodicalIF":1.0000,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEJ Transactions on Electrical and Electronic Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/tee.24075","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
引用
批量引用
Abstract
YOLOv7 is one of the most effective algorithms for one-stage detectors. However, when it is applied to pedestrian detection tasks in the industrial scene, it is still challenging for complex environments and multi-scale changes of pedestrians. This paper proposes a new pedestrian detector for the industrial scene based on improved YOLOv7-tiny and named as GP-YOLO. First, the neck of YOLOv7-tiny is replaced by RepGFPN structure, make full use of multi-scale features to enhance the detection accuracy of objects with large-scale changes. Second, a new gn conv branch is added to the feature fusion module, and the high-order spatial interaction capability is introduced to further enhance the target detection accuracy. Finally, a lightweight method based on PModule is proposed, on this basis, a PConv bottleneck is designed to reduce the FLOPs and enhance the feature extraction. Experiments on a self-made Industrial Pedestrian Data set show that before lightweight, the proposed algorithm achieves a 3.2% improvement in [email protected]:0.95 and a 3.7% improvement in Recall compared to the baseline YOLOv7-tiny. After lightweight GP-YOLO, compared to non-lightweight, parameters and FLOPs are decreased by 26% and 23%, respectively, the [email protected]:0.95 is decreased by only 1.1% and the Recall is decreased by only 1.3%, which remains at a high level. Compared with baseline YOLOv7-tiny, the lightweight GP-YOLO has similar parameters and FLOPs, but the [email protected]:0.95 is increased by 2.1%, and the Recall is increased by 2.4%. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.
基于改进型 YOLOv7-Tiny 的工业场景行人检测算法研究
YOLOv7 是单级检测器中最有效的算法之一。然而,当它应用于工业场景中的行人检测任务时,对于复杂环境和行人的多尺度变化仍然具有挑战性。本文提出了一种基于改进型 YOLOv7-tiny 的新型工业场景行人检测器,并命名为 GP-YOLO。首先,将 YOLOv7-tiny 的颈部结构替换为 RepGFPN 结构,充分利用多尺度特征,提高对具有大尺度变化物体的检测精度。其次,在特征融合模块中新增 gnconv 分支,引入高阶空间交互能力,进一步提高目标检测精度。最后,提出了一种基于 PM 模块的轻量级方法,并在此基础上设计了 PConv 瓶颈,以减少 FLOPs 并增强特征提取能力。在自制的工业行人数据集上进行的实验表明,与基线 YOLOv7-tiny 相比,在轻量级之前,所提出的算法在 mAP@0.5:0.95 和 Recall 方面分别提高了 3.2% 和 3.7%。轻量级 GP-YOLO 后,与非轻量级相比,参数和 FLOPs 分别降低了 26% 和 23%,mAP@0.5:0.95 仅降低了 1.1%,召回率仅降低了 1.3%,保持在较高水平。与基线 YOLOv7-tiny 相比,轻量级 GP-YOLO 的参数和 FLOPs 相似,但 mAP@0.5:0.95 增加了 2.1%,Recall 增加了 2.4%。© 2024 日本电气工程师学会。由 Wiley Periodicals LLC 出版。
本文章由计算机程序翻译,如有差异,请以英文原文为准。