Hold surrounding’s key-you only look once version 7: a real-time pedestrian and vehicle detection algorithm in the low-signal-to-noise ratio infrared image

IF 1.2 4区工程技术 Q4 OPTICS

Optical Engineering Pub Date : 2023-11-01 DOI:10.1117/1.OE.62.11.118105

Yang Liu, Fulong Yi, Yuhua Ma, Yongfu Wang, Dianhui Wang

{"title":"Hold surrounding’s key-you only look once version 7: a real-time pedestrian and vehicle detection algorithm in the low-signal-to-noise ratio infrared image","authors":"Yang Liu, Fulong Yi, Yuhua Ma, Yongfu Wang, Dianhui Wang","doi":"10.1117/1.OE.62.11.118105","DOIUrl":null,"url":null,"abstract":"Abstract. Compared with visible light imaging technology, infrared optical imaging technology (see Fig. 1) is less affected by weather and illumination and is a potential auxiliary solution method for the future of the autonomous driving system. Based on the above characteristics, this study proposes the hold surrounding’s key (HSK)-you only look once (YOLOv7) algorithm. First, based on the characteristics of the infrared image, this study optimizes the network structure of YOLOv7 and proposes a vehicle and pedestrian detection algorithm based on the improved YOLOv7. Aiming at the problem that the reasoning speed and detection accuracy of pedestrian and vehicle detection algorithms based on infrared images are challenging to balance, occupy large storage space, and are difficult to deploy and run in real-time in low and medium-performance devices, the MPConv is added to replace the Conv structure in YOLOv7. In view of the false detection, missed detection, mutual occlusion and overlap of detected objects, and other situations that the YOLOv7 algorithm is prone to cause in the actual deployment environment easily, a tiny object detection layer is added. At the same time, to solve the problem that infrared optical imaging systems are prone to noise caused by external factors, this study introduces the TRPCA method for image denoising in the preprocessing process of the YOLOv7 algorithm. In the end, the HSK-YOLOv7 algorithm is verified using the self-made infrared traffic object detection dataset and the publicly available FLIR dataset to verify the detection effect of the HSK-YOLOv7 algorithm on near-infrared images and thermal infrared images. The parameter quantity of our algorithm is 37.3M, and the computing throughput is 107.5 GFLOPs. The detection speed on the self-made dataset and FLIR dataset reaches 163 frames per second (FPS) and 71.8 FPS, respectively, and the mAP@0.5 indicator reaches 94.08% and 61.3%, respectively. In general, HSK-YOLOv7 can meet the real-time requirements of the autonomous driving system while ensuring detection accuracy.","PeriodicalId":19561,"journal":{"name":"Optical Engineering","volume":"34 1","pages":"118105 - 118105"},"PeriodicalIF":1.2000,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optical Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1117/1.OE.62.11.118105","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract. Compared with visible light imaging technology, infrared optical imaging technology (see Fig. 1) is less affected by weather and illumination and is a potential auxiliary solution method for the future of the autonomous driving system. Based on the above characteristics, this study proposes the hold surrounding’s key (HSK)-you only look once (YOLOv7) algorithm. First, based on the characteristics of the infrared image, this study optimizes the network structure of YOLOv7 and proposes a vehicle and pedestrian detection algorithm based on the improved YOLOv7. Aiming at the problem that the reasoning speed and detection accuracy of pedestrian and vehicle detection algorithms based on infrared images are challenging to balance, occupy large storage space, and are difficult to deploy and run in real-time in low and medium-performance devices, the MPConv is added to replace the Conv structure in YOLOv7. In view of the false detection, missed detection, mutual occlusion and overlap of detected objects, and other situations that the YOLOv7 algorithm is prone to cause in the actual deployment environment easily, a tiny object detection layer is added. At the same time, to solve the problem that infrared optical imaging systems are prone to noise caused by external factors, this study introduces the TRPCA method for image denoising in the preprocessing process of the YOLOv7 algorithm. In the end, the HSK-YOLOv7 algorithm is verified using the self-made infrared traffic object detection dataset and the publicly available FLIR dataset to verify the detection effect of the HSK-YOLOv7 algorithm on near-infrared images and thermal infrared images. The parameter quantity of our algorithm is 37.3M, and the computing throughput is 107.5 GFLOPs. The detection speed on the self-made dataset and FLIR dataset reaches 163 frames per second (FPS) and 71.8 FPS, respectively, and the mAP@0.5 indicator reaches 94.08% and 61.3%, respectively. In general, HSK-YOLOv7 can meet the real-time requirements of the autonomous driving system while ensuring detection accuracy.

查看原文本刊更多论文

握住周围的钥匙--你只需看一次第 7 版：低信噪比红外图像中的行人和车辆实时检测算法

摘要与可见光成像技术相比，红外光学成像技术（见图 1）受天气和光照的影响较小，是未来自动驾驶系统潜在的辅助解决方法。基于上述特点，本研究提出了 "Hold surrounding's key（HSK）-you only look once（YOLOv7）"算法。首先，根据红外图像的特点，本研究优化了 YOLOv7 的网络结构，并提出了基于改进后的 YOLOv7 的车辆和行人检测算法。针对基于红外图像的行人和车辆检测算法的推理速度和检测精度难以兼顾、占用存储空间大、难以在中低性能设备中实时部署和运行的问题，增加了MPConv来替代YOLOv7中的Conv结构；针对YOLOv7算法在实际部署环境中容易出现的误检、漏检、检测物体相互遮挡和重叠等情况，增加了微小物体检测层。同时，为了解决红外光学成像系统易受外界因素影响而产生噪声的问题，本研究在 YOLOv7 算法的预处理过程中引入了 TRPCA 方法对图像进行去噪处理。最后，利用自制的红外交通物体检测数据集和公开的FLIR数据集对HSK-YOLOv7算法进行了验证，验证了HSK-YOLOv7算法在近红外图像和热红外图像上的检测效果。我们的算法参数量为 37.3M，计算吞吐量为 107.5 GFLOPs。在自制数据集和 FLIR 数据集上的检测速度分别达到每秒 163 帧（FPS）和 71.8 FPS，mAP@0.5 指标分别达到 94.08% 和 61.3%。总体而言，HSK-YOLOv7 可以满足自动驾驶系统的实时性要求，同时保证检测精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Optical Engineering 工程技术-光学

CiteScore

2.70

自引率

7.70%

发文量

393

审稿时长

2.6 months

期刊介绍： Optical Engineering publishes peer-reviewed papers reporting on research and development in optical science and engineering and the practical applications of known optical science, engineering, and technology.