{"title":"基于YOLOv3模型和Retinex增强图像的行人检测方法","authors":"Hongquan Qu, Tongyang Yuan, Zhiyong Sheng, Yuan Zhang","doi":"10.1109/CISP-BMEI.2018.8633119","DOIUrl":null,"url":null,"abstract":"Pedestrian detection is a basic technology in the field of intelligent traffic video surveillance. It is also help for the optimization design of rail transport. It is known that the deep learning technology can achieve considerable performance on pedestrian detection. However, this kind of methods demand a large number of high-quality samples. In addition, the quality of data sample in the subway station is usually sensitive to the background environment, such as variant illumination or pedestrian density, which can significantly affect performance of the deep neural network. To solve this problem, this paper adopts an image enhancement policy based on the Retinex theory to preprocess training samples to reduce the influence of light changes. Firstly, we use the image enhancement method to enhance the contrast of the image and highlight the color of the object itself. Next, we put the initial sample into darknet frame with YOLOv3 to train the detection model1 and put the enhanced sample into the YOLOv3 to train the detection model 2. Finally, we tested these two models with 200 pedestrian pictures of four different scenarios. The experimental results show that the model trained by Retinex image enhancement has a more accurate detection rate of 94% compared with the model without the enhancement sample trained.","PeriodicalId":117227,"journal":{"name":"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"A Pedestrian Detection Method Based on YOLOv3 Model and Image Enhanced by Retinex\",\"authors\":\"Hongquan Qu, Tongyang Yuan, Zhiyong Sheng, Yuan Zhang\",\"doi\":\"10.1109/CISP-BMEI.2018.8633119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pedestrian detection is a basic technology in the field of intelligent traffic video surveillance. It is also help for the optimization design of rail transport. It is known that the deep learning technology can achieve considerable performance on pedestrian detection. However, this kind of methods demand a large number of high-quality samples. In addition, the quality of data sample in the subway station is usually sensitive to the background environment, such as variant illumination or pedestrian density, which can significantly affect performance of the deep neural network. To solve this problem, this paper adopts an image enhancement policy based on the Retinex theory to preprocess training samples to reduce the influence of light changes. Firstly, we use the image enhancement method to enhance the contrast of the image and highlight the color of the object itself. Next, we put the initial sample into darknet frame with YOLOv3 to train the detection model1 and put the enhanced sample into the YOLOv3 to train the detection model 2. Finally, we tested these two models with 200 pedestrian pictures of four different scenarios. The experimental results show that the model trained by Retinex image enhancement has a more accurate detection rate of 94% compared with the model without the enhancement sample trained.\",\"PeriodicalId\":117227,\"journal\":{\"name\":\"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISP-BMEI.2018.8633119\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI.2018.8633119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Pedestrian Detection Method Based on YOLOv3 Model and Image Enhanced by Retinex
Pedestrian detection is a basic technology in the field of intelligent traffic video surveillance. It is also help for the optimization design of rail transport. It is known that the deep learning technology can achieve considerable performance on pedestrian detection. However, this kind of methods demand a large number of high-quality samples. In addition, the quality of data sample in the subway station is usually sensitive to the background environment, such as variant illumination or pedestrian density, which can significantly affect performance of the deep neural network. To solve this problem, this paper adopts an image enhancement policy based on the Retinex theory to preprocess training samples to reduce the influence of light changes. Firstly, we use the image enhancement method to enhance the contrast of the image and highlight the color of the object itself. Next, we put the initial sample into darknet frame with YOLOv3 to train the detection model1 and put the enhanced sample into the YOLOv3 to train the detection model 2. Finally, we tested these two models with 200 pedestrian pictures of four different scenarios. The experimental results show that the model trained by Retinex image enhancement has a more accurate detection rate of 94% compared with the model without the enhancement sample trained.