{"title":"A Higher-Accuracy Color- and Gray-Image Training Method for Road-Object Detection with YOLO","authors":"Surapong Jina, W. Sae-Tang","doi":"10.1145/3594441.3594479","DOIUrl":null,"url":null,"abstract":"This paper proposes a higher-accuracy color- and gray-image training method for road-object detection with You Only Look Once (YOLO). Image pre-processing is performed before feeding images to a network for object detection, i.e., color-to-gray conversion and edge detection. Then, the original images, the gray version of the original images, and the edge images are fed to the main network. YOLO version 5 (YOLOv5) was used as a backbone network. The model was trained by using custom traffic dataset which consists of 738 training images and 185 validating images, and they are separated into 7 classes. The proposed method achieved a mAP of 44.97% when performing a validation by using both RGB and grayscale images. It is higher than that of the conventional training method using only original images. The results also confirm that higher accuracy can be achieved even for night vision images. The proposed method could serve for applications which be used in various factors and environments.","PeriodicalId":247919,"journal":{"name":"Proceedings of the 2023 8th International Conference on Information and Education Innovations","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 8th International Conference on Information and Education Innovations","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3594441.3594479","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes a higher-accuracy color- and gray-image training method for road-object detection with You Only Look Once (YOLO). Image pre-processing is performed before feeding images to a network for object detection, i.e., color-to-gray conversion and edge detection. Then, the original images, the gray version of the original images, and the edge images are fed to the main network. YOLO version 5 (YOLOv5) was used as a backbone network. The model was trained by using custom traffic dataset which consists of 738 training images and 185 validating images, and they are separated into 7 classes. The proposed method achieved a mAP of 44.97% when performing a validation by using both RGB and grayscale images. It is higher than that of the conventional training method using only original images. The results also confirm that higher accuracy can be achieved even for night vision images. The proposed method could serve for applications which be used in various factors and environments.