Teng Xu, Ling Ren, Tian Shi, Yuan Gao, Jian-Bang Ding, Rong-Chen Jin
{"title":"基于改进型 Yolox 的交通标志检测算法","authors":"Teng Xu, Ling Ren, Tian Shi, Yuan Gao, Jian-Bang Ding, Rong-Chen Jin","doi":"10.5755/j01.itc.52.4.34039","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel PVF-YOLO model to extract the multi-scale traffic sign features more effectively during car driving. Firstly, the original convolution module is replaced with the Omni-Dimensional convolution (ODconv) and the feature information obtained from the shallow feature layer is incorporated into the network. Secondly, this paper proposes a parallel structure block module for capturing multi-scale features. This module uses the Large Kernel Attention (LKA) and Visual Multilayer Perceptron (Visual MLP) to capture the information generated by the network model. It enhances the representation ability of feature maps. Next, in the process of training, the gradient concentration algorithm is used to optimize the initial Stochastic Gradient Descent (SGD). Under the condition of real-time detection, it improves the detection accuracy. Finally, to improve the robustness of the model, this paper conducts extensive experiments. Tsinghua-Tencent 100K (TT100K), Changsha University of Science and Technology CCTSDB (CSUST Chinese Traffic Sign Detection Benchmark) are used as the training data set. It verifies that the PVF-YOLO method proposed in this paper enhances the detection ability of traffic signs of different scales, and the detection speed and accuracy are better than the original model.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"2 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Traffic Sign Detection Algorithm Based on Improved Yolox\",\"authors\":\"Teng Xu, Ling Ren, Tian Shi, Yuan Gao, Jian-Bang Ding, Rong-Chen Jin\",\"doi\":\"10.5755/j01.itc.52.4.34039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a novel PVF-YOLO model to extract the multi-scale traffic sign features more effectively during car driving. Firstly, the original convolution module is replaced with the Omni-Dimensional convolution (ODconv) and the feature information obtained from the shallow feature layer is incorporated into the network. Secondly, this paper proposes a parallel structure block module for capturing multi-scale features. This module uses the Large Kernel Attention (LKA) and Visual Multilayer Perceptron (Visual MLP) to capture the information generated by the network model. It enhances the representation ability of feature maps. Next, in the process of training, the gradient concentration algorithm is used to optimize the initial Stochastic Gradient Descent (SGD). Under the condition of real-time detection, it improves the detection accuracy. Finally, to improve the robustness of the model, this paper conducts extensive experiments. Tsinghua-Tencent 100K (TT100K), Changsha University of Science and Technology CCTSDB (CSUST Chinese Traffic Sign Detection Benchmark) are used as the training data set. It verifies that the PVF-YOLO method proposed in this paper enhances the detection ability of traffic signs of different scales, and the detection speed and accuracy are better than the original model.\",\"PeriodicalId\":54982,\"journal\":{\"name\":\"Information Technology and Control\",\"volume\":\"2 1\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2023-12-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Technology and Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.5755/j01.itc.52.4.34039\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Technology and Control","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.5755/j01.itc.52.4.34039","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Traffic Sign Detection Algorithm Based on Improved Yolox
This paper proposes a novel PVF-YOLO model to extract the multi-scale traffic sign features more effectively during car driving. Firstly, the original convolution module is replaced with the Omni-Dimensional convolution (ODconv) and the feature information obtained from the shallow feature layer is incorporated into the network. Secondly, this paper proposes a parallel structure block module for capturing multi-scale features. This module uses the Large Kernel Attention (LKA) and Visual Multilayer Perceptron (Visual MLP) to capture the information generated by the network model. It enhances the representation ability of feature maps. Next, in the process of training, the gradient concentration algorithm is used to optimize the initial Stochastic Gradient Descent (SGD). Under the condition of real-time detection, it improves the detection accuracy. Finally, to improve the robustness of the model, this paper conducts extensive experiments. Tsinghua-Tencent 100K (TT100K), Changsha University of Science and Technology CCTSDB (CSUST Chinese Traffic Sign Detection Benchmark) are used as the training data set. It verifies that the PVF-YOLO method proposed in this paper enhances the detection ability of traffic signs of different scales, and the detection speed and accuracy are better than the original model.
期刊介绍:
Periodical journal covers a wide field of computer science and control systems related problems including:
-Software and hardware engineering;
-Management systems engineering;
-Information systems and databases;
-Embedded systems;
-Physical systems modelling and application;
-Computer networks and cloud computing;
-Data visualization;
-Human-computer interface;
-Computer graphics, visual analytics, and multimedia systems.