基于深度神经网络和匹配算法的视频交通标志识别

2019 International Conference on Artificial Intelligence: Applications and Innovations (IC-AIAI) Pub Date : 2019-09-01 DOI:10.1109/IC-AIAI48757.2019.00013

I. Belkin, S. Tkachenko, D. Yudin

{"title":"基于深度神经网络和匹配算法的视频交通标志识别","authors":"I. Belkin, S. Tkachenko, D. Yudin","doi":"10.1109/IC-AIAI48757.2019.00013","DOIUrl":null,"url":null,"abstract":"The paper analyzes data sets containing images with labeled traffic signs, as well as modern approaches for their detection and classification on images of urban scenes. Particular attention is paid to the recognition of Russian types of traffic signs. Various modern architectures of deep neural networks for the simultaneous object detection and classification were studied, including Faster R-CNN, Mask R-CNN, Cascade R-CNN, RetinaNet. To increase the efficiency of neural network recognition of objects in a video sequence, the Seq-BBox Matching algorithm is used. Training and testing of the proposed approach was carried out on Russian Traffic Sign Dataset and IceVision Dataset containing over 150 types of road signs and more than 65,000 marked images. For all the approaches considered, quality metrics are defined: mean average precision mAP, mean average recall mAR and processing time of one frame. The highest quality performance was demonstrated by the architecture of Faster R-CNN with Seq-BBox Matching, while the highest performance is provided by the architecture of RetinaNet. Implementation was carried out using the Python 3.7 programming language and PyTorch deep learning library using NVidia CUDA technology. Performance indicators were obtained on the workstation with the NVidia Tesla V-100 32GB video card. The obtained results demonstrate the possibility of applying the proposed approach both for the resource-intensive procedure for automated labeling of road scene images for new data sets preparation, and for traffic sign recognition in on-board computer vision systems of unmanned vehicles.","PeriodicalId":374193,"journal":{"name":"2019 International Conference on Artificial Intelligence: Applications and Innovations (IC-AIAI)","volume":"150 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Traffic Sign Recognition on Video Sequence using Deep Neural Networks and Matching Algorithm\",\"authors\":\"I. Belkin, S. Tkachenko, D. Yudin\",\"doi\":\"10.1109/IC-AIAI48757.2019.00013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper analyzes data sets containing images with labeled traffic signs, as well as modern approaches for their detection and classification on images of urban scenes. Particular attention is paid to the recognition of Russian types of traffic signs. Various modern architectures of deep neural networks for the simultaneous object detection and classification were studied, including Faster R-CNN, Mask R-CNN, Cascade R-CNN, RetinaNet. To increase the efficiency of neural network recognition of objects in a video sequence, the Seq-BBox Matching algorithm is used. Training and testing of the proposed approach was carried out on Russian Traffic Sign Dataset and IceVision Dataset containing over 150 types of road signs and more than 65,000 marked images. For all the approaches considered, quality metrics are defined: mean average precision mAP, mean average recall mAR and processing time of one frame. The highest quality performance was demonstrated by the architecture of Faster R-CNN with Seq-BBox Matching, while the highest performance is provided by the architecture of RetinaNet. Implementation was carried out using the Python 3.7 programming language and PyTorch deep learning library using NVidia CUDA technology. Performance indicators were obtained on the workstation with the NVidia Tesla V-100 32GB video card. The obtained results demonstrate the possibility of applying the proposed approach both for the resource-intensive procedure for automated labeling of road scene images for new data sets preparation, and for traffic sign recognition in on-board computer vision systems of unmanned vehicles.\",\"PeriodicalId\":374193,\"journal\":{\"name\":\"2019 International Conference on Artificial Intelligence: Applications and Innovations (IC-AIAI)\",\"volume\":\"150 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Artificial Intelligence: Applications and Innovations (IC-AIAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC-AIAI48757.2019.00013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Artificial Intelligence: Applications and Innovations (IC-AIAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-AIAI48757.2019.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

本文分析了包含有标记的交通标志图像的数据集，以及城市场景图像中交通标志检测和分类的现代方法。特别注意的是俄罗斯类型的交通标志的识别。研究了用于同时目标检测和分类的各种现代深度神经网络架构，包括Faster R-CNN、Mask R-CNN、Cascade R-CNN、RetinaNet。为了提高神经网络对视频序列中目标的识别效率，采用了Seq-BBox匹配算法。该方法在俄罗斯交通标志数据集和IceVision数据集上进行了训练和测试，其中包含150多种道路标志和超过65,000个标记图像。对于所考虑的所有方法，定义了质量度量:平均平均精度mAP，平均平均召回率mAR和一帧的处理时间。采用Seq-BBox匹配的Faster R-CNN架构表现出最高的质量性能，而采用RetinaNet架构表现出最高的性能。使用Python 3.7编程语言和使用NVidia CUDA技术的PyTorch深度学习库进行实现。在使用NVidia Tesla V-100 32GB显卡的工作站上获得性能指标。所获得的结果表明，所提出的方法既可以应用于资源密集型的道路场景图像自动标记过程，也可以应用于无人驾驶车辆车载计算机视觉系统中的交通标志识别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Traffic Sign Recognition on Video Sequence using Deep Neural Networks and Matching Algorithm

The paper analyzes data sets containing images with labeled traffic signs, as well as modern approaches for their detection and classification on images of urban scenes. Particular attention is paid to the recognition of Russian types of traffic signs. Various modern architectures of deep neural networks for the simultaneous object detection and classification were studied, including Faster R-CNN, Mask R-CNN, Cascade R-CNN, RetinaNet. To increase the efficiency of neural network recognition of objects in a video sequence, the Seq-BBox Matching algorithm is used. Training and testing of the proposed approach was carried out on Russian Traffic Sign Dataset and IceVision Dataset containing over 150 types of road signs and more than 65,000 marked images. For all the approaches considered, quality metrics are defined: mean average precision mAP, mean average recall mAR and processing time of one frame. The highest quality performance was demonstrated by the architecture of Faster R-CNN with Seq-BBox Matching, while the highest performance is provided by the architecture of RetinaNet. Implementation was carried out using the Python 3.7 programming language and PyTorch deep learning library using NVidia CUDA technology. Performance indicators were obtained on the workstation with the NVidia Tesla V-100 32GB video card. The obtained results demonstrate the possibility of applying the proposed approach both for the resource-intensive procedure for automated labeling of road scene images for new data sets preparation, and for traffic sign recognition in on-board computer vision systems of unmanned vehicles.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 International Conference on Artificial Intelligence: Applications and Innovations (IC-AIAI)

自引率

0.00%

发文量