{"title":"交通标志识别的两阶段方法研究:少对多还是多对多?","authors":"Meng-Huan Hsieh, Qiang Zhao","doi":"10.1109/ICMLC51923.2020.9469574","DOIUrl":null,"url":null,"abstract":"Needless to say, traffic sign recognition (TSR) is important for safety driving. A TSR system can make the driver more aware of the road situation and condition, and thus can reduce traffic accidents. A TSR system contains mainly two parts, one for detection and another for classification. Recently, deep learners such as You Only Look One (YOLO) and Single Shot Multi-Box Detector (SSD) have been proposed for implementing these two parts together. However, since there are many different traffic signs, training a good model is usually time consuming. In this study, we investigate the efficiency/efficacy of two different two-stage approaches for TSR. The first approach is a few-to-many approach, in which YOLO-v3 is used to detect traffic signs based on their shapes and VGG-16 is used to classify the signs into detailed classes. The second one is a many-to-many approach, in which traffic signs are detected and classified by YOLO-v3, and VGG-16 is used to correct signs miss-classified by YOLO-v3. Experiment results show that, the average accuracy of the many-to-many approach is 93.98% and that of the few-to-many approach is 88.29% for the German Traffic Sign Detection Benchmark dataset. Compared with the YOLO-v3 alone approach, the many-to-many approach has a 23.08% gain but the few-to-many approach has only a 6.2% gain.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Study on Two-Stage Approach for Traffic Sign Recognition: Few-to-Many or Many-to-Many?\",\"authors\":\"Meng-Huan Hsieh, Qiang Zhao\",\"doi\":\"10.1109/ICMLC51923.2020.9469574\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Needless to say, traffic sign recognition (TSR) is important for safety driving. A TSR system can make the driver more aware of the road situation and condition, and thus can reduce traffic accidents. A TSR system contains mainly two parts, one for detection and another for classification. Recently, deep learners such as You Only Look One (YOLO) and Single Shot Multi-Box Detector (SSD) have been proposed for implementing these two parts together. However, since there are many different traffic signs, training a good model is usually time consuming. In this study, we investigate the efficiency/efficacy of two different two-stage approaches for TSR. The first approach is a few-to-many approach, in which YOLO-v3 is used to detect traffic signs based on their shapes and VGG-16 is used to classify the signs into detailed classes. The second one is a many-to-many approach, in which traffic signs are detected and classified by YOLO-v3, and VGG-16 is used to correct signs miss-classified by YOLO-v3. Experiment results show that, the average accuracy of the many-to-many approach is 93.98% and that of the few-to-many approach is 88.29% for the German Traffic Sign Detection Benchmark dataset. Compared with the YOLO-v3 alone approach, the many-to-many approach has a 23.08% gain but the few-to-many approach has only a 6.2% gain.\",\"PeriodicalId\":170815,\"journal\":{\"name\":\"2020 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC51923.2020.9469574\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC51923.2020.9469574","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
毋庸置疑,交通标志识别(TSR)对于安全驾驶至关重要。TSR系统可以使驾驶员更加了解道路状况,从而减少交通事故。TSR系统主要包括检测和分类两部分。最近,人们提出了You Only Look One (YOLO)和Single Shot Multi-Box Detector (SSD)等深度学习器来共同实现这两个部分。然而,由于有许多不同的交通标志,训练一个好的模型通常是耗时的。在本研究中,我们研究了两种不同的两阶段TSR方法的效率/疗效。第一种方法是少对多方法,利用YOLO-v3对交通标志进行形状检测,利用VGG-16对交通标志进行详细分类。第二种是多对多方法,由YOLO-v3对交通标志进行检测和分类,使用VGG-16对YOLO-v3未分类的交通标志进行校正。实验结果表明,对于德国交通标志检测基准数据集,多对多方法的平均准确率为93.98%,少对多方法的平均准确率为88.29%。与单独的YOLO-v3方法相比,多对多方法有23.08%的增益,而少对多方法只有6.2%的增益。
A Study on Two-Stage Approach for Traffic Sign Recognition: Few-to-Many or Many-to-Many?
Needless to say, traffic sign recognition (TSR) is important for safety driving. A TSR system can make the driver more aware of the road situation and condition, and thus can reduce traffic accidents. A TSR system contains mainly two parts, one for detection and another for classification. Recently, deep learners such as You Only Look One (YOLO) and Single Shot Multi-Box Detector (SSD) have been proposed for implementing these two parts together. However, since there are many different traffic signs, training a good model is usually time consuming. In this study, we investigate the efficiency/efficacy of two different two-stage approaches for TSR. The first approach is a few-to-many approach, in which YOLO-v3 is used to detect traffic signs based on their shapes and VGG-16 is used to classify the signs into detailed classes. The second one is a many-to-many approach, in which traffic signs are detected and classified by YOLO-v3, and VGG-16 is used to correct signs miss-classified by YOLO-v3. Experiment results show that, the average accuracy of the many-to-many approach is 93.98% and that of the few-to-many approach is 88.29% for the German Traffic Sign Detection Benchmark dataset. Compared with the YOLO-v3 alone approach, the many-to-many approach has a 23.08% gain but the few-to-many approach has only a 6.2% gain.