{"title":"使用视觉变压器识别交通标志","authors":"Haolan Wang","doi":"10.1145/3546157.3546166","DOIUrl":null,"url":null,"abstract":"Traffic sign recognition is an integral part of future autonomous driving systems. Deep learning has been applied in this task, while the performance of the recent vision Transformers is unexplored. In this study, eight different vision Transformers are validated in three real-world traffic sign datasets for the first time. The experimental results demonstrate that the best vision Transformer has a performance between the pre-trained DenseNet and the DenseNet trained from scratch. Besides, the best vision Transformers model has less training time compared to DenseNet.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Traffic Sign Recognition with Vision Transformers\",\"authors\":\"Haolan Wang\",\"doi\":\"10.1145/3546157.3546166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traffic sign recognition is an integral part of future autonomous driving systems. Deep learning has been applied in this task, while the performance of the recent vision Transformers is unexplored. In this study, eight different vision Transformers are validated in three real-world traffic sign datasets for the first time. The experimental results demonstrate that the best vision Transformer has a performance between the pre-trained DenseNet and the DenseNet trained from scratch. Besides, the best vision Transformers model has less training time compared to DenseNet.\",\"PeriodicalId\":422215,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Information System and Data Mining\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Information System and Data Mining\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3546157.3546166\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Information System and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3546157.3546166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Traffic sign recognition is an integral part of future autonomous driving systems. Deep learning has been applied in this task, while the performance of the recent vision Transformers is unexplored. In this study, eight different vision Transformers are validated in three real-world traffic sign datasets for the first time. The experimental results demonstrate that the best vision Transformer has a performance between the pre-trained DenseNet and the DenseNet trained from scratch. Besides, the best vision Transformers model has less training time compared to DenseNet.