基于注意机制的小型目标车辆行为识别网络

Proceedings of the 4th World Symposium on Software Engineering Pub Date : 2022-09-28 DOI:10.1145/3568364.3568365

Zhuoyi Wu, Zhiqiang Ma, Caijilahu Bao, Leixiao Li, Xiaoxu Zhang, Fangyuan Zhu

{"title":"基于注意机制的小型目标车辆行为识别网络","authors":"Zhuoyi Wu, Zhiqiang Ma, Caijilahu Bao, Leixiao Li, Xiaoxu Zhang, Fangyuan Zhu","doi":"10.1145/3568364.3568365","DOIUrl":null,"url":null,"abstract":"Vehicle behavior recognition is one of the most important research fields in intelligent transportation. In the field of traffic surveillance videos, target vehicles only occupy small proportion of the video frame,making it difficult for the network to extract key features of small target vehicles. To solve this problem, this paper constructed vehicle-3C, a video dataset for vehicle behavior recognition, and was combined with deep learning theory. As a result, it proposed a vehicle behavior recognition method based on the theory of Two-Stream with Attention Network (TSAN). For this method, the two-stream convolutional network is used as a basic framework and attention units are embedded in it to extract temporal (motion) features and spatial features of target vehicles in traffic surveillance videos, and then the temporal and spatial features are fused for category judgment. The results showed that TSAN could achieve 81.8% identification accuracy in vehicle-3C dataset, which was better than other behavior recognition methods based on deep learning. In addition, TSAN also achieves a recognition accuracy of 77.2% in dataset UCF-101, which verifies the generalization performance of the network. Experimental results showed that TSAN could accurately extract and effectively fuse the temporal and spatial features of the foreground target vehicle in the video and achieve high recognition accuracy in the vehicle behavior recognition task.","PeriodicalId":262799,"journal":{"name":"Proceedings of the 4th World Symposium on Software Engineering","volume":"14 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Behavior Recognition Network for Small Target Vehicle based on Attention Mechanism\",\"authors\":\"Zhuoyi Wu, Zhiqiang Ma, Caijilahu Bao, Leixiao Li, Xiaoxu Zhang, Fangyuan Zhu\",\"doi\":\"10.1145/3568364.3568365\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vehicle behavior recognition is one of the most important research fields in intelligent transportation. In the field of traffic surveillance videos, target vehicles only occupy small proportion of the video frame,making it difficult for the network to extract key features of small target vehicles. To solve this problem, this paper constructed vehicle-3C, a video dataset for vehicle behavior recognition, and was combined with deep learning theory. As a result, it proposed a vehicle behavior recognition method based on the theory of Two-Stream with Attention Network (TSAN). For this method, the two-stream convolutional network is used as a basic framework and attention units are embedded in it to extract temporal (motion) features and spatial features of target vehicles in traffic surveillance videos, and then the temporal and spatial features are fused for category judgment. The results showed that TSAN could achieve 81.8% identification accuracy in vehicle-3C dataset, which was better than other behavior recognition methods based on deep learning. In addition, TSAN also achieves a recognition accuracy of 77.2% in dataset UCF-101, which verifies the generalization performance of the network. Experimental results showed that TSAN could accurately extract and effectively fuse the temporal and spatial features of the foreground target vehicle in the video and achieve high recognition accuracy in the vehicle behavior recognition task.\",\"PeriodicalId\":262799,\"journal\":{\"name\":\"Proceedings of the 4th World Symposium on Software Engineering\",\"volume\":\"14 3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th World Symposium on Software Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3568364.3568365\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th World Symposium on Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3568364.3568365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

车辆行为识别是智能交通领域的重要研究方向之一。在交通监控视频领域，目标车辆在视频帧中所占的比例很小，使得网络很难提取出目标车辆的关键特征。为了解决这一问题，本文构建了用于车辆行为识别的视频数据集vehicle- 3c，并结合深度学习理论。在此基础上，提出了一种基于双流注意网络(TSAN)理论的车辆行为识别方法。该方法以两流卷积网络为基本框架，嵌入注意单元，提取交通监控视频中目标车辆的时间(运动)特征和空间特征，然后融合时空特征进行分类判断。结果表明，TSAN在vehicle-3C数据集上的识别准确率达到81.8%，优于其他基于深度学习的行为识别方法。此外，TSAN在UCF-101数据集上也达到了77.2%的识别准确率，验证了网络的泛化性能。实验结果表明，TSAN能够准确提取并有效融合视频中前景目标车辆的时空特征，在车辆行为识别任务中具有较高的识别精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Behavior Recognition Network for Small Target Vehicle based on Attention Mechanism

Vehicle behavior recognition is one of the most important research fields in intelligent transportation. In the field of traffic surveillance videos, target vehicles only occupy small proportion of the video frame,making it difficult for the network to extract key features of small target vehicles. To solve this problem, this paper constructed vehicle-3C, a video dataset for vehicle behavior recognition, and was combined with deep learning theory. As a result, it proposed a vehicle behavior recognition method based on the theory of Two-Stream with Attention Network (TSAN). For this method, the two-stream convolutional network is used as a basic framework and attention units are embedded in it to extract temporal (motion) features and spatial features of target vehicles in traffic surveillance videos, and then the temporal and spatial features are fused for category judgment. The results showed that TSAN could achieve 81.8% identification accuracy in vehicle-3C dataset, which was better than other behavior recognition methods based on deep learning. In addition, TSAN also achieves a recognition accuracy of 77.2% in dataset UCF-101, which verifies the generalization performance of the network. Experimental results showed that TSAN could accurately extract and effectively fuse the temporal and spatial features of the foreground target vehicle in the video and achieve high recognition accuracy in the vehicle behavior recognition task.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 4th World Symposium on Software Engineering

自引率

0.00%

发文量