基于注意机制的小型目标车辆行为识别网络

Zhuoyi Wu, Zhiqiang Ma, Caijilahu Bao, Leixiao Li, Xiaoxu Zhang, Fangyuan Zhu
{"title":"基于注意机制的小型目标车辆行为识别网络","authors":"Zhuoyi Wu, Zhiqiang Ma, Caijilahu Bao, Leixiao Li, Xiaoxu Zhang, Fangyuan Zhu","doi":"10.1145/3568364.3568365","DOIUrl":null,"url":null,"abstract":"Vehicle behavior recognition is one of the most important research fields in intelligent transportation. In the field of traffic surveillance videos, target vehicles only occupy small proportion of the video frame,making it difficult for the network to extract key features of small target vehicles. To solve this problem, this paper constructed vehicle-3C, a video dataset for vehicle behavior recognition, and was combined with deep learning theory. As a result, it proposed a vehicle behavior recognition method based on the theory of Two-Stream with Attention Network (TSAN). For this method, the two-stream convolutional network is used as a basic framework and attention units are embedded in it to extract temporal (motion) features and spatial features of target vehicles in traffic surveillance videos, and then the temporal and spatial features are fused for category judgment. The results showed that TSAN could achieve 81.8% identification accuracy in vehicle-3C dataset, which was better than other behavior recognition methods based on deep learning. In addition, TSAN also achieves a recognition accuracy of 77.2% in dataset UCF-101, which verifies the generalization performance of the network. Experimental results showed that TSAN could accurately extract and effectively fuse the temporal and spatial features of the foreground target vehicle in the video and achieve high recognition accuracy in the vehicle behavior recognition task.","PeriodicalId":262799,"journal":{"name":"Proceedings of the 4th World Symposium on Software Engineering","volume":"14 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Behavior Recognition Network for Small Target Vehicle based on Attention Mechanism\",\"authors\":\"Zhuoyi Wu, Zhiqiang Ma, Caijilahu Bao, Leixiao Li, Xiaoxu Zhang, Fangyuan Zhu\",\"doi\":\"10.1145/3568364.3568365\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vehicle behavior recognition is one of the most important research fields in intelligent transportation. In the field of traffic surveillance videos, target vehicles only occupy small proportion of the video frame,making it difficult for the network to extract key features of small target vehicles. To solve this problem, this paper constructed vehicle-3C, a video dataset for vehicle behavior recognition, and was combined with deep learning theory. As a result, it proposed a vehicle behavior recognition method based on the theory of Two-Stream with Attention Network (TSAN). For this method, the two-stream convolutional network is used as a basic framework and attention units are embedded in it to extract temporal (motion) features and spatial features of target vehicles in traffic surveillance videos, and then the temporal and spatial features are fused for category judgment. The results showed that TSAN could achieve 81.8% identification accuracy in vehicle-3C dataset, which was better than other behavior recognition methods based on deep learning. In addition, TSAN also achieves a recognition accuracy of 77.2% in dataset UCF-101, which verifies the generalization performance of the network. Experimental results showed that TSAN could accurately extract and effectively fuse the temporal and spatial features of the foreground target vehicle in the video and achieve high recognition accuracy in the vehicle behavior recognition task.\",\"PeriodicalId\":262799,\"journal\":{\"name\":\"Proceedings of the 4th World Symposium on Software Engineering\",\"volume\":\"14 3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th World Symposium on Software Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3568364.3568365\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th World Symposium on Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3568364.3568365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

车辆行为识别是智能交通领域的重要研究方向之一。在交通监控视频领域,目标车辆在视频帧中所占的比例很小,使得网络很难提取出目标车辆的关键特征。为了解决这一问题,本文构建了用于车辆行为识别的视频数据集vehicle- 3c,并结合深度学习理论。在此基础上,提出了一种基于双流注意网络(TSAN)理论的车辆行为识别方法。该方法以两流卷积网络为基本框架,嵌入注意单元,提取交通监控视频中目标车辆的时间(运动)特征和空间特征,然后融合时空特征进行分类判断。结果表明,TSAN在vehicle-3C数据集上的识别准确率达到81.8%,优于其他基于深度学习的行为识别方法。此外,TSAN在UCF-101数据集上也达到了77.2%的识别准确率,验证了网络的泛化性能。实验结果表明,TSAN能够准确提取并有效融合视频中前景目标车辆的时空特征,在车辆行为识别任务中具有较高的识别精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Behavior Recognition Network for Small Target Vehicle based on Attention Mechanism
Vehicle behavior recognition is one of the most important research fields in intelligent transportation. In the field of traffic surveillance videos, target vehicles only occupy small proportion of the video frame,making it difficult for the network to extract key features of small target vehicles. To solve this problem, this paper constructed vehicle-3C, a video dataset for vehicle behavior recognition, and was combined with deep learning theory. As a result, it proposed a vehicle behavior recognition method based on the theory of Two-Stream with Attention Network (TSAN). For this method, the two-stream convolutional network is used as a basic framework and attention units are embedded in it to extract temporal (motion) features and spatial features of target vehicles in traffic surveillance videos, and then the temporal and spatial features are fused for category judgment. The results showed that TSAN could achieve 81.8% identification accuracy in vehicle-3C dataset, which was better than other behavior recognition methods based on deep learning. In addition, TSAN also achieves a recognition accuracy of 77.2% in dataset UCF-101, which verifies the generalization performance of the network. Experimental results showed that TSAN could accurately extract and effectively fuse the temporal and spatial features of the foreground target vehicle in the video and achieve high recognition accuracy in the vehicle behavior recognition task.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信