基于多头自注意视觉变换器模型的高效道路交通视频拥堵分类

IF 1.1 Q3 TRANSPORTATION SCIENCE & TECHNOLOGY
Sofiane Abdelkrim Khalladi, Asmâa Ouessai, Nadir Kamel Benamara, M. Keche
{"title":"基于多头自注意视觉变换器模型的高效道路交通视频拥堵分类","authors":"Sofiane Abdelkrim Khalladi, Asmâa Ouessai, Nadir Kamel Benamara, M. Keche","doi":"10.2478/ttj-2024-0003","DOIUrl":null,"url":null,"abstract":"\n Due to rapid population growth, traffic congestion has become one of the major issues in urban areas. The utilization of technology may help to address this issue. This paper proposes a new Multi-head Self-attention Vision Transformer (MSViT) based macroscopic approach, for road traffic congestion classification. To evaluate this approach, we use the UCSD (University of California San Diego) dataset that includes different weather conditions (clear, overcast and rainy) and different traffic scenarios (light, medium and heavy). The classification accuracy reached a high level of 99.76% with this dataset and 99.37% when night-mode frames are added to it. The proposed MSViT based method outperforms the state-of-the-art macroscopic and microscopic methods that have been evaluated using the same UCSD dataset, which makes it an efficient solution for traffic congestion prediction.","PeriodicalId":44110,"journal":{"name":"Transport and Telecommunication Journal","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Road Traffic Video Congestion Classification Based on the Multi-Head Self-Attention Vision Transformer Model\",\"authors\":\"Sofiane Abdelkrim Khalladi, Asmâa Ouessai, Nadir Kamel Benamara, M. Keche\",\"doi\":\"10.2478/ttj-2024-0003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Due to rapid population growth, traffic congestion has become one of the major issues in urban areas. The utilization of technology may help to address this issue. This paper proposes a new Multi-head Self-attention Vision Transformer (MSViT) based macroscopic approach, for road traffic congestion classification. To evaluate this approach, we use the UCSD (University of California San Diego) dataset that includes different weather conditions (clear, overcast and rainy) and different traffic scenarios (light, medium and heavy). The classification accuracy reached a high level of 99.76% with this dataset and 99.37% when night-mode frames are added to it. The proposed MSViT based method outperforms the state-of-the-art macroscopic and microscopic methods that have been evaluated using the same UCSD dataset, which makes it an efficient solution for traffic congestion prediction.\",\"PeriodicalId\":44110,\"journal\":{\"name\":\"Transport and Telecommunication Journal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transport and Telecommunication Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2478/ttj-2024-0003\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transport and Telecommunication Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/ttj-2024-0003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

由于人口的快速增长,交通拥堵已成为城市地区的主要问题之一。利用技术可能有助于解决这一问题。本文提出了一种新的基于多头自注意视觉变换器(MSViT)的宏观方法,用于道路交通拥堵分类。为了评估这种方法,我们使用了 UCSD(加州大学圣地亚哥分校)数据集,其中包括不同的天气条件(晴天、阴天和雨天)和不同的交通场景(轻度、中度和重度)。该数据集的分类准确率高达 99.76%,如果加入夜间模式帧,分类准确率将达到 99.37%。所提出的基于 MSViT 的方法优于使用同一 UCSD 数据集进行评估的最先进的宏观和微观方法,这使其成为交通拥堵预测的有效解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Efficient Road Traffic Video Congestion Classification Based on the Multi-Head Self-Attention Vision Transformer Model
Due to rapid population growth, traffic congestion has become one of the major issues in urban areas. The utilization of technology may help to address this issue. This paper proposes a new Multi-head Self-attention Vision Transformer (MSViT) based macroscopic approach, for road traffic congestion classification. To evaluate this approach, we use the UCSD (University of California San Diego) dataset that includes different weather conditions (clear, overcast and rainy) and different traffic scenarios (light, medium and heavy). The classification accuracy reached a high level of 99.76% with this dataset and 99.37% when night-mode frames are added to it. The proposed MSViT based method outperforms the state-of-the-art macroscopic and microscopic methods that have been evaluated using the same UCSD dataset, which makes it an efficient solution for traffic congestion prediction.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Transport and Telecommunication Journal
Transport and Telecommunication Journal TRANSPORTATION SCIENCE & TECHNOLOGY-
CiteScore
3.00
自引率
0.00%
发文量
21
审稿时长
35 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信