{"title":"Road Traffic Classification from Nighttime Videos Using the Multihead Self-Attention Vision Transformer Model and the SVM","authors":"Sofiane Abdelkrim Khalladi, Asmâa Ouessai, Mokhtar Keche","doi":"10.3103/S0146411624700652","DOIUrl":null,"url":null,"abstract":"<p>Intelligent transport systems (ITSs) have emerged as a groundbreaking solution to address the challenges associated with road traffic, which are enhancing road utilization efficiency, providing convenient and safe transportation, and reducing energy consumption. ITS leverages advanced technologies to collect, store, and deliver real-time road traffic information, enabling intelligent decision-making and optimizing various aspects of transportation systems. As a contribution in this matter, we propose in this paper a novel efficient macroscopic approach, based on the multihead self-attention vision transformer (MSViT), for categorizing road traffic congestion, from nighttime videos, into three classes: light, medium, and heavy. To assess the performance of our approach, we conducted experiments using the nighttime UCSD (University of California San Diego) dataset, which includes various weather conditions (clear, overcast, and rainy) and traffic scenarios (light, medium, and heavy). The classification accuracy reached a high level of 94.24%. By incorporating a support vector machine (SVM) classifier into this method, we managed to enhance this accuracy to the outstanding level of 98.92%, thus outperforming the existing state-of-the-art methods that were evaluated using the same UCSD dataset, furthermore, the execution time was optimized.</p>","PeriodicalId":46238,"journal":{"name":"AUTOMATIC CONTROL AND COMPUTER SCIENCES","volume":"58 5","pages":"544 - 554"},"PeriodicalIF":0.6000,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AUTOMATIC CONTROL AND COMPUTER SCIENCES","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S0146411624700652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Intelligent transport systems (ITSs) have emerged as a groundbreaking solution to address the challenges associated with road traffic, which are enhancing road utilization efficiency, providing convenient and safe transportation, and reducing energy consumption. ITS leverages advanced technologies to collect, store, and deliver real-time road traffic information, enabling intelligent decision-making and optimizing various aspects of transportation systems. As a contribution in this matter, we propose in this paper a novel efficient macroscopic approach, based on the multihead self-attention vision transformer (MSViT), for categorizing road traffic congestion, from nighttime videos, into three classes: light, medium, and heavy. To assess the performance of our approach, we conducted experiments using the nighttime UCSD (University of California San Diego) dataset, which includes various weather conditions (clear, overcast, and rainy) and traffic scenarios (light, medium, and heavy). The classification accuracy reached a high level of 94.24%. By incorporating a support vector machine (SVM) classifier into this method, we managed to enhance this accuracy to the outstanding level of 98.92%, thus outperforming the existing state-of-the-art methods that were evaluated using the same UCSD dataset, furthermore, the execution time was optimized.
期刊介绍:
Automatic Control and Computer Sciences is a peer reviewed journal that publishes articles on• Control systems, cyber-physical system, real-time systems, robotics, smart sensors, embedded intelligence • Network information technologies, information security, statistical methods of data processing, distributed artificial intelligence, complex systems modeling, knowledge representation, processing and management • Signal and image processing, machine learning, machine perception, computer vision