Hassam Tahir, Muhammad Shahbaz Khan, Muhammad Owais Tariq
{"title":"Performance Analysis and Comparison of Faster R-CNN, Mask R-CNN and ResNet50 for the Detection and Counting of Vehicles","authors":"Hassam Tahir, Muhammad Shahbaz Khan, Muhammad Owais Tariq","doi":"10.1109/ICCCIS51004.2021.9397079","DOIUrl":null,"url":null,"abstract":"Traffic congestion is one of the major issues of urban cities. The conventional techniques used usually to control traffic via different types of sensors are less precise and expensive. Intelligent solutions using deep learning algorithms provide promising results in terms of better performance, prompt decision making and cost effectiveness. This article aims at providing an easy, more accurate and less expensive solution for the traffic control issues specifically at the traffic signals. Three deep neural network (DNN) frameworks i.e. Faster R-CNN, Mask R-CNN and ResNet-50 have been implemented and compared for vehicle detection, classification and counting. A dataset of 3200 images of different vehicles is used for the training of the models. The training is carried out on NVIDIA 1060TI 3GB GPU. Trained system is tested on indigenous recorded video data of 8 hours for two routes at a traffic signal. Results demonstrated that the overall detection accuracy of Faster R-CNN and Mask R-CNN is >80%, whereas detection accuracy of ResNet-50 is >75%. The counting accuracies of Faster R-CNN, Mask R-CNN and ResNet-50 are >75%, >70% and >62% respectively. Various error analysis have been carried out to validate the performance of the aforementioned frameworks. Furthermore, a prototype has also been developed by interconnecting the DNN results with Arduino via Serial communication.","PeriodicalId":316752,"journal":{"name":"2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","volume":"227 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCIS51004.2021.9397079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Traffic congestion is one of the major issues of urban cities. The conventional techniques used usually to control traffic via different types of sensors are less precise and expensive. Intelligent solutions using deep learning algorithms provide promising results in terms of better performance, prompt decision making and cost effectiveness. This article aims at providing an easy, more accurate and less expensive solution for the traffic control issues specifically at the traffic signals. Three deep neural network (DNN) frameworks i.e. Faster R-CNN, Mask R-CNN and ResNet-50 have been implemented and compared for vehicle detection, classification and counting. A dataset of 3200 images of different vehicles is used for the training of the models. The training is carried out on NVIDIA 1060TI 3GB GPU. Trained system is tested on indigenous recorded video data of 8 hours for two routes at a traffic signal. Results demonstrated that the overall detection accuracy of Faster R-CNN and Mask R-CNN is >80%, whereas detection accuracy of ResNet-50 is >75%. The counting accuracies of Faster R-CNN, Mask R-CNN and ResNet-50 are >75%, >70% and >62% respectively. Various error analysis have been carried out to validate the performance of the aforementioned frameworks. Furthermore, a prototype has also been developed by interconnecting the DNN results with Arduino via Serial communication.