Performance Analysis and Comparison of Faster R-CNN, Mask R-CNN and ResNet50 for the Detection and Counting of Vehicles

2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS) Pub Date : 2021-02-19 DOI:10.1109/ICCCIS51004.2021.9397079

Hassam Tahir, Muhammad Shahbaz Khan, Muhammad Owais Tariq

{"title":"Performance Analysis and Comparison of Faster R-CNN, Mask R-CNN and ResNet50 for the Detection and Counting of Vehicles","authors":"Hassam Tahir, Muhammad Shahbaz Khan, Muhammad Owais Tariq","doi":"10.1109/ICCCIS51004.2021.9397079","DOIUrl":null,"url":null,"abstract":"Traffic congestion is one of the major issues of urban cities. The conventional techniques used usually to control traffic via different types of sensors are less precise and expensive. Intelligent solutions using deep learning algorithms provide promising results in terms of better performance, prompt decision making and cost effectiveness. This article aims at providing an easy, more accurate and less expensive solution for the traffic control issues specifically at the traffic signals. Three deep neural network (DNN) frameworks i.e. Faster R-CNN, Mask R-CNN and ResNet-50 have been implemented and compared for vehicle detection, classification and counting. A dataset of 3200 images of different vehicles is used for the training of the models. The training is carried out on NVIDIA 1060TI 3GB GPU. Trained system is tested on indigenous recorded video data of 8 hours for two routes at a traffic signal. Results demonstrated that the overall detection accuracy of Faster R-CNN and Mask R-CNN is >80%, whereas detection accuracy of ResNet-50 is >75%. The counting accuracies of Faster R-CNN, Mask R-CNN and ResNet-50 are >75%, >70% and >62% respectively. Various error analysis have been carried out to validate the performance of the aforementioned frameworks. Furthermore, a prototype has also been developed by interconnecting the DNN results with Arduino via Serial communication.","PeriodicalId":316752,"journal":{"name":"2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","volume":"227 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCIS51004.2021.9397079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

Traffic congestion is one of the major issues of urban cities. The conventional techniques used usually to control traffic via different types of sensors are less precise and expensive. Intelligent solutions using deep learning algorithms provide promising results in terms of better performance, prompt decision making and cost effectiveness. This article aims at providing an easy, more accurate and less expensive solution for the traffic control issues specifically at the traffic signals. Three deep neural network (DNN) frameworks i.e. Faster R-CNN, Mask R-CNN and ResNet-50 have been implemented and compared for vehicle detection, classification and counting. A dataset of 3200 images of different vehicles is used for the training of the models. The training is carried out on NVIDIA 1060TI 3GB GPU. Trained system is tested on indigenous recorded video data of 8 hours for two routes at a traffic signal. Results demonstrated that the overall detection accuracy of Faster R-CNN and Mask R-CNN is >80%, whereas detection accuracy of ResNet-50 is >75%. The counting accuracies of Faster R-CNN, Mask R-CNN and ResNet-50 are >75%, >70% and >62% respectively. Various error analysis have been carried out to validate the performance of the aforementioned frameworks. Furthermore, a prototype has also been developed by interconnecting the DNN results with Arduino via Serial communication.

查看原文本刊更多论文

Faster R-CNN、Mask R-CNN和ResNet50在车辆检测与计数中的性能分析与比较

交通拥堵是城市的主要问题之一。通常通过不同类型的传感器来控制交通的传统技术既不精确又昂贵。使用深度学习算法的智能解决方案在更好的性能、快速决策和成本效益方面提供了有希望的结果。本文旨在为交通控制问题提供一种简单、准确、经济的解决方案。本文对Faster R-CNN、Mask R-CNN和ResNet-50三种深度神经网络(DNN)框架进行了车辆检测、分类和计数的实现和比较。使用3200张不同车辆图像的数据集来训练模型。训练在NVIDIA 1060TI 3GB GPU上进行。经过训练的系统在当地录制的8小时视频数据上进行了测试，用于两条路线的交通信号。结果表明，Faster R-CNN和Mask R-CNN的整体检测准确率>80%，而ResNet-50的检测准确率>75%。Faster R-CNN、Mask R-CNN和ResNet-50的计数准确率分别为>75%、>70%和>62%。为了验证上述框架的性能，进行了各种误差分析。此外，还通过串行通信将DNN结果与Arduino互连，开发了一个原型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)

自引率

0.00%

发文量