基于有限计算资源的非结构化车辆检测

Tarik Reza Toha, Masfiqur Rahaman, Saiful Islam Salim, M. Hossain, A. M. Sadri, A. B. M. A. Al Islam
{"title":"基于有限计算资源的非结构化车辆检测","authors":"Tarik Reza Toha, Masfiqur Rahaman, Saiful Islam Salim, M. Hossain, A. M. Sadri, A. B. M. A. Al Islam","doi":"10.1109/ICDM51629.2021.00172","DOIUrl":null,"url":null,"abstract":"Inefficient traffic signal control system is one of the most important causes of traffic congestion in the cities of developing countries such as Bangladesh, India, Kenya, etc. This can be mitigated by adopting a decentralized traffic-responsive signal system, where vehicle detection is performed on the road through different image-based deep learning architectures amenable to limited-resource embedded platforms as available in developing countries. Deep learning architectures currently available in this regard demand high computational resources to achieve higher inference speed and better accuracy. Besides, the few existing limited-resource deep learning architectural alternatives neither attain higher inference speed nor substantial accuracy due to not overcoming the inherent limitations. To this extent, in this study, we propose a novel limited-resource deep learning architecture, namely DhakaNet, for real-time vehicle detection in on-road (street-view) traffic images. Our proposed architecture leverages enhancing Cross-Stage Partial Network and Path Aggregation Network to build the backbone and head networks, respectively. Besides, we develop a novel multi-scale attention module to extract multi-scale meaningful features from the images, where the developed multi-scale attention module boosts the detection accuracy at the cost of small overhead. Rigorous experimental evaluation of our proposed DhakaNet over three benchmark street-view traffic datasets such as DhakaAI, IITM-HeTra-A, and IITM-HeTra-B shows up to 51% faster inference speed at a similar accuracy, or up to 13% higher accuracy at a similar inference speed compared to other state-of-the-art limited-resource deep learning architectural alternatives.","PeriodicalId":320970,"journal":{"name":"2021 IEEE International Conference on Data Mining (ICDM)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DhakaNet: Unstructured Vehicle Detection using Limited Computational Resources\",\"authors\":\"Tarik Reza Toha, Masfiqur Rahaman, Saiful Islam Salim, M. Hossain, A. M. Sadri, A. B. M. A. Al Islam\",\"doi\":\"10.1109/ICDM51629.2021.00172\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Inefficient traffic signal control system is one of the most important causes of traffic congestion in the cities of developing countries such as Bangladesh, India, Kenya, etc. This can be mitigated by adopting a decentralized traffic-responsive signal system, where vehicle detection is performed on the road through different image-based deep learning architectures amenable to limited-resource embedded platforms as available in developing countries. Deep learning architectures currently available in this regard demand high computational resources to achieve higher inference speed and better accuracy. Besides, the few existing limited-resource deep learning architectural alternatives neither attain higher inference speed nor substantial accuracy due to not overcoming the inherent limitations. To this extent, in this study, we propose a novel limited-resource deep learning architecture, namely DhakaNet, for real-time vehicle detection in on-road (street-view) traffic images. Our proposed architecture leverages enhancing Cross-Stage Partial Network and Path Aggregation Network to build the backbone and head networks, respectively. Besides, we develop a novel multi-scale attention module to extract multi-scale meaningful features from the images, where the developed multi-scale attention module boosts the detection accuracy at the cost of small overhead. Rigorous experimental evaluation of our proposed DhakaNet over three benchmark street-view traffic datasets such as DhakaAI, IITM-HeTra-A, and IITM-HeTra-B shows up to 51% faster inference speed at a similar accuracy, or up to 13% higher accuracy at a similar inference speed compared to other state-of-the-art limited-resource deep learning architectural alternatives.\",\"PeriodicalId\":320970,\"journal\":{\"name\":\"2021 IEEE International Conference on Data Mining (ICDM)\",\"volume\":\"97 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Data Mining (ICDM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDM51629.2021.00172\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Data Mining (ICDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDM51629.2021.00172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

低效的交通信号控制系统是造成孟加拉国、印度、肯尼亚等发展中国家城市交通拥堵的重要原因之一。这可以通过采用分散的交通响应信号系统来缓解,该系统通过不同的基于图像的深度学习架构在道路上执行车辆检测,这些架构适用于发展中国家可用的资源有限的嵌入式平台。在这方面,目前可用的深度学习架构需要大量的计算资源来实现更高的推理速度和更好的准确性。此外,由于没有克服固有的局限性,现有的少数有限资源深度学习架构替代方案既没有获得更高的推理速度,也没有获得实质性的准确性。为此,在本研究中,我们提出了一种新的有限资源深度学习架构,即DhakaNet,用于道路(街景)交通图像中的实时车辆检测。我们提出的架构利用增强的跨阶段部分网络和路径聚合网络分别构建骨干网和头网。此外,我们开发了一种新的多尺度关注模块,从图像中提取多尺度有意义的特征,该多尺度关注模块以较小的开销提高了检测精度。对我们提出的DhakaNet在三个基准街景交通数据集(如DhakaAI, IITM-HeTra-A和IITM-HeTra-B)上进行的严格实验评估显示,与其他最先进的有限资源深度学习架构替代方案相比,在相似的精度下,推理速度提高了51%,或者在相似的推理速度下,准确率提高了13%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
DhakaNet: Unstructured Vehicle Detection using Limited Computational Resources
Inefficient traffic signal control system is one of the most important causes of traffic congestion in the cities of developing countries such as Bangladesh, India, Kenya, etc. This can be mitigated by adopting a decentralized traffic-responsive signal system, where vehicle detection is performed on the road through different image-based deep learning architectures amenable to limited-resource embedded platforms as available in developing countries. Deep learning architectures currently available in this regard demand high computational resources to achieve higher inference speed and better accuracy. Besides, the few existing limited-resource deep learning architectural alternatives neither attain higher inference speed nor substantial accuracy due to not overcoming the inherent limitations. To this extent, in this study, we propose a novel limited-resource deep learning architecture, namely DhakaNet, for real-time vehicle detection in on-road (street-view) traffic images. Our proposed architecture leverages enhancing Cross-Stage Partial Network and Path Aggregation Network to build the backbone and head networks, respectively. Besides, we develop a novel multi-scale attention module to extract multi-scale meaningful features from the images, where the developed multi-scale attention module boosts the detection accuracy at the cost of small overhead. Rigorous experimental evaluation of our proposed DhakaNet over three benchmark street-view traffic datasets such as DhakaAI, IITM-HeTra-A, and IITM-HeTra-B shows up to 51% faster inference speed at a similar accuracy, or up to 13% higher accuracy at a similar inference speed compared to other state-of-the-art limited-resource deep learning architectural alternatives.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信