Mickael Cormier, Stefan Wolf, L. Sommer, Arne Schumann, J. Beyerer
Title: Fast Pedestrian Detection for Real-World Crowded Scenarios on Embedded GPU
Venue: IEEE EUROCON 2021 - 19th International Conference on Smart Technologies
Publication date: 2021-07-06
DOI: 10.1109/EUROCON52738.2021.9535550
URL: https://doi.org/10.1109/EUROCON52738.2021.9535550
Citations: 5
Abstract
The behavior of individuals in crowds in public places has gained enormously in importance over the last year, for example through distancing requirements. However, automatically detecting pedestrians in real-world uncooperative scenarios remains a very challenging task. Crowded areas in surveillance footage are especially challenging, not only for automatic vision systems but also for human operators. Furthermore, complex detection models do not scale easily and are not traditionally designed for on-device processing in resource-constrained smart cameras, which are becoming increasingly popular because of technical and privacy issues at large events. In this work, we propose a new Fast Pedestrian Detector (FPD) based on RetinaNet, which is a fast and efficient architecture for embedded platforms. The proposed FPD provides near real-time and real-time detection of hundreds of pedestrians on embedded platforms, outperforming popular YOLO-based approaches traditionally tuned for speed. Furthermore, by evaluating our approach on several different Jetson platforms in terms of speed and energy profiles, we highlight the challenges related to the deployment of a deep-learning-based pedestrian detector on embedded platforms for smart surveillance cameras.