{"title":"Real-time Head Detection for Automated Passenger Counting in Embedded Systems","authors":"Hyunduk Kim, Sang-Heon Lee, Myoung-Kyu Sohn","doi":"10.1145/3386164.3389086","DOIUrl":null,"url":null,"abstract":"Head detection is a key problem for automated passenger counting systems. In recent decades, considerable effort has been expended to develop an accurate and reliable head detector. However, head detection is still a challenging task because of problems caused by variations in pose and occlusions. Recently, general object detection algorithms based on convolutional neural networks (CNNs), such as Faster R-CNN, SSD and YOLO, have been successful. However, these algorithms require the use of a Graphics Processing Unit (GPU) for real-time performance. In this study, we focused on developing real-time head detection in an embedded system. Starting with the Tiny-YOLOv3 network, we applied the following strategies to achieve real-time performance in a non-GPU environment. First, we reduced the input image size to 224x224. Second, we added an extra yolo layer to detect smaller heads. Third, we removed batch normalization. Finally, we conducted depthwise separable convolution rather than traditional convolution. Three public datasets, HollywoodHeads, SCUT_HEAD, and CrowdHuman, were exploited to train and test the proposed network, and Average Precision (AP) at Intersection over Unit (IoU) = 0.5 were used to evaluate the tests. Experimental results showed that the proposed network perform better and faster than Tiny-YOLOv3.","PeriodicalId":231209,"journal":{"name":"Proceedings of the 2019 3rd International Symposium on Computer Science and Intelligent Control","volume":"479 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 3rd International Symposium on Computer Science and Intelligent Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3386164.3389086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Head detection is a key problem for automated passenger counting systems. In recent decades, considerable effort has been expended to develop an accurate and reliable head detector. However, head detection is still a challenging task because of problems caused by variations in pose and occlusions. Recently, general object detection algorithms based on convolutional neural networks (CNNs), such as Faster R-CNN, SSD and YOLO, have been successful. However, these algorithms require the use of a Graphics Processing Unit (GPU) for real-time performance. In this study, we focused on developing real-time head detection in an embedded system. Starting with the Tiny-YOLOv3 network, we applied the following strategies to achieve real-time performance in a non-GPU environment. First, we reduced the input image size to 224x224. Second, we added an extra yolo layer to detect smaller heads. Third, we removed batch normalization. Finally, we conducted depthwise separable convolution rather than traditional convolution. Three public datasets, HollywoodHeads, SCUT_HEAD, and CrowdHuman, were exploited to train and test the proposed network, and Average Precision (AP) at Intersection over Unit (IoU) = 0.5 were used to evaluate the tests. Experimental results showed that the proposed network perform better and faster than Tiny-YOLOv3.