Xu Yang, J. Gaspar, W. Lou, W. Ke, C. Lam, Yapeng Wang
{"title":"Vision-Based Mobile People Counting System","authors":"Xu Yang, J. Gaspar, W. Lou, W. Ke, C. Lam, Yapeng Wang","doi":"10.1145/3340997.3340999","DOIUrl":null,"url":null,"abstract":"People detection and counting systems are highly valuable in multiple situations including managing emergency situations and efficiently allocating resources. However, most people counting systems are based on fixed sensors or fixed cameras, which lack flexibility and convenience. In this paper, we have developed a vision-based mobile people counting system which uses Android smartphones to capture images, and state-of-the-art person detectors, based on artificial intelligence, to count the number of people in a designated area. The embedded devices in smartphones such as camera, clock, GPS, are utilized to provide additional information for data collection. Several person detection frameworks such as You Only Look Once v2 (YOLO2), Aggregate Channel Features (ACF) and Multi-Task cascade Convolutional Neural Network (MTCNN) were evaluated to determine the best performing algorithm capable of offering accurate counting results across different scenarios. The experiments results show that YOLO2 outperforms ACF and MTCNN detection algorithms in different scenarios. However, YOLO2 has its own limitations as it often outputs redundant detections, requiring an additional Non-Maxima Suppression (NMS) algorithm to output a single bounding box per detection. The NMS threshold has to be carefully pre-fixed to provide top detection and counting performance across different scenarios.","PeriodicalId":409906,"journal":{"name":"Proceedings of the 2019 4th International Conference on Machine Learning Technologies","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 4th International Conference on Machine Learning Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3340997.3340999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
People detection and counting systems are highly valuable in multiple situations including managing emergency situations and efficiently allocating resources. However, most people counting systems are based on fixed sensors or fixed cameras, which lack flexibility and convenience. In this paper, we have developed a vision-based mobile people counting system which uses Android smartphones to capture images, and state-of-the-art person detectors, based on artificial intelligence, to count the number of people in a designated area. The embedded devices in smartphones such as camera, clock, GPS, are utilized to provide additional information for data collection. Several person detection frameworks such as You Only Look Once v2 (YOLO2), Aggregate Channel Features (ACF) and Multi-Task cascade Convolutional Neural Network (MTCNN) were evaluated to determine the best performing algorithm capable of offering accurate counting results across different scenarios. The experiments results show that YOLO2 outperforms ACF and MTCNN detection algorithms in different scenarios. However, YOLO2 has its own limitations as it often outputs redundant detections, requiring an additional Non-Maxima Suppression (NMS) algorithm to output a single bounding box per detection. The NMS threshold has to be carefully pre-fixed to provide top detection and counting performance across different scenarios.
人员检测和计数系统在管理紧急情况和有效分配资源等多种情况下都具有很高的价值。然而,大多数人口统计系统都是基于固定传感器或固定摄像头,缺乏灵活性和便利性。在本文中,我们开发了一个基于视觉的移动人数统计系统,该系统使用Android智能手机捕获图像,并基于人工智能开发了最先进的人员检测器,以统计指定区域的人数。智能手机中的嵌入式设备,如相机、时钟、GPS,被用来为数据收集提供额外的信息。我们评估了几种人物检测框架,如You Only Look Once v2 (YOLO2)、聚合通道特征(ACF)和多任务级联卷积神经网络(MTCNN),以确定能够在不同场景下提供准确计数结果的最佳算法。实验结果表明,YOLO2在不同场景下的检测性能优于ACF和MTCNN算法。然而,YOLO2有其自身的局限性,因为它经常输出冗余检测,需要额外的非最大抑制(NMS)算法来输出每个检测的单个边界框。必须仔细预先确定NMS阈值,以便在不同场景中提供最高检测和计数性能。