{"title":"一种基于人滤的高性能人群计数新方法","authors":"P. Do, N. Ly","doi":"10.1109/NICS51282.2020.9335850","DOIUrl":null,"url":null,"abstract":"One of the tasks of the crowd monitoring system is to estimate the number of people in the crowd and issue a warning when it exceeds the allowed threshold. Previous approaches often used multi-column CNN to estimate density maps and thereby estimate the count. However, the amount of information learned from crowd datasets is very small. On the other hand, the confusion between people and other objects such as buildings, trees, rocks, etc (background noise) will affect the density map estimation. In this paper, we focus to solve these two problems and propose a model called Counting using Human Filter (CHF) which consists of two modules: The first one is a feature extractor from a crowd image based on the VGG-16 model to estimate the density map. This module will take advantage of the features learned from the ImageNet dataset. The second one is the human filter used to weight each pixel of the density map. Two modules are combined by element-wise multiplication. We evaluate the estimated results of the model with MAE, MSE metric and assess the quality of density maps according to PSNR, SSIM. Experiments show that our approach estimates the number of people better than the previous methods when evaluating on the ShanghaiTech, UCF_CC_50, UCF-QRNF datasets. Regarding the complexity of the model, our method shares parameters between two modules so it halved the number of parameters compared to previous methods such as Switch-CNN, SSC, ADCrowdNet.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A New High Performance Approach for Crowd Counting Using Human Filter\",\"authors\":\"P. Do, N. Ly\",\"doi\":\"10.1109/NICS51282.2020.9335850\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the tasks of the crowd monitoring system is to estimate the number of people in the crowd and issue a warning when it exceeds the allowed threshold. Previous approaches often used multi-column CNN to estimate density maps and thereby estimate the count. However, the amount of information learned from crowd datasets is very small. On the other hand, the confusion between people and other objects such as buildings, trees, rocks, etc (background noise) will affect the density map estimation. In this paper, we focus to solve these two problems and propose a model called Counting using Human Filter (CHF) which consists of two modules: The first one is a feature extractor from a crowd image based on the VGG-16 model to estimate the density map. This module will take advantage of the features learned from the ImageNet dataset. The second one is the human filter used to weight each pixel of the density map. Two modules are combined by element-wise multiplication. We evaluate the estimated results of the model with MAE, MSE metric and assess the quality of density maps according to PSNR, SSIM. Experiments show that our approach estimates the number of people better than the previous methods when evaluating on the ShanghaiTech, UCF_CC_50, UCF-QRNF datasets. Regarding the complexity of the model, our method shares parameters between two modules so it halved the number of parameters compared to previous methods such as Switch-CNN, SSC, ADCrowdNet.\",\"PeriodicalId\":308944,\"journal\":{\"name\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS51282.2020.9335850\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335850","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A New High Performance Approach for Crowd Counting Using Human Filter
One of the tasks of the crowd monitoring system is to estimate the number of people in the crowd and issue a warning when it exceeds the allowed threshold. Previous approaches often used multi-column CNN to estimate density maps and thereby estimate the count. However, the amount of information learned from crowd datasets is very small. On the other hand, the confusion between people and other objects such as buildings, trees, rocks, etc (background noise) will affect the density map estimation. In this paper, we focus to solve these two problems and propose a model called Counting using Human Filter (CHF) which consists of two modules: The first one is a feature extractor from a crowd image based on the VGG-16 model to estimate the density map. This module will take advantage of the features learned from the ImageNet dataset. The second one is the human filter used to weight each pixel of the density map. Two modules are combined by element-wise multiplication. We evaluate the estimated results of the model with MAE, MSE metric and assess the quality of density maps according to PSNR, SSIM. Experiments show that our approach estimates the number of people better than the previous methods when evaluating on the ShanghaiTech, UCF_CC_50, UCF-QRNF datasets. Regarding the complexity of the model, our method shares parameters between two modules so it halved the number of parameters compared to previous methods such as Switch-CNN, SSC, ADCrowdNet.