一种基于人滤的高性能人群计数新方法

P. Do, N. Ly
{"title":"一种基于人滤的高性能人群计数新方法","authors":"P. Do, N. Ly","doi":"10.1109/NICS51282.2020.9335850","DOIUrl":null,"url":null,"abstract":"One of the tasks of the crowd monitoring system is to estimate the number of people in the crowd and issue a warning when it exceeds the allowed threshold. Previous approaches often used multi-column CNN to estimate density maps and thereby estimate the count. However, the amount of information learned from crowd datasets is very small. On the other hand, the confusion between people and other objects such as buildings, trees, rocks, etc (background noise) will affect the density map estimation. In this paper, we focus to solve these two problems and propose a model called Counting using Human Filter (CHF) which consists of two modules: The first one is a feature extractor from a crowd image based on the VGG-16 model to estimate the density map. This module will take advantage of the features learned from the ImageNet dataset. The second one is the human filter used to weight each pixel of the density map. Two modules are combined by element-wise multiplication. We evaluate the estimated results of the model with MAE, MSE metric and assess the quality of density maps according to PSNR, SSIM. Experiments show that our approach estimates the number of people better than the previous methods when evaluating on the ShanghaiTech, UCF_CC_50, UCF-QRNF datasets. Regarding the complexity of the model, our method shares parameters between two modules so it halved the number of parameters compared to previous methods such as Switch-CNN, SSC, ADCrowdNet.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A New High Performance Approach for Crowd Counting Using Human Filter\",\"authors\":\"P. Do, N. Ly\",\"doi\":\"10.1109/NICS51282.2020.9335850\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the tasks of the crowd monitoring system is to estimate the number of people in the crowd and issue a warning when it exceeds the allowed threshold. Previous approaches often used multi-column CNN to estimate density maps and thereby estimate the count. However, the amount of information learned from crowd datasets is very small. On the other hand, the confusion between people and other objects such as buildings, trees, rocks, etc (background noise) will affect the density map estimation. In this paper, we focus to solve these two problems and propose a model called Counting using Human Filter (CHF) which consists of two modules: The first one is a feature extractor from a crowd image based on the VGG-16 model to estimate the density map. This module will take advantage of the features learned from the ImageNet dataset. The second one is the human filter used to weight each pixel of the density map. Two modules are combined by element-wise multiplication. We evaluate the estimated results of the model with MAE, MSE metric and assess the quality of density maps according to PSNR, SSIM. Experiments show that our approach estimates the number of people better than the previous methods when evaluating on the ShanghaiTech, UCF_CC_50, UCF-QRNF datasets. Regarding the complexity of the model, our method shares parameters between two modules so it halved the number of parameters compared to previous methods such as Switch-CNN, SSC, ADCrowdNet.\",\"PeriodicalId\":308944,\"journal\":{\"name\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS51282.2020.9335850\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335850","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

人群监控系统的任务之一是估计人群中的人数,并在超过允许的阈值时发出警告。以前的方法通常使用多列CNN来估计密度图,从而估计计数。然而,从人群数据集中学习到的信息量非常小。另一方面,人与其他物体(如建筑物、树木、岩石等)之间的混淆(背景噪声)会影响密度图的估计。本文针对这两个问题,提出了一种基于人类滤波(Human Filter, CHF)的计数模型,该模型由两个模块组成:第一个模块是基于VGG-16模型的人群图像特征提取器,用于估计密度图;本模块将利用从ImageNet数据集学习到的特性。第二种是人类过滤器,用于加权密度图的每个像素。两个模块通过元素乘法组合在一起。我们使用MAE、MSE度量来评估模型的估计结果,并根据PSNR、SSIM来评估密度图的质量。实验表明,在上海科技、UCF_CC_50、UCF-QRNF数据集上,我们的方法对人数的估计优于以往的方法。考虑到模型的复杂性,我们的方法在两个模块之间共享参数,因此与之前的方法(如Switch-CNN, SSC, ADCrowdNet)相比,参数数量减少了一半。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A New High Performance Approach for Crowd Counting Using Human Filter
One of the tasks of the crowd monitoring system is to estimate the number of people in the crowd and issue a warning when it exceeds the allowed threshold. Previous approaches often used multi-column CNN to estimate density maps and thereby estimate the count. However, the amount of information learned from crowd datasets is very small. On the other hand, the confusion between people and other objects such as buildings, trees, rocks, etc (background noise) will affect the density map estimation. In this paper, we focus to solve these two problems and propose a model called Counting using Human Filter (CHF) which consists of two modules: The first one is a feature extractor from a crowd image based on the VGG-16 model to estimate the density map. This module will take advantage of the features learned from the ImageNet dataset. The second one is the human filter used to weight each pixel of the density map. Two modules are combined by element-wise multiplication. We evaluate the estimated results of the model with MAE, MSE metric and assess the quality of density maps according to PSNR, SSIM. Experiments show that our approach estimates the number of people better than the previous methods when evaluating on the ShanghaiTech, UCF_CC_50, UCF-QRNF datasets. Regarding the complexity of the model, our method shares parameters between two modules so it halved the number of parameters compared to previous methods such as Switch-CNN, SSC, ADCrowdNet.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信