{"title":"扩大CNN人群统计","authors":"Deevesh Chaudhary, Devesh Kumar Srivastava, Akhilesh Kumar Sharma","doi":"10.1109/ComPE53109.2021.9752224","DOIUrl":null,"url":null,"abstract":"In the current scenario of the COVID-19 pandemic estimating the count of number of people present in public places at a particular time has become a significant task. Crowd count is attracting a lot of researchers from the computer vision and deep learning field. It has been found that to achieve this objective computer vision techniques such as deep learning, machine learning, etc. outperform traditional ways of estimating crowd count that uses handcrafted features such as Histogram of gradients, Haar, Scale Invariant Feature Transform and gives better results with higher accuracy. The paper studies the effect of dilation on convolution layers in estimating the crowd count. We have also done a comparative analysis of the developed model with different dilation rates on the ShanghaiTech dataset (part A and part B). The model is trained with images containing occluded and restricted visibility of heads. The model outputs the result with substantial accuracy in estimating the headcount in images of the dense crowd in a sensibly less amount of time.","PeriodicalId":211704,"journal":{"name":"2021 International Conference on Computational Performance Evaluation (ComPE)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dilated CNN for Crowd Count\",\"authors\":\"Deevesh Chaudhary, Devesh Kumar Srivastava, Akhilesh Kumar Sharma\",\"doi\":\"10.1109/ComPE53109.2021.9752224\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the current scenario of the COVID-19 pandemic estimating the count of number of people present in public places at a particular time has become a significant task. Crowd count is attracting a lot of researchers from the computer vision and deep learning field. It has been found that to achieve this objective computer vision techniques such as deep learning, machine learning, etc. outperform traditional ways of estimating crowd count that uses handcrafted features such as Histogram of gradients, Haar, Scale Invariant Feature Transform and gives better results with higher accuracy. The paper studies the effect of dilation on convolution layers in estimating the crowd count. We have also done a comparative analysis of the developed model with different dilation rates on the ShanghaiTech dataset (part A and part B). The model is trained with images containing occluded and restricted visibility of heads. The model outputs the result with substantial accuracy in estimating the headcount in images of the dense crowd in a sensibly less amount of time.\",\"PeriodicalId\":211704,\"journal\":{\"name\":\"2021 International Conference on Computational Performance Evaluation (ComPE)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computational Performance Evaluation (ComPE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ComPE53109.2021.9752224\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computational Performance Evaluation (ComPE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ComPE53109.2021.9752224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在当前COVID-19大流行的情况下,估计特定时间公共场所的人数已成为一项重要任务。Crowd count吸引了大量来自计算机视觉和深度学习领域的研究人员。人们发现,为了实现这一目标,深度学习、机器学习等计算机视觉技术比传统的使用梯度直方图(Histogram of gradients)、Haar、尺度不变特征变换(Scale Invariant Feature Transform)等手工特征估计人群数量的方法要好,结果更好,精度更高。本文研究了膨胀对卷积层在估计人群数量中的影响。我们还在ShanghaiTech数据集(a部分和B部分)上对开发的模型进行了不同膨胀率的对比分析。该模型使用包含头部遮挡和受限可见性的图像进行训练。该模型在较短的时间内以相当高的精度在密集人群的图像中估计人数。
In the current scenario of the COVID-19 pandemic estimating the count of number of people present in public places at a particular time has become a significant task. Crowd count is attracting a lot of researchers from the computer vision and deep learning field. It has been found that to achieve this objective computer vision techniques such as deep learning, machine learning, etc. outperform traditional ways of estimating crowd count that uses handcrafted features such as Histogram of gradients, Haar, Scale Invariant Feature Transform and gives better results with higher accuracy. The paper studies the effect of dilation on convolution layers in estimating the crowd count. We have also done a comparative analysis of the developed model with different dilation rates on the ShanghaiTech dataset (part A and part B). The model is trained with images containing occluded and restricted visibility of heads. The model outputs the result with substantial accuracy in estimating the headcount in images of the dense crowd in a sensibly less amount of time.