基于增强特征通道卷积神经网络的人群计数

2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI) Pub Date : 2019-11-01 DOI:10.1109/ICTAI.2019.00118

Yinlong Bian, Jiehong Shen, Xin Xiong, Ying Li, Wei-Ji He, Peng Li

{"title":"基于增强特征通道卷积神经网络的人群计数","authors":"Yinlong Bian, Jiehong Shen, Xin Xiong, Ying Li, Wei-Ji He, Peng Li","doi":"10.1109/ICTAI.2019.00118","DOIUrl":null,"url":null,"abstract":"Accurate crowd counting is important for interpreting and understanding crowd, which has great practical significance in video monitoring, public safety, urban planning, the construction of intelligent shopping malls and so on. For accurate counting, many excellent algorithms have been proposed, but there are still some challenges in terms of scale variation, occlusion, inaccurate counting in various backgrounds and so forth. In this paper, we propose a new model EFCCNN (Enhanced Feature Channel Convolutional Neural Network) to deal with these challenges. The proposed EFCCNN model has three main contributions. We propose a new convolutional neural network, which can be trained by end-to-end, and it performs better than other crowd counting networks. Additionally, we use SENet (Squeeze-and-Excitation Network) structure to change the channel weight, which can enhance the significant channel, and we use residual structure to transmit the channel weight to improve the counting precision. The SENet structure is helpful to solve the problem of scale variation and occlusion. The EFCCNN model is the first crowd counting model using channel weight information. Furthermore, a new loss function focusing on the structural information of images is proposed, which reduces the mean absolute error of crowd counting, effectively solves the problem of inaccurate counting in various backgrounds, such as crowd miscounting in the tree and brush background, and improves the quality of the crowd density map on SSIM (Structural Similarity Index Measure). Experiments on ShanghaiTech, Mall, UCF_CC_50 dataset show that EFCCNN have a lower mean absolute error of crowd counting and a higher quality density map.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Crowd Counting via Enhanced Feature Channel Convolutional Neural Network\",\"authors\":\"Yinlong Bian, Jiehong Shen, Xin Xiong, Ying Li, Wei-Ji He, Peng Li\",\"doi\":\"10.1109/ICTAI.2019.00118\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate crowd counting is important for interpreting and understanding crowd, which has great practical significance in video monitoring, public safety, urban planning, the construction of intelligent shopping malls and so on. For accurate counting, many excellent algorithms have been proposed, but there are still some challenges in terms of scale variation, occlusion, inaccurate counting in various backgrounds and so forth. In this paper, we propose a new model EFCCNN (Enhanced Feature Channel Convolutional Neural Network) to deal with these challenges. The proposed EFCCNN model has three main contributions. We propose a new convolutional neural network, which can be trained by end-to-end, and it performs better than other crowd counting networks. Additionally, we use SENet (Squeeze-and-Excitation Network) structure to change the channel weight, which can enhance the significant channel, and we use residual structure to transmit the channel weight to improve the counting precision. The SENet structure is helpful to solve the problem of scale variation and occlusion. The EFCCNN model is the first crowd counting model using channel weight information. Furthermore, a new loss function focusing on the structural information of images is proposed, which reduces the mean absolute error of crowd counting, effectively solves the problem of inaccurate counting in various backgrounds, such as crowd miscounting in the tree and brush background, and improves the quality of the crowd density map on SSIM (Structural Similarity Index Measure). Experiments on ShanghaiTech, Mall, UCF_CC_50 dataset show that EFCCNN have a lower mean absolute error of crowd counting and a higher quality density map.\",\"PeriodicalId\":346657,\"journal\":{\"name\":\"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTAI.2019.00118\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2019.00118","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

准确的人群统计对于解读和了解人群具有重要意义，在视频监控、公共安全、城市规划、智能商场建设等方面都具有重要的现实意义。为了精确计数，已经提出了许多优秀的算法，但在尺度变化、遮挡、不同背景下计数不准确等方面仍然存在一些挑战。在本文中，我们提出了一个新的模型EFCCNN (Enhanced Feature Channel Convolutional Neural Network，增强特征通道卷积神经网络)来应对这些挑战。提出的EFCCNN模型有三个主要贡献。我们提出了一种新的卷积神经网络，它可以端到端训练，并且性能优于其他人群计数网络。采用SENet (Squeeze-and-Excitation Network)结构改变信道权值，增强有效信道;采用残差结构传递信道权值，提高计数精度。SENet结构有助于解决尺度变化和遮挡问题。EFCCNN模型是第一个使用信道权重信息的人群计数模型。在此基础上，提出了一种新的以图像结构信息为中心的损失函数，降低了人群计数的平均绝对误差，有效解决了树形背景和刷形背景下人群计数不准确的问题，提高了基于SSIM (structural Similarity Index Measure)的人群密度图的质量。在上海科技、Mall、UCF_CC_50数据集上的实验表明，EFCCNN具有较低的人群统计平均绝对误差和较高质量的密度图。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Crowd Counting via Enhanced Feature Channel Convolutional Neural Network

Accurate crowd counting is important for interpreting and understanding crowd, which has great practical significance in video monitoring, public safety, urban planning, the construction of intelligent shopping malls and so on. For accurate counting, many excellent algorithms have been proposed, but there are still some challenges in terms of scale variation, occlusion, inaccurate counting in various backgrounds and so forth. In this paper, we propose a new model EFCCNN (Enhanced Feature Channel Convolutional Neural Network) to deal with these challenges. The proposed EFCCNN model has three main contributions. We propose a new convolutional neural network, which can be trained by end-to-end, and it performs better than other crowd counting networks. Additionally, we use SENet (Squeeze-and-Excitation Network) structure to change the channel weight, which can enhance the significant channel, and we use residual structure to transmit the channel weight to improve the counting precision. The SENet structure is helpful to solve the problem of scale variation and occlusion. The EFCCNN model is the first crowd counting model using channel weight information. Furthermore, a new loss function focusing on the structural information of images is proposed, which reduces the mean absolute error of crowd counting, effectively solves the problem of inaccurate counting in various backgrounds, such as crowd miscounting in the tree and brush background, and improves the quality of the crowd density map on SSIM (Structural Similarity Index Measure). Experiments on ShanghaiTech, Mall, UCF_CC_50 dataset show that EFCCNN have a lower mean absolute error of crowd counting and a higher quality density map.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)

自引率

0.00%

发文量