{"title":"基于快速回归卷积神经网络的视觉人群计数","authors":"S. Teoh, Vooi Voon Yap, H. Nisar","doi":"10.1109/ICCOINS49721.2021.9497198","DOIUrl":null,"url":null,"abstract":"This paper presents an improved convolutional neural network (CNN) architecture for accurate visual crowd counting in crowd images. Comprehensive analysis on the performance and inference speed of the network model are also presented. Multi-column convolutional neural network (MCNN) for visual crowd counting through predicted density map is widely used in previous works, however this method has limitation in predicting a quality density map. Instead, the proposed network is constructed by using the powerful CNN layers, dense layers, and one regressor node with whole image-based inference. Therefore, it is less computationally intensive and inference speed can be increased. Experiments have been conducted on Mall dataset. Moreover, benchmarking on different CNN architectures have been conducted. The proposed network shows promising counting accuracy and reasonable inference speed against the existing state-of-art approaches.","PeriodicalId":245662,"journal":{"name":"2021 International Conference on Computer & Information Sciences (ICCOINS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Fast Regression Convolutional Neural Network for Visual Crowd Counting\",\"authors\":\"S. Teoh, Vooi Voon Yap, H. Nisar\",\"doi\":\"10.1109/ICCOINS49721.2021.9497198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an improved convolutional neural network (CNN) architecture for accurate visual crowd counting in crowd images. Comprehensive analysis on the performance and inference speed of the network model are also presented. Multi-column convolutional neural network (MCNN) for visual crowd counting through predicted density map is widely used in previous works, however this method has limitation in predicting a quality density map. Instead, the proposed network is constructed by using the powerful CNN layers, dense layers, and one regressor node with whole image-based inference. Therefore, it is less computationally intensive and inference speed can be increased. Experiments have been conducted on Mall dataset. Moreover, benchmarking on different CNN architectures have been conducted. The proposed network shows promising counting accuracy and reasonable inference speed against the existing state-of-art approaches.\",\"PeriodicalId\":245662,\"journal\":{\"name\":\"2021 International Conference on Computer & Information Sciences (ICCOINS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computer & Information Sciences (ICCOINS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCOINS49721.2021.9497198\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer & Information Sciences (ICCOINS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCOINS49721.2021.9497198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fast Regression Convolutional Neural Network for Visual Crowd Counting
This paper presents an improved convolutional neural network (CNN) architecture for accurate visual crowd counting in crowd images. Comprehensive analysis on the performance and inference speed of the network model are also presented. Multi-column convolutional neural network (MCNN) for visual crowd counting through predicted density map is widely used in previous works, however this method has limitation in predicting a quality density map. Instead, the proposed network is constructed by using the powerful CNN layers, dense layers, and one regressor node with whole image-based inference. Therefore, it is less computationally intensive and inference speed can be increased. Experiments have been conducted on Mall dataset. Moreover, benchmarking on different CNN architectures have been conducted. The proposed network shows promising counting accuracy and reasonable inference speed against the existing state-of-art approaches.