Crowd Counting with Spatial Normalization Network
Pengcheng Xia, Dapeng Zhang
2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), 2020-08-01
DOI: 10.1109/CASE48305.2020.9216769 (https://doi.org/10.1109/CASE48305.2020.9216769)
Citations: 0
Abstract
Crowd counting, which requires estimating crowd density from an image, remains a challenging task in computer vision. Most current methods focus on the large scale variation of people and ignore the large differences in crowd distribution. To tackle both problems together, we propose a novel framework named Spatial Normalization Network (SNNet). We normalize multi-scale features from parallel subnetworks to a particular scale and then fuse them to acquire rich spatial information for accurate final density map predictions. Furthermore, we propose a novel normalization layer called Spatial Group Normalization (SGN), which first splits feature maps along the spatial dimension and then performs group-wise normalization. SGN helps address the statistical shift caused by the large differences in distribution in crowd counting. Moreover, SGN can be plugged naturally into existing solutions, where it brings significant improvements in crowd counting. SNNet achieves state-of-the-art performance on four challenging crowd counting datasets (ShanghaiTech, UCF-QNRF, GCC, and TRANCOS), which demonstrates the effectiveness and robust feature learning capability of our method.
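As a rough illustration of the SGN idea, the NumPy sketch below splits each feature map into spatial patches and normalizes every patch within each channel group. The abstract does not specify how the spatial split is made or how the groups are formed, so several choices here are assumptions for illustration only: a regular grid of patches (`grid`), channel grouping in the style of Group Normalization (`num_groups`), the `eps` constant, and the absence of a learnable scale and shift. It should not be read as the authors' implementation.

```python
import numpy as np


def spatial_group_norm(x, grid=(2, 2), num_groups=4, eps=1e-5):
    """Sketch of a spatially split, group-wise normalization.

    x          : array of shape (N, C, H, W)
    grid       : (rows, cols) of spatial patches (assumed regular grid)
    num_groups : number of channel groups (assumed, as in Group Normalization)
    eps        : small constant for numerical stability (assumed value)
    """
    n, c, h, w = x.shape
    gh, gw = grid
    if h % gh or w % gw:
        raise ValueError("H and W must be divisible by the grid size")
    if c % num_groups:
        raise ValueError("C must be divisible by num_groups")

    # Split channels into groups and the spatial plane into gh x gw patches:
    # (N, G, C/G, gh, H/gh, gw, W/gw)
    x7 = x.reshape(n, num_groups, c // num_groups, gh, h // gh, gw, w // gw)

    # Statistics are shared by all channels of one group inside one patch.
    axes = (2, 4, 6)
    mean = x7.mean(axis=axes, keepdims=True)
    var = x7.var(axis=axes, keepdims=True)

    out = (x7 - mean) / np.sqrt(var + eps)
    return out.reshape(n, c, h, w)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = rng.normal(loc=3.0, scale=2.0, size=(2, 8, 4, 6))
    normalized = spatial_group_norm(features, grid=(2, 3), num_groups=2)
    print(normalized.shape)
```

Computing statistics per patch rather than over the whole map means that a dense region and a sparse region of the same image are each normalized against their own local statistics, which is one plausible reading of how a spatial split could reduce the statistical shift the abstract mentions.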