{"title":"Crowd counting method based on feature fusion and attention mechanism","authors":"Jiaming Niu, Guobin Li, Yu Yang","doi":"10.1109/AIID51893.2021.9456541","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of background noise interference and occlusion in complex crowded crowd scenes, a crowd counting network FANet based on feature fusion and attention mechanism is proposed. By introducing a feature fusion layer and a crowd region recognition module, FANet can effectively eliminate the influence of background interference and occlusion, thereby improving counting performance. As a supplement to the feature extraction network, the feature fusion layer aims to fuse low-level texture features and high-level features to avoid a large amount of loss of features, thereby enabling the model to have higher multi-scale information perception capabilities and improving training efficiency. The crowd region recognition module generates a corresponding attention weight map for the image through convolution and up-sampling operations, and based on this, achieves the purpose of suppressing background interference. Finally, the evaluation was conducted on two data sets. The experiment showed that the MAE of the proposed method on ShanghaiTech and UCF-QNRF achieved 1.1%,3% and 1.1% improvement respectively.","PeriodicalId":412698,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIID51893.2021.9456541","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Aiming at the problem of background noise interference and occlusion in complex crowded crowd scenes, a crowd counting network FANet based on feature fusion and attention mechanism is proposed. By introducing a feature fusion layer and a crowd region recognition module, FANet can effectively eliminate the influence of background interference and occlusion, thereby improving counting performance. As a supplement to the feature extraction network, the feature fusion layer aims to fuse low-level texture features and high-level features to avoid a large amount of loss of features, thereby enabling the model to have higher multi-scale information perception capabilities and improving training efficiency. The crowd region recognition module generates a corresponding attention weight map for the image through convolution and up-sampling operations, and based on this, achieves the purpose of suppressing background interference. Finally, the evaluation was conducted on two data sets. The experiment showed that the MAE of the proposed method on ShanghaiTech and UCF-QNRF achieved 1.1%,3% and 1.1% improvement respectively.