{"title":"基于特征融合和注意机制的人群计数方法","authors":"Jiaming Niu, Guobin Li, Yu Yang","doi":"10.1109/AIID51893.2021.9456541","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of background noise interference and occlusion in complex crowded crowd scenes, a crowd counting network FANet based on feature fusion and attention mechanism is proposed. By introducing a feature fusion layer and a crowd region recognition module, FANet can effectively eliminate the influence of background interference and occlusion, thereby improving counting performance. As a supplement to the feature extraction network, the feature fusion layer aims to fuse low-level texture features and high-level features to avoid a large amount of loss of features, thereby enabling the model to have higher multi-scale information perception capabilities and improving training efficiency. The crowd region recognition module generates a corresponding attention weight map for the image through convolution and up-sampling operations, and based on this, achieves the purpose of suppressing background interference. Finally, the evaluation was conducted on two data sets. The experiment showed that the MAE of the proposed method on ShanghaiTech and UCF-QNRF achieved 1.1%,3% and 1.1% improvement respectively.","PeriodicalId":412698,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Crowd counting method based on feature fusion and attention mechanism\",\"authors\":\"Jiaming Niu, Guobin Li, Yu Yang\",\"doi\":\"10.1109/AIID51893.2021.9456541\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem of background noise interference and occlusion in complex crowded crowd scenes, a crowd counting network FANet based on feature fusion and attention mechanism is proposed. By introducing a feature fusion layer and a crowd region recognition module, FANet can effectively eliminate the influence of background interference and occlusion, thereby improving counting performance. As a supplement to the feature extraction network, the feature fusion layer aims to fuse low-level texture features and high-level features to avoid a large amount of loss of features, thereby enabling the model to have higher multi-scale information perception capabilities and improving training efficiency. The crowd region recognition module generates a corresponding attention weight map for the image through convolution and up-sampling operations, and based on this, achieves the purpose of suppressing background interference. Finally, the evaluation was conducted on two data sets. The experiment showed that the MAE of the proposed method on ShanghaiTech and UCF-QNRF achieved 1.1%,3% and 1.1% improvement respectively.\",\"PeriodicalId\":412698,\"journal\":{\"name\":\"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIID51893.2021.9456541\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIID51893.2021.9456541","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Crowd counting method based on feature fusion and attention mechanism
Aiming at the problem of background noise interference and occlusion in complex crowded crowd scenes, a crowd counting network FANet based on feature fusion and attention mechanism is proposed. By introducing a feature fusion layer and a crowd region recognition module, FANet can effectively eliminate the influence of background interference and occlusion, thereby improving counting performance. As a supplement to the feature extraction network, the feature fusion layer aims to fuse low-level texture features and high-level features to avoid a large amount of loss of features, thereby enabling the model to have higher multi-scale information perception capabilities and improving training efficiency. The crowd region recognition module generates a corresponding attention weight map for the image through convolution and up-sampling operations, and based on this, achieves the purpose of suppressing background interference. Finally, the evaluation was conducted on two data sets. The experiment showed that the MAE of the proposed method on ShanghaiTech and UCF-QNRF achieved 1.1%,3% and 1.1% improvement respectively.