基于双注意助推器的规模增强网络的人群计数

Xin Zeng, Shizhe Hu, Qiang Guo, Yunpeng Wu, Yangdong Ye
{"title":"基于双注意助推器的规模增强网络的人群计数","authors":"Xin Zeng, Shizhe Hu, Qiang Guo, Yunpeng Wu, Yangdong Ye","doi":"10.1145/3579654.3579655","DOIUrl":null,"url":null,"abstract":"Crowd counting has been a fundamental yet challenging problem in pattern recognition. Most recent deep models for crowd counting rely on Convolutional Neural Networks (CNNs). Although CNN visual features comprise the spatial and channel features, existing deep models on crowd counting have limited descriptive ability as they only focus on the spatial or channel information. In this paper, we propose Scale Enhanced Network with Dual Attention Booster named as SEN-DAB, a novel method to jointly learn the representations of spatial and channel information for crowd counting. Moreover, to further leverage the multi-scale information, a pyramid residual scale enhanced block is presented to process the multi-scale features. As a result, the learned spatial, channel and multi-scale features can be robust to appearance changes of the crowd. Our model is tested on three benchmarks and the experimental results confirm that the promising performance of SEN-DAB when compared with various networks.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"117 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Crowd Counting Using Scale Enhanced Network with Dual Attention Booster\",\"authors\":\"Xin Zeng, Shizhe Hu, Qiang Guo, Yunpeng Wu, Yangdong Ye\",\"doi\":\"10.1145/3579654.3579655\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crowd counting has been a fundamental yet challenging problem in pattern recognition. Most recent deep models for crowd counting rely on Convolutional Neural Networks (CNNs). Although CNN visual features comprise the spatial and channel features, existing deep models on crowd counting have limited descriptive ability as they only focus on the spatial or channel information. In this paper, we propose Scale Enhanced Network with Dual Attention Booster named as SEN-DAB, a novel method to jointly learn the representations of spatial and channel information for crowd counting. Moreover, to further leverage the multi-scale information, a pyramid residual scale enhanced block is presented to process the multi-scale features. As a result, the learned spatial, channel and multi-scale features can be robust to appearance changes of the crowd. Our model is tested on three benchmarks and the experimental results confirm that the promising performance of SEN-DAB when compared with various networks.\",\"PeriodicalId\":146783,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence\",\"volume\":\"117 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3579654.3579655\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579655","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

人群计数是模式识别中的一个基本问题,也是一个具有挑战性的问题。最新的人群计数深度模型依赖于卷积神经网络(cnn)。虽然CNN的视觉特征包括空间特征和通道特征,但现有的人群计数深度模型只关注空间或通道信息,描述能力有限。本文提出了一种具有双注意助推器的尺度增强网络,即SEN-DAB,这是一种用于人群计数的空间和通道信息表示联合学习的新方法。此外,为了进一步利用多尺度信息,提出了金字塔残差尺度增强块对多尺度特征进行处理。因此,学习到的空间、通道和多尺度特征对人群的外观变化具有鲁棒性。我们的模型在三个基准上进行了测试,实验结果证实了SEN-DAB与各种网络相比具有良好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Crowd Counting Using Scale Enhanced Network with Dual Attention Booster
Crowd counting has been a fundamental yet challenging problem in pattern recognition. Most recent deep models for crowd counting rely on Convolutional Neural Networks (CNNs). Although CNN visual features comprise the spatial and channel features, existing deep models on crowd counting have limited descriptive ability as they only focus on the spatial or channel information. In this paper, we propose Scale Enhanced Network with Dual Attention Booster named as SEN-DAB, a novel method to jointly learn the representations of spatial and channel information for crowd counting. Moreover, to further leverage the multi-scale information, a pyramid residual scale enhanced block is presented to process the multi-scale features. As a result, the learned spatial, channel and multi-scale features can be robust to appearance changes of the crowd. Our model is tested on three benchmarks and the experimental results confirm that the promising performance of SEN-DAB when compared with various networks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信