Improve the Scale Invariance of the Convolutional Network for Crowd Counting

2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE) Pub Date : 2021-01-15 DOI:10.1109/ICCECE51280.2021.9342331

Ryan Jin

引用次数: 0

Abstract

The main challenges of crowd counting are considerable variations in complex scenes/backgrounds. This paper first reveals that the Convolution Neural Networks (CNNs) are incapable of addressing these problems. To solve this problem, we propose a novel attention mechanism to improve the scale invariance of convolutional networks. Our method can not only automatically exploit spatial awareness to optimize the convolutional features but also imitate the human attention mechanism to remove the noise of the background. It is worth noting that it can easily plug-and-play into the vanilla convolution/pooling layer with relatively little computation cost. We have integrated our method into several state-of-the-art methods. Extensive experiments on five popular benchmarks demonstrate that our approach significantly outperforms other state-of-the-art methods and beats entire convolution/pooling layer in all cases.

查看原文本刊更多论文

改进卷积网络在人群计数中的尺度不变性

人群计数的主要挑战是复杂场景/背景中的大量变化。本文首先揭示了卷积神经网络(cnn)无法解决这些问题。为了解决这个问题，我们提出了一种新的注意机制来提高卷积网络的尺度不变性。该方法不仅可以自动利用空间感知来优化卷积特征，而且可以模仿人的注意机制来去除背景噪声。值得注意的是，它可以很容易地插入到普通的卷积/池化层中，计算成本相对较小。我们已经把我们的方法整合到几个最先进的方法中。在五个流行的基准测试上进行的大量实验表明，我们的方法明显优于其他最先进的方法，并且在所有情况下都胜过整个卷积/池化层。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)

自引率

0.00%

发文量