Bag of Groups of Convolutional Features Model for Visual Object Recognition

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) Pub Date : 2021-10-25 DOI:10.1109/mlsp52302.2021.9596432

Jaspreet Singh, Chandan Singh

引用次数: 1

Abstract

Deep convolutional neural networks (CNNs) are only equivariant to translation. Recently, equivariant CNNs are proposed for the task of image classification which are not only equivariant to translation but also to other affine geometric transformations. Moreover, CNNs and equivariant CNNs require a large amount of labeled training data to generalize its parameters which also limit their application areas. We propose a bag of groups of convolutional features (BoGCFs) model for the CNNs and group-equivariant CNNs (G-CNNs)[1], which preserves the fundamental property of equivariance of G-CNNs and generate the global invariant features by dividing the convolutional feature maps of the deeper layers of the network into groups. The proposed model for CNNs and G-CNNs, referred as CNN-BoGCFs and G-CNN-BoGCFs, performs significantly high when trained on a small amount of labeled data for image classification. The proposed method is evaluated using rotated MNIST, SIMPLIcity and Oxford flower 17 datasets.

查看原文本刊更多论文

用于视觉目标识别的卷积特征包模型

深度卷积神经网络(cnn)仅对翻译具有等变性。近年来，等变cnn被提出用于图像分类任务，它不仅对平移具有等变性，而且对其他仿射几何变换具有等变性。此外，cnn和等变cnn需要大量的标记训练数据来泛化其参数，这也限制了它们的应用领域。我们提出了一种针对cnn和群等变cnn (g - cnn)的bogcf模型[1]，该模型保留了g - cnn的等变基本性质，并通过将网络较深层的卷积特征映射分成组来生成全局不变特征。所提出的cnn和g - cnn模型，即CNN-BoGCFs和G-CNN-BoGCFs，在少量标记数据上进行图像分类训练时，表现出非常高的性能。使用旋转的MNIST、SIMPLIcity和Oxford flower 17数据集对所提出的方法进行了评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)

自引率

0.00%

发文量