{"title":"Bag of Groups of Convolutional Features Model for Visual Object Recognition","authors":"Jaspreet Singh, Chandan Singh","doi":"10.1109/mlsp52302.2021.9596432","DOIUrl":null,"url":null,"abstract":"Deep convolutional neural networks (CNNs) are only equivariant to translation. Recently, equivariant CNNs are proposed for the task of image classification which are not only equivariant to translation but also to other affine geometric transformations. Moreover, CNNs and equivariant CNNs require a large amount of labeled training data to generalize its parameters which also limit their application areas. We propose a bag of groups of convolutional features (BoGCFs) model for the CNNs and group-equivariant CNNs (G-CNNs)[1], which preserves the fundamental property of equivariance of G-CNNs and generate the global invariant features by dividing the convolutional feature maps of the deeper layers of the network into groups. The proposed model for CNNs and G-CNNs, referred as CNN-BoGCFs and G-CNN-BoGCFs, performs significantly high when trained on a small amount of labeled data for image classification. The proposed method is evaluated using rotated MNIST, SIMPLIcity and Oxford flower 17 datasets.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/mlsp52302.2021.9596432","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Deep convolutional neural networks (CNNs) are only equivariant to translation. Recently, equivariant CNNs are proposed for the task of image classification which are not only equivariant to translation but also to other affine geometric transformations. Moreover, CNNs and equivariant CNNs require a large amount of labeled training data to generalize its parameters which also limit their application areas. We propose a bag of groups of convolutional features (BoGCFs) model for the CNNs and group-equivariant CNNs (G-CNNs)[1], which preserves the fundamental property of equivariance of G-CNNs and generate the global invariant features by dividing the convolutional feature maps of the deeper layers of the network into groups. The proposed model for CNNs and G-CNNs, referred as CNN-BoGCFs and G-CNN-BoGCFs, performs significantly high when trained on a small amount of labeled data for image classification. The proposed method is evaluated using rotated MNIST, SIMPLIcity and Oxford flower 17 datasets.