Group equivariant learning for few-shot image classification

IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Meijuan Su, LeiLei Yan, Fanzhang Li
{"title":"Group equivariant learning for few-shot image classification","authors":"Meijuan Su,&nbsp;LeiLei Yan,&nbsp;Fanzhang Li","doi":"10.1007/s10489-025-06546-7","DOIUrl":null,"url":null,"abstract":"<div><p>Few-shot learning, as an effective approach to solve image classification problems in data-scarce scenarios, has made significant progress in recent years, with numerous methods emerging. These methods typically use convolutional neural networks (CNNs) as feature extractors and classify other data based on the features of a small number of labeled samples. The reason CNNs have become the preferred method for image processing tasks is primarily due to their translational equivariance. However, conventional CNNs lack inherent mechanisms to handle other symmetry transformations (such as rotation and reflection), resulting in reduced classification performance of the model, especially in few-shot scenarios. To address this problem, we leverage the advantages of group convolutions in handling broader symmetric transformations, integrating them into few-shot learning tasks, and accordingly propose a group-equivariant prototypical learning network. This method maps samples into the group space via a group convolution module, enhancing the model’s ability to handle various symmetry transformations present in classification targets within images, thereby improving its feature representation capability. Additionally, we designed a new contrastive loss that can naturally be co-optimized with cross-entropy loss, guiding the model to learn a highly discriminative group feature space. The experimental results on the miniImageNet, CIFAR-FS, and CUB-200 datasets show that the GEPL method significantly improves classification performance, thus verifying the effectiveness of our method.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 10","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06546-7","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Few-shot learning, as an effective approach to solve image classification problems in data-scarce scenarios, has made significant progress in recent years, with numerous methods emerging. These methods typically use convolutional neural networks (CNNs) as feature extractors and classify other data based on the features of a small number of labeled samples. The reason CNNs have become the preferred method for image processing tasks is primarily due to their translational equivariance. However, conventional CNNs lack inherent mechanisms to handle other symmetry transformations (such as rotation and reflection), resulting in reduced classification performance of the model, especially in few-shot scenarios. To address this problem, we leverage the advantages of group convolutions in handling broader symmetric transformations, integrating them into few-shot learning tasks, and accordingly propose a group-equivariant prototypical learning network. This method maps samples into the group space via a group convolution module, enhancing the model’s ability to handle various symmetry transformations present in classification targets within images, thereby improving its feature representation capability. Additionally, we designed a new contrastive loss that can naturally be co-optimized with cross-entropy loss, guiding the model to learn a highly discriminative group feature space. The experimental results on the miniImageNet, CIFAR-FS, and CUB-200 datasets show that the GEPL method significantly improves classification performance, thus verifying the effectiveness of our method.

基于组等变学习的少镜头图像分类
Few-shot学习作为一种解决数据稀缺场景下图像分类问题的有效方法,近年来取得了重大进展,出现了许多方法。这些方法通常使用卷积神经网络(cnn)作为特征提取器,并根据少量标记样本的特征对其他数据进行分类。cnn之所以成为图像处理任务的首选方法,主要是因为其平移等变性。然而,传统的cnn缺乏处理其他对称变换(如旋转和反射)的固有机制,导致模型的分类性能下降,特别是在少镜头场景下。为了解决这个问题,我们利用群卷积在处理更广泛的对称变换方面的优势,将它们集成到少量学习任务中,并相应地提出了一个群等变的原型学习网络。该方法通过群卷积模块将样本映射到群空间中,增强了模型处理图像中分类目标中存在的各种对称变换的能力,从而提高了模型的特征表示能力。此外,我们设计了一种新的对比损失,可以自然地与交叉熵损失协同优化,引导模型学习一个高度判别的群体特征空间。在miniImageNet、CIFAR-FS和CUB-200数据集上的实验结果表明,GEPL方法显著提高了分类性能,从而验证了方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Intelligence
Applied Intelligence 工程技术-计算机:人工智能
CiteScore
6.60
自引率
20.80%
发文量
1361
审稿时长
5.9 months
期刊介绍: With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信