Learning feature alignment and dual correlation for few-shot image classification

IF 8.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Xilang Huang, Seon Han Choi
{"title":"Learning feature alignment and dual correlation for few-shot image classification","authors":"Xilang Huang,&nbsp;Seon Han Choi","doi":"10.1049/cit2.12273","DOIUrl":null,"url":null,"abstract":"<p>Few-shot image classification is the task of classifying novel classes using extremely limited labelled samples. To perform classification using the limited samples, one solution is to learn the feature alignment (FA) information between the labelled and unlabelled sample features. Most FA methods use the feature mean as the class prototype and calculate the correlation between prototype and unlabelled features to learn an alignment strategy. However, mean prototypes tend to degenerate informative features because spatial features at the same position may not be equally important for the final classification, leading to inaccurate correlation calculations. Therefore, the authors propose an effective intraclass FA strategy that aggregates semantically similar spatial features from an adaptive reference prototype in low-dimensional feature space to obtain an informative prototype feature map for precise correlation computation. Moreover, a dual correlation module to learn the hard and soft correlations was developed by the authors. This module combines the correlation information between the prototype and unlabelled features in both the original and learnable feature spaces, aiming to produce a comprehensive cross-correlation between the prototypes and unlabelled features. Using both FA and cross-attention modules, our model can maintain informative class features and capture important shared features for classification. Experimental results on three few-shot classification benchmarks show that the proposed method outperformed related methods and resulted in a 3% performance boost in the 1-shot setting by inserting the proposed module into the related methods.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 2","pages":"303-318"},"PeriodicalIF":8.4000,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12273","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cit2.12273","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Few-shot image classification is the task of classifying novel classes using extremely limited labelled samples. To perform classification using the limited samples, one solution is to learn the feature alignment (FA) information between the labelled and unlabelled sample features. Most FA methods use the feature mean as the class prototype and calculate the correlation between prototype and unlabelled features to learn an alignment strategy. However, mean prototypes tend to degenerate informative features because spatial features at the same position may not be equally important for the final classification, leading to inaccurate correlation calculations. Therefore, the authors propose an effective intraclass FA strategy that aggregates semantically similar spatial features from an adaptive reference prototype in low-dimensional feature space to obtain an informative prototype feature map for precise correlation computation. Moreover, a dual correlation module to learn the hard and soft correlations was developed by the authors. This module combines the correlation information between the prototype and unlabelled features in both the original and learnable feature spaces, aiming to produce a comprehensive cross-correlation between the prototypes and unlabelled features. Using both FA and cross-attention modules, our model can maintain informative class features and capture important shared features for classification. Experimental results on three few-shot classification benchmarks show that the proposed method outperformed related methods and resulted in a 3% performance boost in the 1-shot setting by inserting the proposed module into the related methods.

Abstract Image

学习特征对齐和双相关性,实现少镜头图像分类
少射图像分类是使用极其有限的标记样本对新类别进行分类的任务。为了使用有限的样本进行分类,一种解决方案是学习标记和未标记样本特征之间的特征对齐(FA)信息。大多数遗传算法使用特征均值作为类原型,通过计算原型与未标记特征之间的相关性来学习对齐策略。然而,由于同一位置的空间特征对最终分类的重要性可能不相同,平均原型往往会使信息特征退化,从而导致相关性计算不准确。因此,作者提出了一种有效的类内特征分析策略,该策略将语义相似的空间特征从自适应参考原型中聚集到低维特征空间中,以获得信息丰富的原型特征图,用于精确的相关性计算。此外,作者还开发了一个双相关模块来学习硬相关和软相关。该模块结合了原始特征空间和可学习特征空间中原型和未标记特征之间的相关信息,旨在产生原型和未标记特征之间的全面交叉相关。使用FA和交叉注意模块,我们的模型可以维护信息丰富的类特征,并捕获重要的共享特征进行分类。实验结果表明,通过将所提出的模块插入到相关方法中,所提出的方法优于相关方法,并且在1 - shot设置中性能提升了3%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CAAI Transactions on Intelligence Technology
CAAI Transactions on Intelligence Technology COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-
CiteScore
11.00
自引率
3.90%
发文量
134
审稿时长
35 weeks
期刊介绍: CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信