{"title":"SGE: Semantic-guided Generalization Enhancement for Few-Shot Learning","authors":"Zijun Zheng , Yangyang Zhu , Heng Wu , Laishui Lv , Shanzhou Niu , Gaohang Yu","doi":"10.1016/j.knosys.2025.113761","DOIUrl":null,"url":null,"abstract":"<div><div>In the field of computer vision, learning from limited datasets presents a significant challenge, known as Few-Shot Learning (FSL). Current methodologies have employed semantic information and data augmentation to address insufficient feature representation in limited datasets. However, these approaches often fail to resolve the fundamental data scarcity problem. Additionally, they may inadvertently introduce distributional bias, constraining the utility of semantic features in FSL contexts. To alleviate the issue, in this paper we propose a novel Semantic-guided Generalization Enhancement method (SGE) for FSL. SGE masterfully harnesses robust semantic information by integrating knowledge from diverse pre-trained models. It utilizes semantic cues from class labels to guide both data augmentation and feature extraction processes. Compared to traditional data augmentation techniques, SGE generates augmented samples that are more semantically consistent with the original samples. SGE enables the model to obtain a comprehensive and accurate representation of class characteristics through multifaceted data augmentation of categories. The backbone model adaptively integrates semantic information with image data, and merges the enhanced sample features with the original sample features via a feature fusion module. This allows SGE, guided by semantic cues, to construct a robust class prototype with rich discriminative features. Empirical evidence demonstrates that our framework outperforms state-of-the-art methods across four benchmark assessments. This clearly demonstrates the remarkable efficacy of SGE in leveraging semantic information, thereby exerting a significant influence on FSL. The code is available at <span><span>https://github.com/zhuyangyang-cjlu/SGE</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"323 ","pages":"Article 113761"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095070512500807X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of computer vision, learning from limited datasets presents a significant challenge, known as Few-Shot Learning (FSL). Current methodologies have employed semantic information and data augmentation to address insufficient feature representation in limited datasets. However, these approaches often fail to resolve the fundamental data scarcity problem. Additionally, they may inadvertently introduce distributional bias, constraining the utility of semantic features in FSL contexts. To alleviate the issue, in this paper we propose a novel Semantic-guided Generalization Enhancement method (SGE) for FSL. SGE masterfully harnesses robust semantic information by integrating knowledge from diverse pre-trained models. It utilizes semantic cues from class labels to guide both data augmentation and feature extraction processes. Compared to traditional data augmentation techniques, SGE generates augmented samples that are more semantically consistent with the original samples. SGE enables the model to obtain a comprehensive and accurate representation of class characteristics through multifaceted data augmentation of categories. The backbone model adaptively integrates semantic information with image data, and merges the enhanced sample features with the original sample features via a feature fusion module. This allows SGE, guided by semantic cues, to construct a robust class prototype with rich discriminative features. Empirical evidence demonstrates that our framework outperforms state-of-the-art methods across four benchmark assessments. This clearly demonstrates the remarkable efficacy of SGE in leveraging semantic information, thereby exerting a significant influence on FSL. The code is available at https://github.com/zhuyangyang-cjlu/SGE.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.