Haijia Bi, Lu Liu, Hai Cui, Shengyue Liu, Ridong Han, Jiayu Han, Tao Peng
Improving few-shot relation classification with multi-scale hierarchical prototype learning

Neural Networks, Volume 194, Article 108124. DOI: 10.1016/j.neunet.2025.108124. Published: 2025-09-18.
Citations: 0
Abstract
Few-shot relation classification aims to distinguish different relation classes from extremely limited annotated data. Most existing methods rely on prototype networks to construct a prototypical representation for each class, classifying an instance by comparing its similarity to each prototype. Despite promising results, prototypes derived solely from limited support instances are often inaccurate due to constrained feature extraction capabilities. Moreover, these methods ignore the different hierarchical levels of relational information, which can provide more effective guidance for classification. In this paper, we propose a novel multi-scale hierarchical prototype (Mario) learning method that captures relational interaction information at three levels: inter-set, inter-class, and intra-class, enhancing the model's understanding of global semantic information and helping it distinguish subtle differences between classes. Additionally, we incorporate relational descriptive information to reduce the impact of textual expression diversity, enabling the model to emulate the human cognitive process in understanding variation. Extensive experiments conducted on the FewRel dataset demonstrate the effectiveness of our proposed model. In particular, it achieves accuracy rates of 92.52%/95.33%/85.46%/91.33% under four common few-shot settings. Notably, in the critical 5-way and 10-way 1-shot settings, it outperforms the strongest baseline by 2.87% and 4.29%.
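The prototype-network baseline that the abstract builds on can be sketched as follows. This is a minimal NumPy illustration of mean-prototype classification (prototype = mean of each class's support embeddings, prediction = nearest prototype by Euclidean distance), not the proposed Mario method; all names and the toy embeddings are hypothetical.

```python
import numpy as np

def prototype_classify(support, labels, query):
    """Classify a query embedding by its nearest class prototype.

    support: (n_support, dim) support-instance embeddings
    labels:  (n_support,) class index of each support instance
    query:   (dim,) query-instance embedding
    Returns the predicted class index.
    """
    classes = np.unique(labels)
    # Prototype = mean embedding of each class's support instances.
    prototypes = np.stack([support[labels == c].mean(axis=0) for c in classes])
    # Similarity here is negative Euclidean distance to each prototype.
    dists = np.linalg.norm(prototypes - query, axis=1)
    return int(classes[np.argmin(dists)])

# Toy 2-way 2-shot episode with 2-d "embeddings".
support = np.array([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [0.9, 1.1]])
labels = np.array([0, 0, 1, 1])
query = np.array([0.95, 1.0])
print(prototype_classify(support, labels, query))  # → 1
```

In a real N-way K-shot setting the embeddings would come from a trained sentence encoder; the Mario method additionally refines these prototypes with inter-set, inter-class, and intra-class interaction information.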
Journal Introduction:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.