一种基于知识蒸馏的伪标注方法用于图的少镜头学习

IF 6.9 1区管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Processing & Management Pub Date : 2025-07-04 DOI:10.1016/j.ipm.2025.104268

Zongqian Wu , Peng Zhou , Guoqiu Wen , Xiaofeng Zhu

{"title":"一种基于知识蒸馏的伪标注方法用于图的少镜头学习","authors":"Zongqian Wu , Peng Zhou , Guoqiu Wen , Xiaofeng Zhu","doi":"10.1016/j.ipm.2025.104268","DOIUrl":null,"url":null,"abstract":"<div><div>Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, <em>i.e.</em>, embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 6","pages":"Article 104268"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A pseudo-labeling approach based on knowledge distillation for graph few-shot learning\",\"authors\":\"Zongqian Wu , Peng Zhou , Guoqiu Wen , Xiaofeng Zhu\",\"doi\":\"10.1016/j.ipm.2025.104268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, <em>i.e.</em>, embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"62 6\",\"pages\":\"Article 104268\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457325002092\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325002092","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

基于图的少射节点分类（FSNC）是解决复杂网络分析中标记节点有限问题的一种很有前途的方法。虽然现有的伪标签FSNC方法已经显示出令人鼓舞的结果，但它们经常与错误或过度自信的伪标签作斗争，这可能会对模型泛化产生负面影响。为了克服这些限制，我们提出了PLD-FSNC，一种利用知识蒸馏的新型伪标记FSNC框架。我们的PLD-FSNC框架由嵌入转移和伪标签改进两个模块组成。嵌入传递模块将知识从预训练的源模型传递到目标模型，提高伪标签选择质量。伪标签改进模块通过使用源模型的软标签来监督目标模型的预测，减轻了错误和过度自信的伪标签的影响。我们还为伪标签改进模块提供了理论依据，并通过在六个真实数据集上的广泛实验证明了其有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A pseudo-labeling approach based on knowledge distillation for graph few-shot learning

Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, i.e., embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Information Processing & Management 工程技术-计算机：信息系统

CiteScore

17.00

自引率

11.60%

发文量

276

审稿时长

39 days

期刊介绍： Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.