一种基于知识蒸馏的伪标注方法用于图的少镜头学习

IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Zongqian Wu , Peng Zhou , Guoqiu Wen , Xiaofeng Zhu
{"title":"一种基于知识蒸馏的伪标注方法用于图的少镜头学习","authors":"Zongqian Wu ,&nbsp;Peng Zhou ,&nbsp;Guoqiu Wen ,&nbsp;Xiaofeng Zhu","doi":"10.1016/j.ipm.2025.104268","DOIUrl":null,"url":null,"abstract":"<div><div>Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, <em>i.e.</em>, embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 6","pages":"Article 104268"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A pseudo-labeling approach based on knowledge distillation for graph few-shot learning\",\"authors\":\"Zongqian Wu ,&nbsp;Peng Zhou ,&nbsp;Guoqiu Wen ,&nbsp;Xiaofeng Zhu\",\"doi\":\"10.1016/j.ipm.2025.104268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, <em>i.e.</em>, embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"62 6\",\"pages\":\"Article 104268\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457325002092\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325002092","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

基于图的少射节点分类(FSNC)是解决复杂网络分析中标记节点有限问题的一种很有前途的方法。虽然现有的伪标签FSNC方法已经显示出令人鼓舞的结果,但它们经常与错误或过度自信的伪标签作斗争,这可能会对模型泛化产生负面影响。为了克服这些限制,我们提出了PLD-FSNC,一种利用知识蒸馏的新型伪标记FSNC框架。我们的PLD-FSNC框架由嵌入转移和伪标签改进两个模块组成。嵌入传递模块将知识从预训练的源模型传递到目标模型,提高伪标签选择质量。伪标签改进模块通过使用源模型的软标签来监督目标模型的预测,减轻了错误和过度自信的伪标签的影响。我们还为伪标签改进模块提供了理论依据,并通过在六个真实数据集上的广泛实验证明了其有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A pseudo-labeling approach based on knowledge distillation for graph few-shot learning
Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, i.e., embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Processing & Management
Information Processing & Management 工程技术-计算机:信息系统
CiteScore
17.00
自引率
11.60%
发文量
276
审稿时长
39 days
期刊介绍: Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信