{"title":"一种基于知识蒸馏的伪标注方法用于图的少镜头学习","authors":"Zongqian Wu , Peng Zhou , Guoqiu Wen , Xiaofeng Zhu","doi":"10.1016/j.ipm.2025.104268","DOIUrl":null,"url":null,"abstract":"<div><div>Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, <em>i.e.</em>, embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 6","pages":"Article 104268"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A pseudo-labeling approach based on knowledge distillation for graph few-shot learning\",\"authors\":\"Zongqian Wu , Peng Zhou , Guoqiu Wen , Xiaofeng Zhu\",\"doi\":\"10.1016/j.ipm.2025.104268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, <em>i.e.</em>, embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"62 6\",\"pages\":\"Article 104268\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457325002092\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325002092","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
A pseudo-labeling approach based on knowledge distillation for graph few-shot learning
Graph-based few-shot node classification (FSNC) has emerged as a promising solution to the challenge of limited labeled nodes in complex network analysis. Although existing pseudo-labeling FSNC methods have shown encouraging results, they often struggle with wrong or over-confident pseudo-labels, which can negatively impact model generalization. To overcome these limitations, we propose PLD-FSNC, a novel pseudo-labeling FSNC framework leveraging knowledge distillation. Our PLD-FSNC framework is composed of two modules, i.e., embedding transfer and pseudo-label improvement. The embedding transfer module transfers knowledge from a pre-trained source model to a target model, enhancing pseudo-label selection quality. The pseudo-label improvement module mitigates the impact of wrong and over-confident pseudo-labels by using soft labels from the source model to supervise the target model’s predictions. We also provide theoretical justification for our pseudo-label improvement module and demonstrate its effectiveness through extensive experiments on six real-world datasets.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.