Zhehao Dai , Guojiang Shen , Haopeng Yuan , Shangfei Zheng , Yuyue Hu , Jiaxin Du , Xiangjie Kong , Feng Xia
{"title":"Towards heterogeneous federated graph learning via structural entropy and prototype aggregation","authors":"Zhehao Dai , Guojiang Shen , Haopeng Yuan , Shangfei Zheng , Yuyue Hu , Jiaxin Du , Xiangjie Kong , Feng Xia","doi":"10.1016/j.ins.2025.122338","DOIUrl":null,"url":null,"abstract":"<div><div>In today's data-driven landscape, Federated Graph Learning (FGL) facilitates collaborative training between distributed data while providing robust privacy protections. However, FGL faces significant challenges in practical application: data heterogeneity owing to divergent node distributions and graph structures across clients, coupled with model heterogeneity caused by heterogeneous GNN architectures, substantially impedes the aggregation efficacy and generalization capabilities of global models. Existing FGL frameworks often overlook the unique impact of graph topology, inherent to graph data, between data and model heterogeneity. We propose an innovative framework called Structural Entropy Federated Graph Learning (SEFGL) that leverages structural entropy to simultaneously address data and model heterogeneity. At the client level, structural entropy-based virtual node generation and graph reconstruction techniques are applied to strengthen minority class node representations and optimize local graph topology while maintaining the original data distribution. At the server level, a prototype learning approach based on structural entropy aggregates data from clients with similar entropy characteristics. This enables each client to acquire a more diverse global representation, fostering the development of a personalized and robust prototype. Experiments conducted on three graph datasets demonstrate that the SEFGL framework achieves superior performance in terms of generalizability, efficiency, and effectiveness in high-heterogeneity scenarios.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"718 ","pages":"Article 122338"},"PeriodicalIF":8.1000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525004700","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In today's data-driven landscape, Federated Graph Learning (FGL) facilitates collaborative training between distributed data while providing robust privacy protections. However, FGL faces significant challenges in practical application: data heterogeneity owing to divergent node distributions and graph structures across clients, coupled with model heterogeneity caused by heterogeneous GNN architectures, substantially impedes the aggregation efficacy and generalization capabilities of global models. Existing FGL frameworks often overlook the unique impact of graph topology, inherent to graph data, between data and model heterogeneity. We propose an innovative framework called Structural Entropy Federated Graph Learning (SEFGL) that leverages structural entropy to simultaneously address data and model heterogeneity. At the client level, structural entropy-based virtual node generation and graph reconstruction techniques are applied to strengthen minority class node representations and optimize local graph topology while maintaining the original data distribution. At the server level, a prototype learning approach based on structural entropy aggregates data from clients with similar entropy characteristics. This enables each client to acquire a more diverse global representation, fostering the development of a personalized and robust prototype. Experiments conducted on three graph datasets demonstrate that the SEFGL framework achieves superior performance in terms of generalizability, efficiency, and effectiveness in high-heterogeneity scenarios.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.