Towards heterogeneous federated graph learning via structural entropy and prototype aggregation

IF 8.1 1区计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Sciences Pub Date : 2025-05-27 DOI:10.1016/j.ins.2025.122338

Zhehao Dai , Guojiang Shen , Haopeng Yuan , Shangfei Zheng , Yuyue Hu , Jiaxin Du , Xiangjie Kong , Feng Xia

{"title":"Towards heterogeneous federated graph learning via structural entropy and prototype aggregation","authors":"Zhehao Dai , Guojiang Shen , Haopeng Yuan , Shangfei Zheng , Yuyue Hu , Jiaxin Du , Xiangjie Kong , Feng Xia","doi":"10.1016/j.ins.2025.122338","DOIUrl":null,"url":null,"abstract":"<div><div>In today's data-driven landscape, Federated Graph Learning (FGL) facilitates collaborative training between distributed data while providing robust privacy protections. However, FGL faces significant challenges in practical application: data heterogeneity owing to divergent node distributions and graph structures across clients, coupled with model heterogeneity caused by heterogeneous GNN architectures, substantially impedes the aggregation efficacy and generalization capabilities of global models. Existing FGL frameworks often overlook the unique impact of graph topology, inherent to graph data, between data and model heterogeneity. We propose an innovative framework called Structural Entropy Federated Graph Learning (SEFGL) that leverages structural entropy to simultaneously address data and model heterogeneity. At the client level, structural entropy-based virtual node generation and graph reconstruction techniques are applied to strengthen minority class node representations and optimize local graph topology while maintaining the original data distribution. At the server level, a prototype learning approach based on structural entropy aggregates data from clients with similar entropy characteristics. This enables each client to acquire a more diverse global representation, fostering the development of a personalized and robust prototype. Experiments conducted on three graph datasets demonstrate that the SEFGL framework achieves superior performance in terms of generalizability, efficiency, and effectiveness in high-heterogeneity scenarios.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"718 ","pages":"Article 122338"},"PeriodicalIF":8.1000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525004700","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In today's data-driven landscape, Federated Graph Learning (FGL) facilitates collaborative training between distributed data while providing robust privacy protections. However, FGL faces significant challenges in practical application: data heterogeneity owing to divergent node distributions and graph structures across clients, coupled with model heterogeneity caused by heterogeneous GNN architectures, substantially impedes the aggregation efficacy and generalization capabilities of global models. Existing FGL frameworks often overlook the unique impact of graph topology, inherent to graph data, between data and model heterogeneity. We propose an innovative framework called Structural Entropy Federated Graph Learning (SEFGL) that leverages structural entropy to simultaneously address data and model heterogeneity. At the client level, structural entropy-based virtual node generation and graph reconstruction techniques are applied to strengthen minority class node representations and optimize local graph topology while maintaining the original data distribution. At the server level, a prototype learning approach based on structural entropy aggregates data from clients with similar entropy characteristics. This enables each client to acquire a more diverse global representation, fostering the development of a personalized and robust prototype. Experiments conducted on three graph datasets demonstrate that the SEFGL framework achieves superior performance in terms of generalizability, efficiency, and effectiveness in high-heterogeneity scenarios.

查看原文本刊更多论文

基于结构熵和原型聚合的异构联邦图学习

在当今数据驱动的环境中，联邦图学习（FGL）促进了分布式数据之间的协作训练，同时提供了强大的隐私保护。然而，FGL在实际应用中面临着重大挑战：客户端节点分布和图结构的差异导致数据异构，再加上异构GNN架构导致的模型异构，极大地阻碍了全局模型的聚合效率和泛化能力。现有的FGL框架往往忽略了图数据固有的图拓扑在数据和模型异质性之间的独特影响。我们提出了一个名为结构熵联邦图学习（SEFGL）的创新框架，它利用结构熵来同时解决数据和模型的异质性。在客户端层面，采用基于结构熵的虚拟节点生成和图重构技术，在保持原始数据分布的同时，增强少数类节点表示，优化局部图拓扑。在服务器层，基于结构熵的原型学习方法聚合来自具有相似熵特征的客户端的数据。这使得每个客户都能获得更多样化的全球代表，促进个性化和强大原型的发展。在三个图数据集上进行的实验表明，SEFGL框架在高异构场景下的泛化性、效率和有效性方面都取得了优异的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Sciences 工程技术-计算机：信息系统

CiteScore

14.00

自引率

17.30%

发文量

1322

审稿时长

10.4 months

期刊介绍： Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.