Make Heterophilic Graphs Better Fit GNN: A Graph Rewiring Approach

IF 8.9 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Knowledge and Data Engineering Pub Date : 2024-08-12 DOI:10.1109/TKDE.2024.3441766

Wendong Bi;Lun Du;Qiang Fu;Yanlin Wang;Shi Han;Dongmei Zhang

{"title":"Make Heterophilic Graphs Better Fit GNN: A Graph Rewiring Approach","authors":"Wendong Bi;Lun Du;Qiang Fu;Yanlin Wang;Shi Han;Dongmei Zhang","doi":"10.1109/TKDE.2024.3441766","DOIUrl":null,"url":null,"abstract":"Graph Neural Networks (GNNs) have shown superior performance in modeling graph data. Existing studies have shown that a lot of GNNs perform well on homophilic graphs while performing poorly on heterophilic graphs. Recently, researchers have turned their attention to design GNNs for heterophilic graphs by specific model design. Different from existing methods that mitigate heterophily by model design, we propose to study heterophilic graphs from an orthogonal perspective by rewiring the graph to reduce heterophily and make GNNs perform better. Through comprehensive empirical analysis, we verify the potential of graph rewiring methods. Then we propose a method named \n<bold>D</b>\neep \n<bold>H</b>\neterophily \n<bold>G</b>\nraph \n<bold>R</b>\newiring (DHGR) to rewire graphs by adding homophilic edges and pruning heterophilic edges. The rewiring operation is implemented by comparing the similarity of neighborhood label/feature distribution of node pairs. Besides, we design a scalable implementation for DHGR to guarantee a high efficiency. DHRG can be easily used as a plug-in module, i.e., a graph pre-processing step, for any GNNs, including both GNNs for homophily and heterophily, to boost their performance on the node classification task. To the best of our knowledge, it is the first work studying graph rewiring for heterophilic graphs. Extensive experiments on 11 public graph datasets demonstrate the superiority of our proposed methods.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"36 12","pages":"8744-8757"},"PeriodicalIF":8.9000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10634240/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Graph Neural Networks (GNNs) have shown superior performance in modeling graph data. Existing studies have shown that a lot of GNNs perform well on homophilic graphs while performing poorly on heterophilic graphs. Recently, researchers have turned their attention to design GNNs for heterophilic graphs by specific model design. Different from existing methods that mitigate heterophily by model design, we propose to study heterophilic graphs from an orthogonal perspective by rewiring the graph to reduce heterophily and make GNNs perform better. Through comprehensive empirical analysis, we verify the potential of graph rewiring methods. Then we propose a method named D eep H eterophily G raph R ewiring (DHGR) to rewire graphs by adding homophilic edges and pruning heterophilic edges. The rewiring operation is implemented by comparing the similarity of neighborhood label/feature distribution of node pairs. Besides, we design a scalable implementation for DHGR to guarantee a high efficiency. DHRG can be easily used as a plug-in module, i.e., a graph pre-processing step, for any GNNs, including both GNNs for homophily and heterophily, to boost their performance on the node classification task. To the best of our knowledge, it is the first work studying graph rewiring for heterophilic graphs. Extensive experiments on 11 public graph datasets demonstrate the superiority of our proposed methods.

查看原文本刊更多论文

让嗜异图更适合 GNN：图形重布线方法

图神经网络（GNN）在图数据建模方面表现出卓越的性能。现有研究表明，很多 GNN 在同亲图上表现良好，而在异亲图上表现不佳。最近，研究人员将注意力转向了通过特定模型设计来为异嗜图设计 GNN。与现有的通过模型设计减轻异嗜性的方法不同，我们建议从正交的角度研究异嗜性图，通过重新布线来减少异嗜性，使 GNN 发挥更好的性能。通过全面的实证分析，我们验证了图重新布线方法的潜力。然后，我们提出了一种名为 "深度嗜异性图重配（DHGR）"的方法，通过添加同嗜性边和修剪异嗜性边来重配图。重布线操作是通过比较节点对的邻域标签/特征分布的相似性来实现的。此外，我们还为 DHGR 设计了一种可扩展的实现方法，以保证高效率。DHRG 可以作为任何 GNN（包括同亲和异亲的 GNN）的插件模块（即图预处理步骤）轻松使用，以提高它们在节点分类任务中的性能。据我们所知，这是第一项研究异亲图的图重配的工作。在 11 个公共图数据集上的广泛实验证明了我们提出的方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.