AuCoGNN: Enhancing Graph Fairness Learning Under Distribution Shifts With Automated Graph Generation

IF 10.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Knowledge and Data Engineering Pub Date : 2025-07-14 DOI:10.1109/TKDE.2025.3586276

Xiao Wang;Yibo Li;Yujie Xing;Shaohua Fan;Chuan Shi

{"title":"AuCoGNN: Enhancing Graph Fairness Learning Under Distribution Shifts With Automated Graph Generation","authors":"Xiao Wang;Yibo Li;Yujie Xing;Shaohua Fan;Chuan Shi","doi":"10.1109/TKDE.2025.3586276","DOIUrl":null,"url":null,"abstract":"Graph neural networks (GNNs) have shown strong performance on graph-structured data but may inherit bias from training data, leading to discriminatory predictions based on sensitive attributes like gender and race. Existing fairness methods assume that training and testing data share the same distribution, but how fairness is affected under distribution shifts remains largely unexplored. To address this, we first identify theoretical factors that cause bias in graphs and explore how fairness is influenced by distribution shifts, particularly focusing on representation distances between groups in training and testing graphs. Based on this, we propose FatraGNN, which uses a graph generator to create biased graphs from different distributions and an alignment module to reduce representation distances for specific groups. This improves fairness and classification performance on unseen graphs. However, FatraGNN has limitations in generating realistic graphs and addressing group differentiation. To overcome these, we introduce AuCoGNN, which includes an automated graph generation module and a contrastive alignment mechanism. This ensures better fairness by maximizing the representation distance between the same certain groups while minimizing the representation distance between different groups. Experiments on real-world and semi-synthetic datasets demonstrate the effectiveness of both models in improving fairness and accuracy.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 10","pages":"5781-5794"},"PeriodicalIF":10.4000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11080132/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Graph neural networks (GNNs) have shown strong performance on graph-structured data but may inherit bias from training data, leading to discriminatory predictions based on sensitive attributes like gender and race. Existing fairness methods assume that training and testing data share the same distribution, but how fairness is affected under distribution shifts remains largely unexplored. To address this, we first identify theoretical factors that cause bias in graphs and explore how fairness is influenced by distribution shifts, particularly focusing on representation distances between groups in training and testing graphs. Based on this, we propose FatraGNN, which uses a graph generator to create biased graphs from different distributions and an alignment module to reduce representation distances for specific groups. This improves fairness and classification performance on unseen graphs. However, FatraGNN has limitations in generating realistic graphs and addressing group differentiation. To overcome these, we introduce AuCoGNN, which includes an automated graph generation module and a contrastive alignment mechanism. This ensures better fairness by maximizing the representation distance between the same certain groups while minimizing the representation distance between different groups. Experiments on real-world and semi-synthetic datasets demonstrate the effectiveness of both models in improving fairness and accuracy.

查看原文本刊更多论文

基于自动图生成的分布变化下的图公平性学习

图神经网络（gnn）在图结构数据上表现出色，但可能会从训练数据中继承偏见，导致基于性别和种族等敏感属性的歧视性预测。现有的公平性方法假设训练数据和测试数据共享相同的分布，但在分布变化下公平性如何受到影响仍然未被探索。为了解决这个问题，我们首先确定了导致图中偏差的理论因素，并探讨了公平性如何受到分布变化的影响，特别是关注训练图和测试图中组间的表示距离。在此基础上，我们提出了FatraGNN，它使用图生成器从不同的分布中创建有偏差的图，并使用对齐模块减少特定组的表示距离。这提高了未见图的公平性和分类性能。然而，FatraGNN在生成逼真的图形和处理群体分化方面存在局限性。为了克服这些问题，我们引入了AuCoGNN，它包括一个自动图形生成模块和一个对比对齐机制。这通过最大化相同特定组之间的表示距离，同时最小化不同组之间的表示距离来确保更好的公平性。在真实世界和半合成数据集上的实验证明了这两种模型在提高公平性和准确性方面的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.