Chengcheng Xu , Tianfeng Wang , Man Chen , Jun Chen , Wei Li , Zhisong Pan
{"title":"GRAIL:平衡负抽样的图对比学习","authors":"Chengcheng Xu , Tianfeng Wang , Man Chen , Jun Chen , Wei Li , Zhisong Pan","doi":"10.1016/j.ipm.2025.104211","DOIUrl":null,"url":null,"abstract":"<div><div>Currently, some graph contrastive learning methods mitigate the class imbalance by balancing the number of anchors, overlooking the crucial role of negative samples in forming a regular simplex. Moreover, existing strategies select a limited number of positive samples with poor quality, causing the model to erroneously push away nodes with similar semantics. To address these issues, we propose a <strong>g</strong>raph cont<strong>r</strong>astive learning method with b<strong>a</strong>lanced negat<strong>i</strong>ve samp<strong>l</strong>ing, named GRAIL. Specifically, GRAIL introduces a multi-head similarity metric that leverages mixed probability distributions related to dimensional elements to adaptively select an equal number of hard negative samples within each non-anchor cluster. As a result, GRAIL not only promotes the formation of a regular simplex by balancing the gradient contributions of different negative classes but also selects the most informative hard negative samples to improve the distinguishing ability of minority classes while minimizing the impact on majority classes. Furthermore, GRAIL selects multiple positive samples with a high correct ratio using structural similarity and feature similarity, thereby enabling the model to learn trustworthy node representations. Since traditional contrastive loss focuses on the majority class while neglecting the minority class, a balanced contrastive loss is introduced to optimize node representations. Experiments on node classification, node clustering, and link prediction tasks across six imbalanced graph datasets demonstrate that GRAIL outperforms existing state-of-the-art methods. The source code is available at <span><span>https://github.com/xushucheng-coder/GRAIL/tree/master</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 5","pages":"Article 104211"},"PeriodicalIF":7.4000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GRAIL: Graph contrastive learning with balanced negative sampling\",\"authors\":\"Chengcheng Xu , Tianfeng Wang , Man Chen , Jun Chen , Wei Li , Zhisong Pan\",\"doi\":\"10.1016/j.ipm.2025.104211\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Currently, some graph contrastive learning methods mitigate the class imbalance by balancing the number of anchors, overlooking the crucial role of negative samples in forming a regular simplex. Moreover, existing strategies select a limited number of positive samples with poor quality, causing the model to erroneously push away nodes with similar semantics. To address these issues, we propose a <strong>g</strong>raph cont<strong>r</strong>astive learning method with b<strong>a</strong>lanced negat<strong>i</strong>ve samp<strong>l</strong>ing, named GRAIL. Specifically, GRAIL introduces a multi-head similarity metric that leverages mixed probability distributions related to dimensional elements to adaptively select an equal number of hard negative samples within each non-anchor cluster. As a result, GRAIL not only promotes the formation of a regular simplex by balancing the gradient contributions of different negative classes but also selects the most informative hard negative samples to improve the distinguishing ability of minority classes while minimizing the impact on majority classes. Furthermore, GRAIL selects multiple positive samples with a high correct ratio using structural similarity and feature similarity, thereby enabling the model to learn trustworthy node representations. Since traditional contrastive loss focuses on the majority class while neglecting the minority class, a balanced contrastive loss is introduced to optimize node representations. Experiments on node classification, node clustering, and link prediction tasks across six imbalanced graph datasets demonstrate that GRAIL outperforms existing state-of-the-art methods. The source code is available at <span><span>https://github.com/xushucheng-coder/GRAIL/tree/master</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"62 5\",\"pages\":\"Article 104211\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2025-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457325001529\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325001529","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
GRAIL: Graph contrastive learning with balanced negative sampling
Currently, some graph contrastive learning methods mitigate the class imbalance by balancing the number of anchors, overlooking the crucial role of negative samples in forming a regular simplex. Moreover, existing strategies select a limited number of positive samples with poor quality, causing the model to erroneously push away nodes with similar semantics. To address these issues, we propose a graph contrastive learning method with balanced negative sampling, named GRAIL. Specifically, GRAIL introduces a multi-head similarity metric that leverages mixed probability distributions related to dimensional elements to adaptively select an equal number of hard negative samples within each non-anchor cluster. As a result, GRAIL not only promotes the formation of a regular simplex by balancing the gradient contributions of different negative classes but also selects the most informative hard negative samples to improve the distinguishing ability of minority classes while minimizing the impact on majority classes. Furthermore, GRAIL selects multiple positive samples with a high correct ratio using structural similarity and feature similarity, thereby enabling the model to learn trustworthy node representations. Since traditional contrastive loss focuses on the majority class while neglecting the minority class, a balanced contrastive loss is introduced to optimize node representations. Experiments on node classification, node clustering, and link prediction tasks across six imbalanced graph datasets demonstrate that GRAIL outperforms existing state-of-the-art methods. The source code is available at https://github.com/xushucheng-coder/GRAIL/tree/master.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.