Chengcheng Xu, Tong Han, Tianfeng Wang, Xiao Han, Zhisong Pan
{"title":"Semantic-aware contrastive learning for graph classification","authors":"Chengcheng Xu, Tong Han, Tianfeng Wang, Xiao Han, Zhisong Pan","doi":"10.1016/j.eswa.2025.129976","DOIUrl":null,"url":null,"abstract":"<div><div>Currently, the performance of vanilla graph-level contrastive learning is limited by traditional data augmentation strategies and size imbalance. First, adding noise in the embedding space for data augmentation may cause the graph-level representation to exceed the class decision boundary and alter its original semantic information. Second, transferring information from head graphs to tail graphs to alleviate size imbalance without distinguishing the semantic information of the graphs may lead to suboptimal model performance. To address these issues, we propose a semantic-aware graph-level contrastive learning method named SAGCL. Specifically, SAGCL achieves controllable data augmentation by adjusting the closeness between class center features and augmented features, preserving the inherent structure and semantic information of the graph. Meanwhile, the intra-cluster variance is used as a regularization term to maintain the uniformity of the feature distribution. In addition, SAGCL employs a confidence-weighted approach to obtain the semantic prototypes of head graphs and tail graphs within each cluster. Then, the rich semantic information from the head graphs is transferred to the tail graphs, effectively enhancing the model’s ability to distinguish tail graphs. Experiments on graph classification tasks on eight imbalanced datasets demonstrate that SAGCL outperforms existing state-of-the-art methods.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"299 ","pages":"Article 129976"},"PeriodicalIF":7.5000,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425035912","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, the performance of vanilla graph-level contrastive learning is limited by traditional data augmentation strategies and size imbalance. First, adding noise in the embedding space for data augmentation may cause the graph-level representation to exceed the class decision boundary and alter its original semantic information. Second, transferring information from head graphs to tail graphs to alleviate size imbalance without distinguishing the semantic information of the graphs may lead to suboptimal model performance. To address these issues, we propose a semantic-aware graph-level contrastive learning method named SAGCL. Specifically, SAGCL achieves controllable data augmentation by adjusting the closeness between class center features and augmented features, preserving the inherent structure and semantic information of the graph. Meanwhile, the intra-cluster variance is used as a regularization term to maintain the uniformity of the feature distribution. In addition, SAGCL employs a confidence-weighted approach to obtain the semantic prototypes of head graphs and tail graphs within each cluster. Then, the rich semantic information from the head graphs is transferred to the tail graphs, effectively enhancing the model’s ability to distinguish tail graphs. Experiments on graph classification tasks on eight imbalanced datasets demonstrate that SAGCL outperforms existing state-of-the-art methods.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.