{"title":"An introspection of graph structure learning: A graph skeleton extraction via minimum dominating set","authors":"Zifeng Ye, Aifu Han, Guolin Chen, Xiaoxia Huang","doi":"10.1016/j.neucom.2025.130257","DOIUrl":null,"url":null,"abstract":"<div><div>Graph structure learning (GSL) is a data-driven learning approach that has garnered widespread attention in recent years. Nevertheless, the insufficient understanding of latent graph properties poses various challenges for effective graph modeling. This raises the following question: What type of graph skeleton can preserve the most crucial latent properties that significantly impact the performance of graph neural networks (GNNs) in downstream tasks? To this end, we have conducted a comprehensive study on three key graph properties: homophily, degree distribution, and connected components, and determined how these factors influence semi-supervised node classification tasks. Specifically, the influence of homophily on GNN performance is rigorously assessed. Motivated by the analysis, a dual-sparsity graph extraction method, based on the minimum dominating set (MDS), is proposed, to intelligently select informative edges under a given edge sampling ratio. This method effectively captures the scale-free characteristics of the degree distribution, and prioritizes the preservation of node connectivity. Experimental results show that homophily is a key factor in achieving high GNN accuracy. Additionally, the degree distribution and connected components describe the connectivity patterns of the graph from both local and global topological perspectives, which are highly correlated with node classification performance under the GNN message-passing mechanism. This work reveals the necessity of considering the graph skeleton and provides a stepping stone for facilitating GSL using these latent graph properties.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"639 ","pages":"Article 130257"},"PeriodicalIF":5.5000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225009294","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Graph structure learning (GSL) is a data-driven learning approach that has garnered widespread attention in recent years. Nevertheless, the insufficient understanding of latent graph properties poses various challenges for effective graph modeling. This raises the following question: What type of graph skeleton can preserve the most crucial latent properties that significantly impact the performance of graph neural networks (GNNs) in downstream tasks? To this end, we have conducted a comprehensive study on three key graph properties: homophily, degree distribution, and connected components, and determined how these factors influence semi-supervised node classification tasks. Specifically, the influence of homophily on GNN performance is rigorously assessed. Motivated by the analysis, a dual-sparsity graph extraction method, based on the minimum dominating set (MDS), is proposed, to intelligently select informative edges under a given edge sampling ratio. This method effectively captures the scale-free characteristics of the degree distribution, and prioritizes the preservation of node connectivity. Experimental results show that homophily is a key factor in achieving high GNN accuracy. Additionally, the degree distribution and connected components describe the connectivity patterns of the graph from both local and global topological perspectives, which are highly correlated with node classification performance under the GNN message-passing mechanism. This work reveals the necessity of considering the graph skeleton and provides a stepping stone for facilitating GSL using these latent graph properties.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.