{"title":"图结构学习的自省:基于最小支配集的图骨架提取","authors":"Zifeng Ye, Aifu Han, Guolin Chen, Xiaoxia Huang","doi":"10.1016/j.neucom.2025.130257","DOIUrl":null,"url":null,"abstract":"<div><div>Graph structure learning (GSL) is a data-driven learning approach that has garnered widespread attention in recent years. Nevertheless, the insufficient understanding of latent graph properties poses various challenges for effective graph modeling. This raises the following question: What type of graph skeleton can preserve the most crucial latent properties that significantly impact the performance of graph neural networks (GNNs) in downstream tasks? To this end, we have conducted a comprehensive study on three key graph properties: homophily, degree distribution, and connected components, and determined how these factors influence semi-supervised node classification tasks. Specifically, the influence of homophily on GNN performance is rigorously assessed. Motivated by the analysis, a dual-sparsity graph extraction method, based on the minimum dominating set (MDS), is proposed, to intelligently select informative edges under a given edge sampling ratio. This method effectively captures the scale-free characteristics of the degree distribution, and prioritizes the preservation of node connectivity. Experimental results show that homophily is a key factor in achieving high GNN accuracy. Additionally, the degree distribution and connected components describe the connectivity patterns of the graph from both local and global topological perspectives, which are highly correlated with node classification performance under the GNN message-passing mechanism. This work reveals the necessity of considering the graph skeleton and provides a stepping stone for facilitating GSL using these latent graph properties.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"639 ","pages":"Article 130257"},"PeriodicalIF":5.5000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An introspection of graph structure learning: A graph skeleton extraction via minimum dominating set\",\"authors\":\"Zifeng Ye, Aifu Han, Guolin Chen, Xiaoxia Huang\",\"doi\":\"10.1016/j.neucom.2025.130257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Graph structure learning (GSL) is a data-driven learning approach that has garnered widespread attention in recent years. Nevertheless, the insufficient understanding of latent graph properties poses various challenges for effective graph modeling. This raises the following question: What type of graph skeleton can preserve the most crucial latent properties that significantly impact the performance of graph neural networks (GNNs) in downstream tasks? To this end, we have conducted a comprehensive study on three key graph properties: homophily, degree distribution, and connected components, and determined how these factors influence semi-supervised node classification tasks. Specifically, the influence of homophily on GNN performance is rigorously assessed. Motivated by the analysis, a dual-sparsity graph extraction method, based on the minimum dominating set (MDS), is proposed, to intelligently select informative edges under a given edge sampling ratio. This method effectively captures the scale-free characteristics of the degree distribution, and prioritizes the preservation of node connectivity. Experimental results show that homophily is a key factor in achieving high GNN accuracy. Additionally, the degree distribution and connected components describe the connectivity patterns of the graph from both local and global topological perspectives, which are highly correlated with node classification performance under the GNN message-passing mechanism. This work reveals the necessity of considering the graph skeleton and provides a stepping stone for facilitating GSL using these latent graph properties.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"639 \",\"pages\":\"Article 130257\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225009294\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225009294","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
An introspection of graph structure learning: A graph skeleton extraction via minimum dominating set
Graph structure learning (GSL) is a data-driven learning approach that has garnered widespread attention in recent years. Nevertheless, the insufficient understanding of latent graph properties poses various challenges for effective graph modeling. This raises the following question: What type of graph skeleton can preserve the most crucial latent properties that significantly impact the performance of graph neural networks (GNNs) in downstream tasks? To this end, we have conducted a comprehensive study on three key graph properties: homophily, degree distribution, and connected components, and determined how these factors influence semi-supervised node classification tasks. Specifically, the influence of homophily on GNN performance is rigorously assessed. Motivated by the analysis, a dual-sparsity graph extraction method, based on the minimum dominating set (MDS), is proposed, to intelligently select informative edges under a given edge sampling ratio. This method effectively captures the scale-free characteristics of the degree distribution, and prioritizes the preservation of node connectivity. Experimental results show that homophily is a key factor in achieving high GNN accuracy. Additionally, the degree distribution and connected components describe the connectivity patterns of the graph from both local and global topological perspectives, which are highly correlated with node classification performance under the GNN message-passing mechanism. This work reveals the necessity of considering the graph skeleton and provides a stepping stone for facilitating GSL using these latent graph properties.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.