Yuan Li, Li Liu, Penggang Chen, Chenglin Zhang, Guoyin Wang
{"title":"增强图神经网络的自解释建模功能:多粒度感受野的因果视角","authors":"Yuan Li, Li Liu, Penggang Chen, Chenglin Zhang, Guoyin Wang","doi":"10.1016/j.ipm.2024.103821","DOIUrl":null,"url":null,"abstract":"<div><p>Self-explainable Graph Neural Networks (GNNs) provide explanations alongside their predictions, making the model transparent and facilitating their wide adoption in high-stakes tasks. Current studies on constructing such GNNs are limited by the single receptive field, resulting in the modeling of spurious correlations in self-explainable GNNs. To address this issue, this paper introduces a GNN model with incorporated multi-granularity receptive fields, capturing causal correlations during the model construction and providing explanations alongside its predictions. Specifically, we employ closeness matrices with multiple structural orders to construct multi-granularity receptive fields for the model. Subsequently, we design a model architecture with sliced channels to integrate representations learned from multiple receptive fields heuristically. Objective functions from a causal perspective are further designed to guide the optimization of the proposed model. Experiments conducted on five real-world datasets and one synthetic dataset demonstrate the superior performance of the proposed model. In terms of classification accuracy, compared to SOTA baseline, the proposed GNN achieves the improvement of 0.17%, 1.99%, 0.70%, 0.83%, and 0.78% on the real-world datasets MUTAG, PTC, PROTEINS, IMDB-M, and IMDB-B, respectively. Compared to three self-explainable baselines, qualitative and quantitative studies are conducted on MUTAG, PTC, PROTEINS, IMDB-M, IMDB-B, and the synthetic dataset Spurious-Motif. Experimental results confirm that the proposed model can accurately identify the essential substructures, such as <span><math><mrow><mi>N</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math></span> in the MUTAG dataset. Additionally, the proposed model assigns significant weights to the motif part and distinguishes it from the base part in the Spurious-Motif dataset, enhancing the accuracy of graph classification and the explanations of the predictions. The classifications along with explanations obtained with this approach align with human cognition and experience.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing graph neural networks for self-explainable modeling: A causal perspective with multi-granularity receptive fields\",\"authors\":\"Yuan Li, Li Liu, Penggang Chen, Chenglin Zhang, Guoyin Wang\",\"doi\":\"10.1016/j.ipm.2024.103821\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Self-explainable Graph Neural Networks (GNNs) provide explanations alongside their predictions, making the model transparent and facilitating their wide adoption in high-stakes tasks. Current studies on constructing such GNNs are limited by the single receptive field, resulting in the modeling of spurious correlations in self-explainable GNNs. To address this issue, this paper introduces a GNN model with incorporated multi-granularity receptive fields, capturing causal correlations during the model construction and providing explanations alongside its predictions. Specifically, we employ closeness matrices with multiple structural orders to construct multi-granularity receptive fields for the model. Subsequently, we design a model architecture with sliced channels to integrate representations learned from multiple receptive fields heuristically. Objective functions from a causal perspective are further designed to guide the optimization of the proposed model. Experiments conducted on five real-world datasets and one synthetic dataset demonstrate the superior performance of the proposed model. In terms of classification accuracy, compared to SOTA baseline, the proposed GNN achieves the improvement of 0.17%, 1.99%, 0.70%, 0.83%, and 0.78% on the real-world datasets MUTAG, PTC, PROTEINS, IMDB-M, and IMDB-B, respectively. Compared to three self-explainable baselines, qualitative and quantitative studies are conducted on MUTAG, PTC, PROTEINS, IMDB-M, IMDB-B, and the synthetic dataset Spurious-Motif. Experimental results confirm that the proposed model can accurately identify the essential substructures, such as <span><math><mrow><mi>N</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math></span> in the MUTAG dataset. Additionally, the proposed model assigns significant weights to the motif part and distinguishes it from the base part in the Spurious-Motif dataset, enhancing the accuracy of graph classification and the explanations of the predictions. The classifications along with explanations obtained with this approach align with human cognition and experience.</p></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2024-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457324001808\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457324001808","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Enhancing graph neural networks for self-explainable modeling: A causal perspective with multi-granularity receptive fields
Self-explainable Graph Neural Networks (GNNs) provide explanations alongside their predictions, making the model transparent and facilitating their wide adoption in high-stakes tasks. Current studies on constructing such GNNs are limited by the single receptive field, resulting in the modeling of spurious correlations in self-explainable GNNs. To address this issue, this paper introduces a GNN model with incorporated multi-granularity receptive fields, capturing causal correlations during the model construction and providing explanations alongside its predictions. Specifically, we employ closeness matrices with multiple structural orders to construct multi-granularity receptive fields for the model. Subsequently, we design a model architecture with sliced channels to integrate representations learned from multiple receptive fields heuristically. Objective functions from a causal perspective are further designed to guide the optimization of the proposed model. Experiments conducted on five real-world datasets and one synthetic dataset demonstrate the superior performance of the proposed model. In terms of classification accuracy, compared to SOTA baseline, the proposed GNN achieves the improvement of 0.17%, 1.99%, 0.70%, 0.83%, and 0.78% on the real-world datasets MUTAG, PTC, PROTEINS, IMDB-M, and IMDB-B, respectively. Compared to three self-explainable baselines, qualitative and quantitative studies are conducted on MUTAG, PTC, PROTEINS, IMDB-M, IMDB-B, and the synthetic dataset Spurious-Motif. Experimental results confirm that the proposed model can accurately identify the essential substructures, such as in the MUTAG dataset. Additionally, the proposed model assigns significant weights to the motif part and distinguishes it from the base part in the Spurious-Motif dataset, enhancing the accuracy of graph classification and the explanations of the predictions. The classifications along with explanations obtained with this approach align with human cognition and experience.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.