Yunxiao Yang, Jianting Chen, Xiaoying Gao, Yang Xiang
{"title":"用于知识图谱错误检测的双重去混淆因果干预方法","authors":"Yunxiao Yang, Jianting Chen, Xiaoying Gao, Yang Xiang","doi":"10.1016/j.knosys.2024.112644","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the Knowledge Graph (KG) construction process, erroneous triples are virtually inevitable to be introduced into real-world KGs. Since these errors hinder the expressiveness and applicability of KGs, the development of knowledge graph error detection (KGED) methods is necessary. Despite the overall effectiveness of current KGED methods, their capacity to identify challenging errors is limited. In this work, we conduct empirical studies and find that previous works introduce structural and semantic bias, impeding the identification of erroneous triples, especially in challenging cases. To address this issue, we design a causal graph for the KGED task and propose a Dual De-confounded Causal Intervention (DuDCI) method for debiasing. Firstly, DuDCI utilizes the neighborhood and textual descriptions of triples to calculate their graph and text embeddings. Next, a Causal De-confounded Module is constructed to mitigate the impact of shortcuts caused by the bias through the front-door adjustment. Furthermore, we introduce Disentanglement Constraints to disentangle the information expressed by each embedding, thereby facilitating further bias mitigation. Experimental results on three widely used KGED datasets validate the effectiveness of DuDCI and demonstrate that DuDCI outperforms current KGED methods, with an improvement of at least 2.2%, especially in more challenging noise scenarios.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"305 ","pages":"Article 112644"},"PeriodicalIF":7.2000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual De-confounded Causal Intervention method for knowledge graph error detection\",\"authors\":\"Yunxiao Yang, Jianting Chen, Xiaoying Gao, Yang Xiang\",\"doi\":\"10.1016/j.knosys.2024.112644\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Due to the Knowledge Graph (KG) construction process, erroneous triples are virtually inevitable to be introduced into real-world KGs. Since these errors hinder the expressiveness and applicability of KGs, the development of knowledge graph error detection (KGED) methods is necessary. Despite the overall effectiveness of current KGED methods, their capacity to identify challenging errors is limited. In this work, we conduct empirical studies and find that previous works introduce structural and semantic bias, impeding the identification of erroneous triples, especially in challenging cases. To address this issue, we design a causal graph for the KGED task and propose a Dual De-confounded Causal Intervention (DuDCI) method for debiasing. Firstly, DuDCI utilizes the neighborhood and textual descriptions of triples to calculate their graph and text embeddings. Next, a Causal De-confounded Module is constructed to mitigate the impact of shortcuts caused by the bias through the front-door adjustment. Furthermore, we introduce Disentanglement Constraints to disentangle the information expressed by each embedding, thereby facilitating further bias mitigation. Experimental results on three widely used KGED datasets validate the effectiveness of DuDCI and demonstrate that DuDCI outperforms current KGED methods, with an improvement of at least 2.2%, especially in more challenging noise scenarios.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"305 \",\"pages\":\"Article 112644\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2024-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705124012784\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124012784","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Dual De-confounded Causal Intervention method for knowledge graph error detection
Due to the Knowledge Graph (KG) construction process, erroneous triples are virtually inevitable to be introduced into real-world KGs. Since these errors hinder the expressiveness and applicability of KGs, the development of knowledge graph error detection (KGED) methods is necessary. Despite the overall effectiveness of current KGED methods, their capacity to identify challenging errors is limited. In this work, we conduct empirical studies and find that previous works introduce structural and semantic bias, impeding the identification of erroneous triples, especially in challenging cases. To address this issue, we design a causal graph for the KGED task and propose a Dual De-confounded Causal Intervention (DuDCI) method for debiasing. Firstly, DuDCI utilizes the neighborhood and textual descriptions of triples to calculate their graph and text embeddings. Next, a Causal De-confounded Module is constructed to mitigate the impact of shortcuts caused by the bias through the front-door adjustment. Furthermore, we introduce Disentanglement Constraints to disentangle the information expressed by each embedding, thereby facilitating further bias mitigation. Experimental results on three widely used KGED datasets validate the effectiveness of DuDCI and demonstrate that DuDCI outperforms current KGED methods, with an improvement of at least 2.2%, especially in more challenging noise scenarios.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.