{"title":"A knowledge enhanced learning and semantic composition model for multi-claim fact checking","authors":"Shuai Wang, Penghui Wei, Qingchao Kong, Wenji Mao","doi":"10.1016/j.knosys.2024.112439","DOIUrl":null,"url":null,"abstract":"<div><p>To inhibit the spread of rumorous information and its severe impacts, fact checking aims at retrieving relevant evidence to verify the veracity of a given statement. Fact checking methods typically use knowledge graphs (KGs) as external repositories and develop reasoning mechanism to retrieve evidence for verifying the statement. Existing fact checking methods have focused on verifying the statement of a single claim expressed by a clause. However, as real-world rumorous information is usually complex and a textual statement is often composed of multiple clauses (i.e. represented as multiple claims instead of a single one), multi-claim fact checking is not only necessary but more important for practical applications. Multi-claim statements imply rich contextual information and modeling the interactions of multiple claims can facilitate better verification. In this paper, we propose a knowledge enhanced learning and semantic composition model for multi-claim fact checking. Our model consists of two modules, KG-based learning enhancement and multi-claim semantic composition. To fully utilize the contextual information implied in multiple claims, the KG-based learning enhancement module learns the dynamic context-specific representations via selectively aggregating relevant attributes of entities. To robustly verify multiple claims robustly, the multi-claim semantic composition module learns a unified representation for multiple claims by modeling inter-claim interactions, and then verify them as a whole on the basis of this. We conduct experimental studies to validate our proposed method, and the experimental results on three typically datasets confirmed the efficacy of our model for multi-claim fact checking.</p></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":null,"pages":null},"PeriodicalIF":7.2000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124010736","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
To inhibit the spread of rumorous information and its severe impacts, fact checking aims at retrieving relevant evidence to verify the veracity of a given statement. Fact checking methods typically use knowledge graphs (KGs) as external repositories and develop reasoning mechanism to retrieve evidence for verifying the statement. Existing fact checking methods have focused on verifying the statement of a single claim expressed by a clause. However, as real-world rumorous information is usually complex and a textual statement is often composed of multiple clauses (i.e. represented as multiple claims instead of a single one), multi-claim fact checking is not only necessary but more important for practical applications. Multi-claim statements imply rich contextual information and modeling the interactions of multiple claims can facilitate better verification. In this paper, we propose a knowledge enhanced learning and semantic composition model for multi-claim fact checking. Our model consists of two modules, KG-based learning enhancement and multi-claim semantic composition. To fully utilize the contextual information implied in multiple claims, the KG-based learning enhancement module learns the dynamic context-specific representations via selectively aggregating relevant attributes of entities. To robustly verify multiple claims robustly, the multi-claim semantic composition module learns a unified representation for multiple claims by modeling inter-claim interactions, and then verify them as a whole on the basis of this. We conduct experimental studies to validate our proposed method, and the experimental results on three typically datasets confirmed the efficacy of our model for multi-claim fact checking.
为了抑制谣言信息的传播及其严重影响,事实核查的目的是检索相关证据,以验证给定声明的真实性。事实核查方法通常使用知识图谱(KG)作为外部存储库,并开发推理机制来检索证据以验证声明。现有的事实检查方法侧重于验证由条款表达的单一主张的声明。然而,由于现实世界中的谣言信息通常比较复杂,而且一个文本声明往往由多个分句组成(即表示为多个主张而非单一主张),因此多主张事实检查在实际应用中不仅必要,而且更加重要。多主张语句意味着丰富的上下文信息,对多主张的交互进行建模可以促进更好的验证。在本文中,我们提出了一种用于多索赔事实检查的知识增强学习和语义组合模型。我们的模型由两个模块组成:基于知识的学习增强和多声明语义合成。为了充分利用多索赔中隐含的上下文信息,基于 KG 的学习增强模块通过选择性地聚合实体的相关属性来学习动态的上下文特定表征。为了稳健地验证多个权利要求,多权利要求语义组合模块通过对权利要求之间的交互建模来学习多个权利要求的统一表示,然后在此基础上对它们进行整体验证。我们进行了实验研究来验证我们提出的方法,在三个典型数据集上的实验结果证实了我们的模型在多索赔事实检查方面的有效性。
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.