Knowledge-aware multimodal pre-training for fake news detection

IF 14.7 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Information Fusion Pub Date : 2024-09-24 DOI:10.1016/j.inffus.2024.102715

Litian Zhang , Xiaoming Zhang , Ziyi Zhou , Xi Zhang , Philip S. Yu , Chaozhuo Li

{"title":"Knowledge-aware multimodal pre-training for fake news detection","authors":"Litian Zhang , Xiaoming Zhang , Ziyi Zhou , Xi Zhang , Philip S. Yu , Chaozhuo Li","doi":"10.1016/j.inffus.2024.102715","DOIUrl":null,"url":null,"abstract":"<div><div>Amidst the rapid propagation of multimodal fake news across various social media platforms, the identification and filtering of disinformation have emerged as critical areas of academic research. A salient characteristic of fake news lies in its diversity, encompassing text–image inconsistency, content–knowledge inconsistency, and content fabrication. However, existing endeavors are generally tailored to a specific subset of fake news, leading to limited universality. Moreover, these models primarily rely on scarce and exorbitant manually labeled annotations, which is incapable of providing sufficient learning signals to detect a variety of fake news. To address these challenges, we propose a novel knowledge-aware multimodal pre-training paradigm for fake news detection, dubbed KAMP. Our motivation lies in incorporating unsupervised correlations through pre-training tasks as complementary to alleviate the dependency on annotations. KAMP consists of a novel multimodal learning model and various delicate pre-training tasks to simultaneously capture valuable knowledge from single modality, multiple modalities, and background knowledge graphs. Our proposal undergoes comprehensive evaluation across two widely utilized datasets, and experimental results demonstrate the superiority of our proposal.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"114 ","pages":"Article 102715"},"PeriodicalIF":14.7000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253524004937","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Amidst the rapid propagation of multimodal fake news across various social media platforms, the identification and filtering of disinformation have emerged as critical areas of academic research. A salient characteristic of fake news lies in its diversity, encompassing text–image inconsistency, content–knowledge inconsistency, and content fabrication. However, existing endeavors are generally tailored to a specific subset of fake news, leading to limited universality. Moreover, these models primarily rely on scarce and exorbitant manually labeled annotations, which is incapable of providing sufficient learning signals to detect a variety of fake news. To address these challenges, we propose a novel knowledge-aware multimodal pre-training paradigm for fake news detection, dubbed KAMP. Our motivation lies in incorporating unsupervised correlations through pre-training tasks as complementary to alleviate the dependency on annotations. KAMP consists of a novel multimodal learning model and various delicate pre-training tasks to simultaneously capture valuable knowledge from single modality, multiple modalities, and background knowledge graphs. Our proposal undergoes comprehensive evaluation across two widely utilized datasets, and experimental results demonstrate the superiority of our proposal.

查看原文本刊更多论文

用于假新闻检测的知识感知多模态预培训

随着多模态假新闻在各种社交媒体平台上的迅速传播，识别和过滤虚假信息已成为学术研究的关键领域。假新闻的一个显著特点在于其多样性，包括文本与图像不一致、内容与知识不一致以及内容捏造。然而，现有的研究一般都是针对特定的假新闻子集，导致其普遍性有限。此外，这些模型主要依赖于稀缺且昂贵的人工标注注释，无法提供足够的学习信号来检测各种假新闻。为了应对这些挑战，我们提出了一种用于假新闻检测的新型知识感知多模态预训练范式，称为 KAMP。我们的动机在于通过预训练任务纳入无监督相关性作为补充，以减轻对注释的依赖。KAMP 由一个新颖的多模态学习模型和各种精细的预训练任务组成，可同时从单模态、多模态和背景知识图谱中获取有价值的知识。我们的建议在两个广泛使用的数据集上进行了全面评估，实验结果证明了我们建议的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Fusion 工程技术-计算机：理论方法

CiteScore

33.20

自引率

4.30%

发文量

161

审稿时长

7.9 months

期刊介绍： Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.