A Headline-Centric Graph-Based Dual Context Matching Approach for Incongruent News Detection

IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS
Sujit Kumar;Saurabh Kumar;Sanasam Ranbir Singh
{"title":"A Headline-Centric Graph-Based Dual Context Matching Approach for Incongruent News Detection","authors":"Sujit Kumar;Saurabh Kumar;Sanasam Ranbir Singh","doi":"10.1109/TCSS.2024.3384698","DOIUrl":null,"url":null,"abstract":"The prevalence of incongruent news has demonstrated its significant role in propagating fake news, which catalyzes the dissemination of both misinformation and disinformation. Consequently, detecting incongruent news articles is an important research problem to counter early spreading of misinformation. In the literature, researchers have explored various bag-of-word-based features, news body-centric and news headline-centric encoding methods for incongruent news article detection. However, headline-centric and body-centric approaches in the literature fail to detect partially incongruent articles efficiently. Motivated by the above limitations, this study proposes graph-based dual context matching (GDCM), which first represents headlines and news bodies as a \n<italic>bigram</i>\n network to capture contextual relations between words and document structure. For every word in the headline, GDCM extracts dual contexts (positive and negative) from the \n<italic>bigram</i>\n network representing news body and estimates similarity between dual contexts and the headline for incongruent news detection. We conduct extensive experiments on three publicly available benchmark datasets and compare its performance with 16 baseline models. Our experimental results suggest that the proposed model outperforms existing state-of-the-art models and efficiently detects partially incongruent news. We further validate the performance of the proposed model through several ablation studies. The following key observations can be made from the ablation studies: 1) extracting dual \n<italic>bigram</i>\n context of words in the headline from different segments of news body and then estimating the similarity between dual \n<italic>bigram</i>\n contexts from news body and the headline helps in incongruent news detection and also helps in detecting partial incongruent news efficiently; and 2) representing news headlines and bodies in the form of a network based on \n<italic>bigram</i>\n context helps to capture better nonlinear and contextual relationships between headline and body.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"11 5","pages":"5913-5924"},"PeriodicalIF":4.5000,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Social Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10509576/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
引用次数: 0

Abstract

The prevalence of incongruent news has demonstrated its significant role in propagating fake news, which catalyzes the dissemination of both misinformation and disinformation. Consequently, detecting incongruent news articles is an important research problem to counter early spreading of misinformation. In the literature, researchers have explored various bag-of-word-based features, news body-centric and news headline-centric encoding methods for incongruent news article detection. However, headline-centric and body-centric approaches in the literature fail to detect partially incongruent articles efficiently. Motivated by the above limitations, this study proposes graph-based dual context matching (GDCM), which first represents headlines and news bodies as a bigram network to capture contextual relations between words and document structure. For every word in the headline, GDCM extracts dual contexts (positive and negative) from the bigram network representing news body and estimates similarity between dual contexts and the headline for incongruent news detection. We conduct extensive experiments on three publicly available benchmark datasets and compare its performance with 16 baseline models. Our experimental results suggest that the proposed model outperforms existing state-of-the-art models and efficiently detects partially incongruent news. We further validate the performance of the proposed model through several ablation studies. The following key observations can be made from the ablation studies: 1) extracting dual bigram context of words in the headline from different segments of news body and then estimating the similarity between dual bigram contexts from news body and the headline helps in incongruent news detection and also helps in detecting partial incongruent news efficiently; and 2) representing news headlines and bodies in the form of a network based on bigram context helps to capture better nonlinear and contextual relationships between headline and body.
基于标题中心图的不一致新闻检测双语境匹配方法
不一致新闻的盛行表明,它在传播假新闻方面起着重要作用,而假新闻则会催化错误信息和虚假信息的传播。因此,检测不一致的新闻文章是应对早期错误信息传播的一个重要研究课题。在文献中,研究人员探索了各种基于词袋特征、以新闻正文为中心和以新闻标题为中心的编码方法来检测不一致的新闻文章。然而,文献中以标题为中心和以正文为中心的方法无法有效地检测出部分不一致的文章。鉴于上述局限性,本研究提出了基于图的双重上下文匹配(GDCM),首先将标题和新闻正文表示为一个 bigram 网络,以捕捉单词和文档结构之间的上下文关系。对于标题中的每个单词,GDCM 都会从代表新闻正文的 bigram 网络中提取双重语境(正面语境和负面语境),并估算双重语境与标题之间的相似度,从而检测出不一致的新闻。我们在三个公开的基准数据集上进行了广泛的实验,并将其性能与 16 个基准模型进行了比较。实验结果表明,所提出的模型优于现有的最先进模型,并能有效地检测出部分不一致的新闻。我们通过几项消融研究进一步验证了所提模型的性能。从消减研究中可以得出以下主要结论:1)从新闻正文的不同片段中提取标题中单词的双重大语义上下文,然后估计新闻正文和标题中双重大语义上下文之间的相似性,这有助于不一致新闻的检测,也有助于高效检测部分不一致新闻;2)基于大语义上下文以网络形式表示新闻标题和正文,有助于更好地捕捉标题和正文之间的非线性和上下文关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Computational Social Systems
IEEE Transactions on Computational Social Systems Social Sciences-Social Sciences (miscellaneous)
CiteScore
10.00
自引率
20.00%
发文量
316
期刊介绍: IEEE Transactions on Computational Social Systems focuses on such topics as modeling, simulation, analysis and understanding of social systems from the quantitative and/or computational perspective. "Systems" include man-man, man-machine and machine-machine organizations and adversarial situations as well as social media structures and their dynamics. More specifically, the proposed transactions publishes articles on modeling the dynamics of social systems, methodologies for incorporating and representing socio-cultural and behavioral aspects in computational modeling, analysis of social system behavior and structure, and paradigms for social systems modeling and simulation. The journal also features articles on social network dynamics, social intelligence and cognition, social systems design and architectures, socio-cultural modeling and representation, and computational behavior modeling, and their applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信