{"title":"A Headline-Centric Graph-Based Dual Context Matching Approach for Incongruent News Detection","authors":"Sujit Kumar;Saurabh Kumar;Sanasam Ranbir Singh","doi":"10.1109/TCSS.2024.3384698","DOIUrl":null,"url":null,"abstract":"The prevalence of incongruent news has demonstrated its significant role in propagating fake news, which catalyzes the dissemination of both misinformation and disinformation. Consequently, detecting incongruent news articles is an important research problem to counter early spreading of misinformation. In the literature, researchers have explored various bag-of-word-based features, news body-centric and news headline-centric encoding methods for incongruent news article detection. However, headline-centric and body-centric approaches in the literature fail to detect partially incongruent articles efficiently. Motivated by the above limitations, this study proposes graph-based dual context matching (GDCM), which first represents headlines and news bodies as a \n<italic>bigram</i>\n network to capture contextual relations between words and document structure. For every word in the headline, GDCM extracts dual contexts (positive and negative) from the \n<italic>bigram</i>\n network representing news body and estimates similarity between dual contexts and the headline for incongruent news detection. We conduct extensive experiments on three publicly available benchmark datasets and compare its performance with 16 baseline models. Our experimental results suggest that the proposed model outperforms existing state-of-the-art models and efficiently detects partially incongruent news. We further validate the performance of the proposed model through several ablation studies. The following key observations can be made from the ablation studies: 1) extracting dual \n<italic>bigram</i>\n context of words in the headline from different segments of news body and then estimating the similarity between dual \n<italic>bigram</i>\n contexts from news body and the headline helps in incongruent news detection and also helps in detecting partial incongruent news efficiently; and 2) representing news headlines and bodies in the form of a network based on \n<italic>bigram</i>\n context helps to capture better nonlinear and contextual relationships between headline and body.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"11 5","pages":"5913-5924"},"PeriodicalIF":4.5000,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Social Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10509576/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
引用次数: 0
Abstract
The prevalence of incongruent news has demonstrated its significant role in propagating fake news, which catalyzes the dissemination of both misinformation and disinformation. Consequently, detecting incongruent news articles is an important research problem to counter early spreading of misinformation. In the literature, researchers have explored various bag-of-word-based features, news body-centric and news headline-centric encoding methods for incongruent news article detection. However, headline-centric and body-centric approaches in the literature fail to detect partially incongruent articles efficiently. Motivated by the above limitations, this study proposes graph-based dual context matching (GDCM), which first represents headlines and news bodies as a
bigram
network to capture contextual relations between words and document structure. For every word in the headline, GDCM extracts dual contexts (positive and negative) from the
bigram
network representing news body and estimates similarity between dual contexts and the headline for incongruent news detection. We conduct extensive experiments on three publicly available benchmark datasets and compare its performance with 16 baseline models. Our experimental results suggest that the proposed model outperforms existing state-of-the-art models and efficiently detects partially incongruent news. We further validate the performance of the proposed model through several ablation studies. The following key observations can be made from the ablation studies: 1) extracting dual
bigram
context of words in the headline from different segments of news body and then estimating the similarity between dual
bigram
contexts from news body and the headline helps in incongruent news detection and also helps in detecting partial incongruent news efficiently; and 2) representing news headlines and bodies in the form of a network based on
bigram
context helps to capture better nonlinear and contextual relationships between headline and body.
期刊介绍:
IEEE Transactions on Computational Social Systems focuses on such topics as modeling, simulation, analysis and understanding of social systems from the quantitative and/or computational perspective. "Systems" include man-man, man-machine and machine-machine organizations and adversarial situations as well as social media structures and their dynamics. More specifically, the proposed transactions publishes articles on modeling the dynamics of social systems, methodologies for incorporating and representing socio-cultural and behavioral aspects in computational modeling, analysis of social system behavior and structure, and paradigms for social systems modeling and simulation. The journal also features articles on social network dynamics, social intelligence and cognition, social systems design and architectures, socio-cultural modeling and representation, and computational behavior modeling, and their applications.