Automatic detection of manipulated Bangla news: A new knowledge-driven approach

Natural Language Processing Journal Pub Date : 2025-05-12 DOI:10.1016/j.nlp.2025.100155

Aysha Akther, Kazi Masudul Alam, Rameswar Debnath

{"title":"Automatic detection of manipulated Bangla news: A new knowledge-driven approach","authors":"Aysha Akther, Kazi Masudul Alam, Rameswar Debnath","doi":"10.1016/j.nlp.2025.100155","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, dissemination of misleading news has become easier than ever due to the simplicity of creating and distributing news content on online media platforms. Misleading news detection has become a global topic of interest due to its significant impact on society, economics, and politics. Automatic detection of the veracity of news remains challenging because of its diversity and close resemblance with true events. In many languages, fake news detection has been studied from different perspectives. However, in Bangla, existing endeavors on fake news detection generally relied on linguistic style analysis and latent representation-based machine learning and deep learning models. These models primarily rely on manually labeled annotations. To address these challenges, we proposed a knowledge-based Bangla fake news detection model that does not require model training. In our proposed manipulation detection approach, a news article is automatically labeled as fake or authentic based on an authenticity score that relies on the consistency of knowledge and semantics, underlying sentiment, and credibility of the news source. We also propose a consistent and context-aware manipulated news generation technique to facilitate the detection of partially manipulated Bangla news. We found the proposed model to be a reliable one for the detection of both fake news and partially manipulated news. We also developed a dataset that is balanced according to the number of authentic and fake news for the detection of Bangla fake news, where news items are collected from multiple domains and various news sources. The experimental evaluation of our proposed knowledge-driven approach on the developed dataset has shown 97.08% accuracy for only fake news detection.</div></div>","PeriodicalId":100944,"journal":{"name":"Natural Language Processing Journal","volume":"11 ","pages":"Article 100155"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Language Processing Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949719125000317","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, dissemination of misleading news has become easier than ever due to the simplicity of creating and distributing news content on online media platforms. Misleading news detection has become a global topic of interest due to its significant impact on society, economics, and politics. Automatic detection of the veracity of news remains challenging because of its diversity and close resemblance with true events. In many languages, fake news detection has been studied from different perspectives. However, in Bangla, existing endeavors on fake news detection generally relied on linguistic style analysis and latent representation-based machine learning and deep learning models. These models primarily rely on manually labeled annotations. To address these challenges, we proposed a knowledge-based Bangla fake news detection model that does not require model training. In our proposed manipulation detection approach, a news article is automatically labeled as fake or authentic based on an authenticity score that relies on the consistency of knowledge and semantics, underlying sentiment, and credibility of the news source. We also propose a consistent and context-aware manipulated news generation technique to facilitate the detection of partially manipulated Bangla news. We found the proposed model to be a reliable one for the detection of both fake news and partially manipulated news. We also developed a dataset that is balanced according to the number of authentic and fake news for the detection of Bangla fake news, where news items are collected from multiple domains and various news sources. The experimental evaluation of our proposed knowledge-driven approach on the developed dataset has shown 97.08% accuracy for only fake news detection.

Abstract Image

查看原文本刊更多论文

自动检测被操纵的孟加拉新闻：一种新的知识驱动的方法

近年来，由于在网络媒体平台上创建和分发新闻内容的简单性，误导性新闻的传播变得比以往任何时候都更容易。由于对社会、经济和政治的重大影响，误导性新闻检测已成为一个全球关注的话题。由于新闻的多样性和与真实事件的相似性，自动检测新闻的真实性仍然具有挑战性。在许多语言中，假新闻检测已经从不同的角度进行了研究。然而，在孟加拉国，现有的假新闻检测工作通常依赖于语言风格分析和基于潜在表示的机器学习和深度学习模型。这些模型主要依赖于手动标记的注释。为了解决这些挑战，我们提出了一个基于知识的孟加拉假新闻检测模型，该模型不需要模型训练。在我们提出的操纵检测方法中，根据真实性评分自动将新闻文章标记为假或真实，真实性评分依赖于知识和语义的一致性、潜在情绪和新闻来源的可信度。我们还提出了一个一致的和上下文感知操纵新闻生成技术，以促进部分操纵孟加拉新闻的检测。我们发现所提出的模型对于假新闻和部分被操纵的新闻的检测都是可靠的。我们还开发了一个数据集，该数据集根据真实新闻和假新闻的数量进行平衡，用于检测孟加拉国假新闻，其中新闻项目是从多个领域和各种新闻来源收集的。我们提出的知识驱动方法在开发的数据集上的实验评估显示，仅对假新闻检测准确率为97.08%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Natural Language Processing Journal

自引率

0.00%

发文量