Sentiment Analysis Model for Fake News Identification in Arabic Tweets

2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT) Pub Date : 2021-10-13 DOI:10.1109/AICT52784.2021.9620509

Aktham Sawan, Thaer Thaher, Noor Abu-El-Rub

{"title":"Sentiment Analysis Model for Fake News Identification in Arabic Tweets","authors":"Aktham Sawan, Thaer Thaher, Noor Abu-El-Rub","doi":"10.1109/AICT52784.2021.9620509","DOIUrl":null,"url":null,"abstract":"Over the last few years, the exponential rise of social media, particularly Twitter, is becoming a major source of news sharing and consumption among users. These platforms allow users to publish, author and distribute content. These environments may be used to report and spread gossip and false news, whether accidentally or maliciously. Fake news and inaccurate machine-generated text are serious issues affecting societies worldwide, particularly the Arab world. This motivates efforts to identify fake and distorted news. This paper aims to introduce a robust prediction model to identify fake news in Arabic Tweets. Several Natural Language Processing (NLP), feature selection, and advanced ML algorithms were exploited to achieve this purpose. NLP techniques were used to process and transform the given tweets into structured form. The recursive feature elimination (RFE) technique was employed to eliminate uninformative features. ML methods were used to build the prediction model. Experimental results revealed the superiority of the Logistic Regression (LR) classifier among other algorithms. Moreover, RFE proved its ability to enhance the overall performance of the LR classifier. Overall, the proposed model provided an acceptable prediction accuracy of 82%.","PeriodicalId":150606,"journal":{"name":"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)","volume":"239 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICT52784.2021.9620509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Over the last few years, the exponential rise of social media, particularly Twitter, is becoming a major source of news sharing and consumption among users. These platforms allow users to publish, author and distribute content. These environments may be used to report and spread gossip and false news, whether accidentally or maliciously. Fake news and inaccurate machine-generated text are serious issues affecting societies worldwide, particularly the Arab world. This motivates efforts to identify fake and distorted news. This paper aims to introduce a robust prediction model to identify fake news in Arabic Tweets. Several Natural Language Processing (NLP), feature selection, and advanced ML algorithms were exploited to achieve this purpose. NLP techniques were used to process and transform the given tweets into structured form. The recursive feature elimination (RFE) technique was employed to eliminate uninformative features. ML methods were used to build the prediction model. Experimental results revealed the superiority of the Logistic Regression (LR) classifier among other algorithms. Moreover, RFE proved its ability to enhance the overall performance of the LR classifier. Overall, the proposed model provided an acceptable prediction accuracy of 82%.

查看原文本刊更多论文

阿拉伯语推文虚假新闻识别的情感分析模型

在过去的几年里，社交媒体(尤其是Twitter)呈指数级增长，正成为用户分享和消费新闻的主要来源。这些平台允许用户发布、创作和分发内容。这些环境可能被用来报道和传播八卦和虚假新闻，无论是偶然的还是恶意的。假新闻和不准确的机器生成文本是影响全球社会的严重问题，尤其是阿拉伯世界。这促使人们努力识别虚假和扭曲的新闻。本文旨在引入一种鲁棒预测模型来识别阿拉伯语推文中的假新闻。利用自然语言处理(NLP)、特征选择和高级ML算法来实现这一目的。NLP技术用于处理并将给定的推文转换为结构化形式。采用递归特征消除(RFE)技术消除非信息特征。采用ML方法建立预测模型。实验结果表明，逻辑回归(LR)分类器在其他算法中具有优越性。此外，RFE证明了其提高LR分类器整体性能的能力。总体而言，所提出的模型提供了82%的可接受的预测精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)

自引率

0.00%

发文量