A real-time framework for opinion spam detection in Arabic social networks

IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Cherry A. Ezzat , Abdullah M. Alkadri , Abeer Elkorany
{"title":"A real-time framework for opinion spam detection in Arabic social networks","authors":"Cherry A. Ezzat ,&nbsp;Abdullah M. Alkadri ,&nbsp;Abeer Elkorany","doi":"10.1016/j.eij.2025.100626","DOIUrl":null,"url":null,"abstract":"<div><div>In today’s interconnected digital landscape, social media platforms serve as the primary avenue for global conversations, encompassing various topics and opinions. Opinion spam entails spreading misleading content masked as authentic opinions. The propagation of opinion spam poses a significant challenge, undermining the authenticity and trustworthiness of online interactions and disturbing the unrestricted exchange of ideas. One of the main challenges in spam detection is the rapid flow of spam content, which necessitates real-time detection mechanisms. Additionally, another important obstacle in detecting spam on Arabic social networks is the limited availability of labeled data. This paper proposes a framework for Real-Time Arabic Opinion Spam Detection (RTAOSD) that was developed to effectively detect opinion spam within Arabic social networks. This framework integrates advanced machine learning models, sentiment Analysis, and real-time processing techniques to achieve accurate and efficient detection of opinion spam. Furthermore, RTAOSD categorizes the non-spam content according to its relevance to topic of interest in to purify the content appear to social network users. Experimental evaluations conducted on real-world datasets demonstrate the effectiveness of RTAOSD in detecting opinion spam which leads to provide users with filtered content that match with their interest and overcome the problem of information overloading. The proposed framework achieved macro-F1 scores for spam detection ranging from 91% to 99% on different Arabic datasets surpassing previous work. While for topic relevance classification, RTAOSD achieved a macro-F1 of 86% for binary relevance and 78% for categorical relevance outperforming previous approaches used. The outcomes of this research is a real-time Arabic spam detector that accurately detects spam content and classifies non-spam text according to its relevance to topic . In addition to providing a visualization module for analyzing and reporting the characteristics of the filtered text.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100626"},"PeriodicalIF":5.0000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866525000192","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

In today’s interconnected digital landscape, social media platforms serve as the primary avenue for global conversations, encompassing various topics and opinions. Opinion spam entails spreading misleading content masked as authentic opinions. The propagation of opinion spam poses a significant challenge, undermining the authenticity and trustworthiness of online interactions and disturbing the unrestricted exchange of ideas. One of the main challenges in spam detection is the rapid flow of spam content, which necessitates real-time detection mechanisms. Additionally, another important obstacle in detecting spam on Arabic social networks is the limited availability of labeled data. This paper proposes a framework for Real-Time Arabic Opinion Spam Detection (RTAOSD) that was developed to effectively detect opinion spam within Arabic social networks. This framework integrates advanced machine learning models, sentiment Analysis, and real-time processing techniques to achieve accurate and efficient detection of opinion spam. Furthermore, RTAOSD categorizes the non-spam content according to its relevance to topic of interest in to purify the content appear to social network users. Experimental evaluations conducted on real-world datasets demonstrate the effectiveness of RTAOSD in detecting opinion spam which leads to provide users with filtered content that match with their interest and overcome the problem of information overloading. The proposed framework achieved macro-F1 scores for spam detection ranging from 91% to 99% on different Arabic datasets surpassing previous work. While for topic relevance classification, RTAOSD achieved a macro-F1 of 86% for binary relevance and 78% for categorical relevance outperforming previous approaches used. The outcomes of this research is a real-time Arabic spam detector that accurately detects spam content and classifies non-spam text according to its relevance to topic . In addition to providing a visualization module for analyzing and reporting the characteristics of the filtered text.
阿拉伯社交网络中意见垃圾检测的实时框架
在当今相互关联的数字环境中,社交媒体平台是全球对话的主要渠道,涵盖了各种主题和观点。“意见垃圾”指的是伪装成真实意见的误导性内容的传播。垃圾意见的传播构成了一个重大挑战,破坏了在线互动的真实性和可信度,扰乱了思想的无限制交流。垃圾邮件检测的主要挑战之一是垃圾邮件内容的快速流动,这需要实时检测机制。此外,在阿拉伯社交网络上检测垃圾邮件的另一个重要障碍是标记数据的有限可用性。本文提出了一个实时阿拉伯语意见垃圾检测(RTAOSD)框架,该框架旨在有效地检测阿拉伯语社交网络中的意见垃圾。该框架集成了先进的机器学习模型、情感分析和实时处理技术,以实现准确有效的意见垃圾检测。此外,RTAOSD根据非垃圾内容与感兴趣话题的相关性对其进行分类,以净化出现在社交网络用户面前的内容。在真实数据集上进行的实验评估证明了RTAOSD在检测意见垃圾方面的有效性,从而为用户提供与其兴趣相匹配的过滤内容,并克服了信息过载的问题。所提出的框架在不同的阿拉伯数据集上实现了垃圾邮件检测的宏观f1分数,范围从91%到99%,超过了以前的工作。而对于主题相关性分类,RTAOSD在二元相关性和类别相关性方面的宏观f1分别达到86%和78%,优于之前使用的方法。本研究的结果是一个实时阿拉伯语垃圾邮件检测器,可以准确地检测垃圾邮件内容,并根据其与主题的相关性对非垃圾邮件文本进行分类。除了提供用于分析和报告过滤文本特征的可视化模块之外。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Egyptian Informatics Journal
Egyptian Informatics Journal Decision Sciences-Management Science and Operations Research
CiteScore
11.10
自引率
1.90%
发文量
59
审稿时长
110 days
期刊介绍: The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信