Hybrid Feature and Optimized Deep Learning Model Fusion for Detecting Hateful Arabic Content

IF 3.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Access Pub Date : 2025-07-22 DOI:10.1109/ACCESS.2025.3591673

Karim Gasmi;Ibtihel Ben Ltaifa;Alameen Eltoum Abdalrahman;Omer Hamid;Mohamed Othman Altaieb;Shahzad Ali;Lassaad Ben Ammar;Manel Mrabet

{"title":"Hybrid Feature and Optimized Deep Learning Model Fusion for Detecting Hateful Arabic Content","authors":"Karim Gasmi;Ibtihel Ben Ltaifa;Alameen Eltoum Abdalrahman;Omer Hamid;Mohamed Othman Altaieb;Shahzad Ali;Lassaad Ben Ammar;Manel Mrabet","doi":"10.1109/ACCESS.2025.3591673","DOIUrl":null,"url":null,"abstract":"Detecting hate speech in Arabic social media content is critical for ensuring safe, inclusive, and respectful online communication. However, this task remains challenging due to Arabic’s morphological richness, dialectal variations such as Levantine, and the scarcity of high-quality annotated data. This study proposes a comprehensive and language-aware approach to Arabic hate speech detection that integrates advanced preprocessing, targeted data augmentation, hybrid feature extraction, and deep ensemble learning. Our experiments are conducted on a Levantine Arabic tweet dataset labeled hateful or non-hateful. To address lexical variability and noise common in user-generated content, we apply a dedicated preprocessing pipeline that includes normalization, diacritic removal, and emoji filtering. To further enhance generalization and mitigate data imbalance, we employ two augmentation strategies: synonym replacement using a curated Arabic lexicon and semantic-preserving back-translation through English. We investigate lexical and contextual approaches for feature extraction, including TF-IDF vectors, contextualized AraBERT embeddings, and a hybrid combination of both. These features are input into multiple deep learning classifiers, including CNN-BiGRU, BiLSTM, and DNN architectures. To maximize predictive performance, we develop an ensemble framework that integrates these models. The final prediction is obtained through a weighted fusion of individual model outputs, where the optimal weights are selected using the Grey Wolf Optimizer (GWO), aiming to maximize classification accuracy. Experimental results demonstrate that our proposed hybrid and ensemble-based architecture achieves superior performance, with an accuracy of 83.33% and a ROC-AUC score of 89.5%, outperforming individual models and conventional baselines. These findings highlight the effectiveness of hybrid feature representations and nature-inspired optimization in enhancing Arabic hate speech detection. Our approach offers a scalable, linguistically informed solution for robust content moderation in Arabic digital spaces.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"131411-131431"},"PeriodicalIF":3.6000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11088089","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11088089/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Detecting hate speech in Arabic social media content is critical for ensuring safe, inclusive, and respectful online communication. However, this task remains challenging due to Arabic’s morphological richness, dialectal variations such as Levantine, and the scarcity of high-quality annotated data. This study proposes a comprehensive and language-aware approach to Arabic hate speech detection that integrates advanced preprocessing, targeted data augmentation, hybrid feature extraction, and deep ensemble learning. Our experiments are conducted on a Levantine Arabic tweet dataset labeled hateful or non-hateful. To address lexical variability and noise common in user-generated content, we apply a dedicated preprocessing pipeline that includes normalization, diacritic removal, and emoji filtering. To further enhance generalization and mitigate data imbalance, we employ two augmentation strategies: synonym replacement using a curated Arabic lexicon and semantic-preserving back-translation through English. We investigate lexical and contextual approaches for feature extraction, including TF-IDF vectors, contextualized AraBERT embeddings, and a hybrid combination of both. These features are input into multiple deep learning classifiers, including CNN-BiGRU, BiLSTM, and DNN architectures. To maximize predictive performance, we develop an ensemble framework that integrates these models. The final prediction is obtained through a weighted fusion of individual model outputs, where the optimal weights are selected using the Grey Wolf Optimizer (GWO), aiming to maximize classification accuracy. Experimental results demonstrate that our proposed hybrid and ensemble-based architecture achieves superior performance, with an accuracy of 83.33% and a ROC-AUC score of 89.5%, outperforming individual models and conventional baselines. These findings highlight the effectiveness of hybrid feature representations and nature-inspired optimization in enhancing Arabic hate speech detection. Our approach offers a scalable, linguistically informed solution for robust content moderation in Arabic digital spaces.

查看原文本刊更多论文

混合特征和优化深度学习模型融合检测仇恨阿拉伯语内容

检测阿拉伯社交媒体内容中的仇恨言论对于确保安全、包容和尊重的在线交流至关重要。然而，由于阿拉伯语的形态丰富，方言变化（如黎凡特语）以及高质量注释数据的缺乏，这项任务仍然具有挑战性。本研究提出了一种全面的、语言感知的阿拉伯仇恨言论检测方法，该方法集成了先进的预处理、目标数据增强、混合特征提取和深度集成学习。我们的实验是在标有可恨或非可恨的黎凡特阿拉伯语推文数据集上进行的。为了解决用户生成内容中常见的词汇可变性和噪声，我们应用了一个专用的预处理管道，包括归一化、变音符去除和表情符号过滤。为了进一步增强泛化和减轻数据不平衡，我们采用了两种增强策略：使用精心设计的阿拉伯语词典替换同义词和通过英语进行语义保留反翻译。我们研究了用于特征提取的词法和上下文方法，包括TF-IDF向量、上下文化的AraBERT嵌入以及两者的混合组合。这些特征被输入到多个深度学习分类器中，包括CNN-BiGRU、BiLSTM和DNN架构。为了最大限度地提高预测性能，我们开发了一个集成这些模型的集成框架。最终的预测是通过对单个模型输出进行加权融合得到的，其中使用灰狼优化器（GWO）选择最优权重，旨在最大化分类精度。实验结果表明，我们提出的混合和基于集成的架构取得了优异的性能，准确率为83.33%，ROC-AUC得分为89.5%，优于单个模型和传统基线。这些发现突出了混合特征表示和自然启发优化在增强阿拉伯语仇恨言论检测方面的有效性。我们的方法为阿拉伯数字空间的稳健内容审核提供了一个可扩展的、语言知情的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Access COMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC

CiteScore

9.80

自引率

7.70%

发文量

6673

审稿时长

6 weeks

期刊介绍： IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest. IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on: Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals. Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering. Development of new or improved fabrication or manufacturing techniques. Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.