HarmonyNet: Navigating hate speech detection

Natural Language Processing Journal Pub Date : 2024-08-20 DOI:10.1016/j.nlp.2024.100098

Shaina Raza, Veronica Chatrath

引用次数: 0

Abstract

In the digital era, social media platforms have become central to communication across various domains. However, the vast spread of unregulated content often leads to the prevalence of hate speech and toxicity. Existing methods to detect this toxicity struggle with context sensitivity, accommodating diverse dialects, and adapting to varied communication styles. To tackle these challenges, we introduce an ensemble classifier that leverages the strengths of language models and traditional deep neural network architectures for more effective hate speech detection on social media. Our evaluations show that this hybrid approach outperforms individual models and exhibits robustness against adversarial attacks. Future efforts will aim to enhance the model’s architecture to further boost its efficiency and extend its capability to recognize hate speech across an even wider range of languages and dialects.

查看原文本刊更多论文

和谐网络：仇恨言论检测导航

在数字时代，社交媒体平台已成为各领域交流的核心。然而，大量不规范内容的传播往往导致仇恨言论和毒性泛滥。现有的检测方法在上下文敏感性、适应不同方言和适应不同交流风格等方面存在困难。为了应对这些挑战，我们引入了一种集合分类器，利用语言模型和传统深度神经网络架构的优势，更有效地检测社交媒体上的仇恨言论。我们的评估结果表明，这种混合方法的性能优于单个模型，并在对抗恶意攻击时表现出鲁棒性。未来，我们将致力于增强模型的架构，进一步提高其效率，并扩展其识别仇恨言论的能力，使其适用于更广泛的语言和方言。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Natural Language Processing Journal

自引率

0.00%

发文量