Deep learning-based approaches for abusive content detection and classification for multi-class online user-generated data

International Journal of Cognitive Computing in Engineering Pub Date : 2024-01-01 DOI:10.1016/j.ijcce.2024.02.002

Simrat Kaur, Sarbjeet Singh, Sakshi Kaushal

{"title":"Deep learning-based approaches for abusive content detection and classification for multi-class online user-generated data","authors":"Simrat Kaur, Sarbjeet Singh, Sakshi Kaushal","doi":"10.1016/j.ijcce.2024.02.002","DOIUrl":null,"url":null,"abstract":"<div><p>With the rapid growth of social media culture, the use of offensive or hateful language has surged, which necessitates the development of effective abusive language detection models for online platforms. This paper focuses on developing a multi-class classification model to identify different types of offensive language. The input data is taken in the form of labeled tweets and is classified into offensive language detection, offensive language categorization, and offensive language target identification. The data undergoes pre-processing, which removes NaN value and punctuation, as well as performs tokenization followed by the generation of a word cloud to assess data quality. Further, the tf-idf technique is used for the selection of features. In the case of classifiers, multiple deep learning techniques, namely, bidirectional gated recurrent unit, multi-dense long short-term memory, bidirectional long short-term memory, gated recurrent unit, and long short-term memory, are applied where it has been found that all the models, except long short-term memory, achieved a high accuracy of 99.9 % for offensive language target identification. Bidirectional LSTM and multi-dense LSTM obtained the lowest loss and RMSE values of 0.01 and 0.1, respectively. This research provides valuable insights and contributes to the development of effective abusive language detection methods to promote a safe and respectful online environment. The insights gained can aid platform administrators in efficiently moderating content and taking appropriate actions against offensive language.</p></div>","PeriodicalId":100694,"journal":{"name":"International Journal of Cognitive Computing in Engineering","volume":"5 ","pages":"Pages 104-122"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666307424000068/pdfft?md5=b671af6be3039a34479959213a86aa16&pid=1-s2.0-S2666307424000068-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Computing in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666307424000068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With the rapid growth of social media culture, the use of offensive or hateful language has surged, which necessitates the development of effective abusive language detection models for online platforms. This paper focuses on developing a multi-class classification model to identify different types of offensive language. The input data is taken in the form of labeled tweets and is classified into offensive language detection, offensive language categorization, and offensive language target identification. The data undergoes pre-processing, which removes NaN value and punctuation, as well as performs tokenization followed by the generation of a word cloud to assess data quality. Further, the tf-idf technique is used for the selection of features. In the case of classifiers, multiple deep learning techniques, namely, bidirectional gated recurrent unit, multi-dense long short-term memory, bidirectional long short-term memory, gated recurrent unit, and long short-term memory, are applied where it has been found that all the models, except long short-term memory, achieved a high accuracy of 99.9 % for offensive language target identification. Bidirectional LSTM and multi-dense LSTM obtained the lowest loss and RMSE values of 0.01 and 0.1, respectively. This research provides valuable insights and contributes to the development of effective abusive language detection methods to promote a safe and respectful online environment. The insights gained can aid platform administrators in efficiently moderating content and taking appropriate actions against offensive language.

查看原文本刊更多论文

基于深度学习的多类在线用户生成数据滥用内容检测和分类方法

随着社交媒体文化的快速发展，攻击性或仇恨性语言的使用激增，因此有必要为网络平台开发有效的辱骂性语言检测模型。本文的重点是开发一个多类分类模型，以识别不同类型的攻击性语言。输入数据采用带标签推文的形式，分为攻击性语言检测、攻击性语言分类和攻击性语言目标识别。数据经过预处理，去除 NaN 值和标点符号，并进行标记化，然后生成词云以评估数据质量。此外，还使用了 tf-idf 技术来选择特征。在分类器方面，应用了多种深度学习技术，即双向门控递归单元、多密度长短期记忆、双向长短期记忆、门控递归单元和长短期记忆，结果发现，除长短期记忆外，所有模型在攻击性语言目标识别方面都达到了 99.9% 的高准确率。双向 LSTM 和多密度 LSTM 的损耗和 RMSE 值最低，分别为 0.01 和 0.1。这项研究提供了有价值的见解，有助于开发有效的滥用语言检测方法，以促进安全和相互尊重的网络环境。所获得的洞察力可以帮助平台管理员有效地管理内容，并对攻击性语言采取适当的措施。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Cognitive Computing in Engineering

CiteScore

13.80

自引率

0.00%

发文量