An experimental study on feature engineering and learning approaches for aggression detection in social media

Antonela Tommasel, J. Rodriguez, D. Godoy
{"title":"An experimental study on feature engineering and learning approaches for aggression detection in social media","authors":"Antonela Tommasel, J. Rodriguez, D. Godoy","doi":"10.4114/INTARTIF.VOL22ISS63PP81-100","DOIUrl":null,"url":null,"abstract":"With the widespread of modern technologies and social media networks, a new form of bullying occurring anytime and anywhere has emerged. This new phenomenon, known as cyberaggression or cyberbullying, refers to aggressive and intentional acts aiming at repeatedly causing harm to other person involving rude, insulting, offensive, teasing or demoralising comments through online social media. As these aggressions represent a threatening experience to Internet users, especially kids and teens who are still shaping their identities, social relations and well-being, it is crucial to understand how cyberbullying occurs to prevent it from escalating. Considering the massive information on the Web, the developing of intelligent techniques for automatically detecting harmful content is gaining importance, allowing the monitoring of large-scale social media and the early detection of unwanted and aggressive situations. Even though several approaches have been developed over the last few years based both on traditional and deep learning techniques, several concerns arise over the duplication of research and the difficulty of comparing results. Moreover, there is no agreement regarding neither which type of technique is better suited for the task, nor the type of features in which learning should be based. The goal of this work is to shed some light on the effects of learning paradigms and feature engineering approaches for detecting aggressions in social media texts. In this context, this work provides an evaluation of diverse traditional and deep learning techniques based on diverse sets of features, across multiple social media sites. ","PeriodicalId":176050,"journal":{"name":"Inteligencia Artif.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Inteligencia Artif.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4114/INTARTIF.VOL22ISS63PP81-100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

With the widespread of modern technologies and social media networks, a new form of bullying occurring anytime and anywhere has emerged. This new phenomenon, known as cyberaggression or cyberbullying, refers to aggressive and intentional acts aiming at repeatedly causing harm to other person involving rude, insulting, offensive, teasing or demoralising comments through online social media. As these aggressions represent a threatening experience to Internet users, especially kids and teens who are still shaping their identities, social relations and well-being, it is crucial to understand how cyberbullying occurs to prevent it from escalating. Considering the massive information on the Web, the developing of intelligent techniques for automatically detecting harmful content is gaining importance, allowing the monitoring of large-scale social media and the early detection of unwanted and aggressive situations. Even though several approaches have been developed over the last few years based both on traditional and deep learning techniques, several concerns arise over the duplication of research and the difficulty of comparing results. Moreover, there is no agreement regarding neither which type of technique is better suited for the task, nor the type of features in which learning should be based. The goal of this work is to shed some light on the effects of learning paradigms and feature engineering approaches for detecting aggressions in social media texts. In this context, this work provides an evaluation of diverse traditional and deep learning techniques based on diverse sets of features, across multiple social media sites. 
社交媒体攻击检测的特征工程与学习方法实验研究
随着现代科技和社交媒体网络的普及,一种随时随地发生的新型欺凌行为出现了。这种新现象被称为网络攻击或网络欺凌,指的是通过网络社交媒体,通过粗鲁、侮辱、冒犯、戏弄或打击士气的言论,反复对他人造成伤害的攻击性和故意行为。由于这些攻击对互联网用户来说是一种威胁,尤其是对仍在塑造自己的身份、社会关系和福祉的儿童和青少年来说,了解网络欺凌是如何发生的,以防止其升级是至关重要的。考虑到网络上的海量信息,自动检测有害内容的智能技术的发展变得越来越重要,可以监控大规模的社交媒体,并及早发现不必要的和攻击性的情况。尽管在过去的几年中已经开发了几种基于传统和深度学习技术的方法,但由于研究的重复和比较结果的困难,出现了一些问题。此外,对于哪种类型的技术更适合这项任务,以及学习应该基于哪种类型的特征,都没有达成一致意见。这项工作的目的是阐明学习范式和特征工程方法对检测社交媒体文本中的攻击的影响。在此背景下,本研究基于多个社交媒体网站的不同特征集,对各种传统和深度学习技术进行了评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信