基于BERT算法的Twitter仇恨语音检测

Adine Nayla, C. Setianingsih, B. Dirgantoro
{"title":"基于BERT算法的Twitter仇恨语音检测","authors":"Adine Nayla, C. Setianingsih, B. Dirgantoro","doi":"10.1109/ICCoSITE57641.2023.10127831","DOIUrl":null,"url":null,"abstract":"Hate speech on one social media platform, Twitter, is uncommon. Users on the Twitter platform can freely obtain, exchange information, and express opinions. This is one of the main factors that a person can be exposed to hate speech on Twitter. Victims who are exposed to hate speech may suffer from mental health disorders because most victims of hate speech are attacked verbally or emotionally. However, the lack of countermeasures against the detection of hate speech on the Twitter social media platform is still rare. In this study, a simulation was carried out using the website, along with testing and analyzing the detection of hate speech. The test is done by inputting a text on the hate speech website, and then the website will do a preprocessing and analyze this text using the BERT algorithm to classify whether the word is hate speech or not. The training results found that the detection of hate speech on Twitter user accounts using the BERT Algorithm has a 78.69% accuracy, a 78.90% precision, a 78.69% recall, and a 78.77% F1 score against the classification of hate speech groups. Thus users will more easily detect hate speech on Twitter by using the hate speech website.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hate Speech Detection on Twitter Using BERT Algorithm\",\"authors\":\"Adine Nayla, C. Setianingsih, B. Dirgantoro\",\"doi\":\"10.1109/ICCoSITE57641.2023.10127831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hate speech on one social media platform, Twitter, is uncommon. Users on the Twitter platform can freely obtain, exchange information, and express opinions. This is one of the main factors that a person can be exposed to hate speech on Twitter. Victims who are exposed to hate speech may suffer from mental health disorders because most victims of hate speech are attacked verbally or emotionally. However, the lack of countermeasures against the detection of hate speech on the Twitter social media platform is still rare. In this study, a simulation was carried out using the website, along with testing and analyzing the detection of hate speech. The test is done by inputting a text on the hate speech website, and then the website will do a preprocessing and analyze this text using the BERT algorithm to classify whether the word is hate speech or not. The training results found that the detection of hate speech on Twitter user accounts using the BERT Algorithm has a 78.69% accuracy, a 78.90% precision, a 78.69% recall, and a 78.77% F1 score against the classification of hate speech groups. Thus users will more easily detect hate speech on Twitter by using the hate speech website.\",\"PeriodicalId\":256184,\"journal\":{\"name\":\"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCoSITE57641.2023.10127831\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCoSITE57641.2023.10127831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在社交媒体平台推特上发表仇恨言论并不常见。用户可以在Twitter平台上自由获取信息、交流信息、表达意见。这是一个人可能在推特上接触到仇恨言论的主要因素之一。遭受仇恨言论的受害者可能患有精神健康障碍,因为大多数仇恨言论的受害者都受到口头或情感上的攻击。然而,缺乏针对Twitter社交媒体平台上的仇恨言论检测的对策仍然很少见。在本研究中,使用该网站进行了模拟,同时测试和分析了仇恨言论的检测。测试是通过在仇恨言论网站上输入一个文本,然后该网站会对该文本进行预处理,并使用BERT算法对该文本进行分析,以分类该词是否为仇恨言论。训练结果发现,使用BERT算法对Twitter用户账户上的仇恨言论进行检测,准确率为78.69%,精密度为78.90%,召回率为78.69%,针对仇恨言论群体分类的F1得分为78.77%。因此,通过使用仇恨言论网站,用户将更容易发现Twitter上的仇恨言论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Hate Speech Detection on Twitter Using BERT Algorithm
Hate speech on one social media platform, Twitter, is uncommon. Users on the Twitter platform can freely obtain, exchange information, and express opinions. This is one of the main factors that a person can be exposed to hate speech on Twitter. Victims who are exposed to hate speech may suffer from mental health disorders because most victims of hate speech are attacked verbally or emotionally. However, the lack of countermeasures against the detection of hate speech on the Twitter social media platform is still rare. In this study, a simulation was carried out using the website, along with testing and analyzing the detection of hate speech. The test is done by inputting a text on the hate speech website, and then the website will do a preprocessing and analyze this text using the BERT algorithm to classify whether the word is hate speech or not. The training results found that the detection of hate speech on Twitter user accounts using the BERT Algorithm has a 78.69% accuracy, a 78.90% precision, a 78.69% recall, and a 78.77% F1 score against the classification of hate speech groups. Thus users will more easily detect hate speech on Twitter by using the hate speech website.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信