基于BERT算法的Twitter仇恨语音检测

2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE) Pub Date : 2023-02-16 DOI:10.1109/ICCoSITE57641.2023.10127831

Adine Nayla, C. Setianingsih, B. Dirgantoro

{"title":"基于BERT算法的Twitter仇恨语音检测","authors":"Adine Nayla, C. Setianingsih, B. Dirgantoro","doi":"10.1109/ICCoSITE57641.2023.10127831","DOIUrl":null,"url":null,"abstract":"Hate speech on one social media platform, Twitter, is uncommon. Users on the Twitter platform can freely obtain, exchange information, and express opinions. This is one of the main factors that a person can be exposed to hate speech on Twitter. Victims who are exposed to hate speech may suffer from mental health disorders because most victims of hate speech are attacked verbally or emotionally. However, the lack of countermeasures against the detection of hate speech on the Twitter social media platform is still rare. In this study, a simulation was carried out using the website, along with testing and analyzing the detection of hate speech. The test is done by inputting a text on the hate speech website, and then the website will do a preprocessing and analyze this text using the BERT algorithm to classify whether the word is hate speech or not. The training results found that the detection of hate speech on Twitter user accounts using the BERT Algorithm has a 78.69% accuracy, a 78.90% precision, a 78.69% recall, and a 78.77% F1 score against the classification of hate speech groups. Thus users will more easily detect hate speech on Twitter by using the hate speech website.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hate Speech Detection on Twitter Using BERT Algorithm\",\"authors\":\"Adine Nayla, C. Setianingsih, B. Dirgantoro\",\"doi\":\"10.1109/ICCoSITE57641.2023.10127831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hate speech on one social media platform, Twitter, is uncommon. Users on the Twitter platform can freely obtain, exchange information, and express opinions. This is one of the main factors that a person can be exposed to hate speech on Twitter. Victims who are exposed to hate speech may suffer from mental health disorders because most victims of hate speech are attacked verbally or emotionally. However, the lack of countermeasures against the detection of hate speech on the Twitter social media platform is still rare. In this study, a simulation was carried out using the website, along with testing and analyzing the detection of hate speech. The test is done by inputting a text on the hate speech website, and then the website will do a preprocessing and analyze this text using the BERT algorithm to classify whether the word is hate speech or not. The training results found that the detection of hate speech on Twitter user accounts using the BERT Algorithm has a 78.69% accuracy, a 78.90% precision, a 78.69% recall, and a 78.77% F1 score against the classification of hate speech groups. Thus users will more easily detect hate speech on Twitter by using the hate speech website.\",\"PeriodicalId\":256184,\"journal\":{\"name\":\"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCoSITE57641.2023.10127831\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCoSITE57641.2023.10127831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在社交媒体平台推特上发表仇恨言论并不常见。用户可以在Twitter平台上自由获取信息、交流信息、表达意见。这是一个人可能在推特上接触到仇恨言论的主要因素之一。遭受仇恨言论的受害者可能患有精神健康障碍，因为大多数仇恨言论的受害者都受到口头或情感上的攻击。然而，缺乏针对Twitter社交媒体平台上的仇恨言论检测的对策仍然很少见。在本研究中，使用该网站进行了模拟，同时测试和分析了仇恨言论的检测。测试是通过在仇恨言论网站上输入一个文本，然后该网站会对该文本进行预处理，并使用BERT算法对该文本进行分析，以分类该词是否为仇恨言论。训练结果发现，使用BERT算法对Twitter用户账户上的仇恨言论进行检测，准确率为78.69%，精密度为78.90%，召回率为78.69%，针对仇恨言论群体分类的F1得分为78.77%。因此，通过使用仇恨言论网站，用户将更容易发现Twitter上的仇恨言论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Hate Speech Detection on Twitter Using BERT Algorithm

Hate speech on one social media platform, Twitter, is uncommon. Users on the Twitter platform can freely obtain, exchange information, and express opinions. This is one of the main factors that a person can be exposed to hate speech on Twitter. Victims who are exposed to hate speech may suffer from mental health disorders because most victims of hate speech are attacked verbally or emotionally. However, the lack of countermeasures against the detection of hate speech on the Twitter social media platform is still rare. In this study, a simulation was carried out using the website, along with testing and analyzing the detection of hate speech. The test is done by inputting a text on the hate speech website, and then the website will do a preprocessing and analyze this text using the BERT algorithm to classify whether the word is hate speech or not. The training results found that the detection of hate speech on Twitter user accounts using the BERT Algorithm has a 78.69% accuracy, a 78.90% precision, a 78.69% recall, and a 78.77% F1 score against the classification of hate speech groups. Thus users will more easily detect hate speech on Twitter by using the hate speech website.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)

自引率

0.00%

发文量