使用卷积神经网络(CNN)方法设计仇恨语音传感器的不和谐机器人

2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE) Pub Date : 2023-02-16 DOI:10.1109/ICCoSITE57641.2023.10127699

Nicholas Hadi, V. C. Mawardi, J. Hendryli

{"title":"使用卷积神经网络(CNN)方法设计仇恨语音传感器的不和谐机器人","authors":"Nicholas Hadi, V. C. Mawardi, J. Hendryli","doi":"10.1109/ICCoSITE57641.2023.10127699","DOIUrl":null,"url":null,"abstract":"Discord is growing in popularity, makes hard for an admin of Discord server maintaining their member in their everyday chat activity in their server. This no longer an issue if there is Discord bot that can detect hate speech feature in text message that member send and automatically censor them. The classifier for this experiment is using Convolutional Neural Network (CNN) method. The dataset for training and validation model are containing total 6 category of hates speech, abusive language, religion, race, gender, physical, and non-hate speech. The Discord bot program only can classify a message in Indonesian language. The dataset used for training and validation models was obtained from Kaggle and for additional data taken from Discord server messages totaling 18,986 sentences which will be divided by 80% training data and 20% test data. The final results of the training model experiment, this CNN model can classify test data with an average precision value of 89%, 90% recall, and 88,33% F1 score. The CNN model is integrated into a bot application which will be tested on messages sent from the test Discord server. Out of 279 messages, the designed Discord bot can obtain an accuracy of 70.6%.","PeriodicalId":256184,"journal":{"name":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discord Bot Design for Hate Speech Sensor Using Convolutional Neural Networks (CNN) Method\",\"authors\":\"Nicholas Hadi, V. C. Mawardi, J. Hendryli\",\"doi\":\"10.1109/ICCoSITE57641.2023.10127699\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Discord is growing in popularity, makes hard for an admin of Discord server maintaining their member in their everyday chat activity in their server. This no longer an issue if there is Discord bot that can detect hate speech feature in text message that member send and automatically censor them. The classifier for this experiment is using Convolutional Neural Network (CNN) method. The dataset for training and validation model are containing total 6 category of hates speech, abusive language, religion, race, gender, physical, and non-hate speech. The Discord bot program only can classify a message in Indonesian language. The dataset used for training and validation models was obtained from Kaggle and for additional data taken from Discord server messages totaling 18,986 sentences which will be divided by 80% training data and 20% test data. The final results of the training model experiment, this CNN model can classify test data with an average precision value of 89%, 90% recall, and 88,33% F1 score. The CNN model is integrated into a bot application which will be tested on messages sent from the test Discord server. Out of 279 messages, the designed Discord bot can obtain an accuracy of 70.6%.\",\"PeriodicalId\":256184,\"journal\":{\"name\":\"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCoSITE57641.2023.10127699\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCoSITE57641.2023.10127699","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

不和谐越来越受欢迎，使得不和谐服务器的管理员很难在他们的服务器上维护他们的成员的日常聊天活动。这不再是一个问题，如果有不和机器人可以检测仇恨言论功能的短信，成员发送并自动审查他们。本实验的分类器使用卷积神经网络(CNN)方法。用于训练和验证模型的数据集包含仇恨言论、辱骂语言、宗教、种族、性别、身体和非仇恨言论共6类。Discord机器人程序只能对印尼语的信息进行分类。用于训练和验证模型的数据集是从Kaggle获得的，用于从Discord服务器消息中获取的额外数据总计18,986个句子，这些句子将被80%的训练数据和20%的测试数据所分割。训练模型实验的最终结果表明，该CNN模型可以对测试数据进行分类，平均精度值为89%，召回率为90%，F1得分为88.33%。CNN模型被集成到一个机器人应用程序中，该应用程序将对从测试Discord服务器发送的消息进行测试。在279条信息中，设计的Discord机器人可以获得70.6%的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Discord Bot Design for Hate Speech Sensor Using Convolutional Neural Networks (CNN) Method

Discord is growing in popularity, makes hard for an admin of Discord server maintaining their member in their everyday chat activity in their server. This no longer an issue if there is Discord bot that can detect hate speech feature in text message that member send and automatically censor them. The classifier for this experiment is using Convolutional Neural Network (CNN) method. The dataset for training and validation model are containing total 6 category of hates speech, abusive language, religion, race, gender, physical, and non-hate speech. The Discord bot program only can classify a message in Indonesian language. The dataset used for training and validation models was obtained from Kaggle and for additional data taken from Discord server messages totaling 18,986 sentences which will be divided by 80% training data and 20% test data. The final results of the training model experiment, this CNN model can classify test data with an average precision value of 89%, 90% recall, and 88,33% F1 score. The CNN model is integrated into a bot application which will be tested on messages sent from the test Discord server. Out of 279 messages, the designed Discord bot can obtain an accuracy of 70.6%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)

自引率

0.00%

发文量