Comparison Support Vector Machine and Naive Bayes Methods for Classifying Cyberbullying in Twitter

N. Chamidah, R. Sahawaly
{"title":"Comparison Support Vector Machine and Naive Bayes Methods for Classifying Cyberbullying in Twitter","authors":"N. Chamidah, R. Sahawaly","doi":"10.26555/jiteki.v7i2.21175","DOIUrl":null,"url":null,"abstract":"Twitter users in Indonesia in 2019 were recorded at 6.43 million. The high level of Twitter users makes it allows for free opinion to anyone. It can cause cyberbullying. Victims of cyberbullying experienced higher levels of depression than other verbal acts of violence. The forms of cyberbullying that occur on Twitter are flaming, Denigration, and Body Shaming. The research contribution is able to make social media developers and users more aware of the type of cyberbullying that social media users sometimes do without realizing it. Social media developers can prevent cyberbullying by using policies such as word detection and filtering features that indicate cyberbullying more accurately by classifying it by type and using the most accurate method. To classify cyberbullying forms in twitter, in this study, we use the Naïve Bayes method and Support Vector Machine (SVM) and compare them based on classification accuracy. This research will also identify words that are characteristic of each category of cyberbullying so that each category is easy to identify by social media users and makes it easier to avoid cyberbullying. The results of this study are the classification accuracy of Naïve Bayes of 97.99% and the classification accuracy of SVM of 99.60%. It means that SVM is better than Naïve Bayes for classifying the forms of cyberbullying in Twitter. Twitter with an average classification accuracy of 99.60%. It shows that the SVM method can classify cyberbullying forms better than the Naïve Bayes Classifier method.","PeriodicalId":244902,"journal":{"name":"Jurnal Ilmiah Teknik Elektro Komputer dan Informatika","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Ilmiah Teknik Elektro Komputer dan Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26555/jiteki.v7i2.21175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Twitter users in Indonesia in 2019 were recorded at 6.43 million. The high level of Twitter users makes it allows for free opinion to anyone. It can cause cyberbullying. Victims of cyberbullying experienced higher levels of depression than other verbal acts of violence. The forms of cyberbullying that occur on Twitter are flaming, Denigration, and Body Shaming. The research contribution is able to make social media developers and users more aware of the type of cyberbullying that social media users sometimes do without realizing it. Social media developers can prevent cyberbullying by using policies such as word detection and filtering features that indicate cyberbullying more accurately by classifying it by type and using the most accurate method. To classify cyberbullying forms in twitter, in this study, we use the Naïve Bayes method and Support Vector Machine (SVM) and compare them based on classification accuracy. This research will also identify words that are characteristic of each category of cyberbullying so that each category is easy to identify by social media users and makes it easier to avoid cyberbullying. The results of this study are the classification accuracy of Naïve Bayes of 97.99% and the classification accuracy of SVM of 99.60%. It means that SVM is better than Naïve Bayes for classifying the forms of cyberbullying in Twitter. Twitter with an average classification accuracy of 99.60%. It shows that the SVM method can classify cyberbullying forms better than the Naïve Bayes Classifier method.
推特网络欺凌分类的比较支持向量机与朴素贝叶斯方法
据记录,2019年印尼的推特用户为643万。Twitter用户的高水平使得任何人都可以自由发表意见。这可能会导致网络欺凌。网络欺凌的受害者比其他言语暴力的受害者更容易抑郁。发生在Twitter上的网络欺凌的形式是燃烧,诋毁和身体羞辱。这项研究的贡献能够使社交媒体开发者和用户更加意识到社交媒体用户有时没有意识到的网络欺凌类型。社交媒体开发者可以通过使用诸如单词检测和过滤功能等策略来防止网络欺凌,这些策略可以通过按类型分类和使用最准确的方法来更准确地指示网络欺凌。为了对twitter中的网络欺凌形式进行分类,本研究使用Naïve贝叶斯方法和支持向量机(SVM),并根据分类准确率对两者进行比较。本研究还将识别每一类网络欺凌的特征词汇,以便社交媒体用户容易识别每一类网络欺凌,从而更容易避免网络欺凌。本研究结果为Naïve贝叶斯的分类准确率为97.99%,SVM的分类准确率为99.60%。这意味着SVM对Twitter中网络欺凌形式的分类优于Naïve Bayes。Twitter的平均分类准确率为99.60%。结果表明,SVM方法对网络欺凌形式的分类效果优于Naïve贝叶斯分类器方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信