{"title":"基于多个深度神经网络的言语虐待分类","authors":"Hyunju Park, Hong Kook Kim","doi":"10.1109/ICAIIC51459.2021.9415218","DOIUrl":null,"url":null,"abstract":"People can be exposed to verbal abuse practically anywhere. It is considered to be one of serious issues in society. In this paper, we describe a method to classify verbal abuse into five lasses by adding a convolutional neural network (CNN), a long short-term memory, and a dense layer on top of bidirectional encoder representations from transformers (BERT). The data are collected from Korean drama, movies, and YouTube. Due to data imbalance, weighted random sampler and data augmentation are used to train the models to be generalized. Experiments show that BERT with CNN after data augmentation performs the highest accuracy among all the compared methods.","PeriodicalId":432977,"journal":{"name":"2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Verbal Abuse Classification Using Multiple Deep Neural Networks\",\"authors\":\"Hyunju Park, Hong Kook Kim\",\"doi\":\"10.1109/ICAIIC51459.2021.9415218\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"People can be exposed to verbal abuse practically anywhere. It is considered to be one of serious issues in society. In this paper, we describe a method to classify verbal abuse into five lasses by adding a convolutional neural network (CNN), a long short-term memory, and a dense layer on top of bidirectional encoder representations from transformers (BERT). The data are collected from Korean drama, movies, and YouTube. Due to data imbalance, weighted random sampler and data augmentation are used to train the models to be generalized. Experiments show that BERT with CNN after data augmentation performs the highest accuracy among all the compared methods.\",\"PeriodicalId\":432977,\"journal\":{\"name\":\"2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIIC51459.2021.9415218\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIIC51459.2021.9415218","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Verbal Abuse Classification Using Multiple Deep Neural Networks
People can be exposed to verbal abuse practically anywhere. It is considered to be one of serious issues in society. In this paper, we describe a method to classify verbal abuse into five lasses by adding a convolutional neural network (CNN), a long short-term memory, and a dense layer on top of bidirectional encoder representations from transformers (BERT). The data are collected from Korean drama, movies, and YouTube. Due to data imbalance, weighted random sampler and data augmentation are used to train the models to be generalized. Experiments show that BERT with CNN after data augmentation performs the highest accuracy among all the compared methods.