{"title":"Bengali Abusive Speech Classification: A Transfer Learning Approach Using VGG-16","authors":"Shantanu Kumar Rahut, Riffat Sharmin, Ridma Tabassum","doi":"10.1109/ETCCE51779.2020.9350919","DOIUrl":null,"url":null,"abstract":"Swear words used in speech to abuse someone is frowned upon in every society. Abusive speeches can destroy the victim's morale, mental strength, and the will to live. Abusing others through social media, video streaming sites, and over voice calls are becoming a common problem. There are laws to punish the offenders. However, without proper surveillance, stopping abusive speech is tough. Machine learning can help to create surveillance methods by detecting abusive speech from human conversation. There have been a few works in the relevant field to detect abusive speech. However, detecting abusive speech in the Bengali language remains an unexplored area. This paper aims at providing an approach towards the classification of abusive and non-abusive Bengali speech. The authors collected 960 voice recordings of native Bengali speakers. The authors used Transfer Learning for extracting features from the data. Then, the authors used different methods for classification. The proposed approach achieves high accuracy (98.61%) in classifying abusive and non-abusive Bengali speech.","PeriodicalId":234459,"journal":{"name":"2020 Emerging Technology in Computing, Communication and Electronics (ETCCE)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Emerging Technology in Computing, Communication and Electronics (ETCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETCCE51779.2020.9350919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Swear words used in speech to abuse someone is frowned upon in every society. Abusive speeches can destroy the victim's morale, mental strength, and the will to live. Abusing others through social media, video streaming sites, and over voice calls are becoming a common problem. There are laws to punish the offenders. However, without proper surveillance, stopping abusive speech is tough. Machine learning can help to create surveillance methods by detecting abusive speech from human conversation. There have been a few works in the relevant field to detect abusive speech. However, detecting abusive speech in the Bengali language remains an unexplored area. This paper aims at providing an approach towards the classification of abusive and non-abusive Bengali speech. The authors collected 960 voice recordings of native Bengali speakers. The authors used Transfer Learning for extracting features from the data. Then, the authors used different methods for classification. The proposed approach achieves high accuracy (98.61%) in classifying abusive and non-abusive Bengali speech.