{"title":"Deep Learning-Based Implementation of Hate Speech Identification on Texts in Indonesian: Preliminary Study","authors":"Erryan Sazany, I. Budi","doi":"10.1109/ICAITI.2018.8686725","DOIUrl":null,"url":null,"abstract":"This paper presents an implementation of hate speech identification task for text data written in Indonesian language. There are some studies purposed for similar problem, but all of them use classical machine learning approach, whose heavily depends on the feature engineering. Switching the domain of data set means that the feature engineering should be redone. To address this issue, this preliminary research proposes another method based on deep learning approach which needs no feature engineering and is also adaptive to the varying context. Using data sets sourced from Twitter posts, the proposed method gives better result of 94.5% F1-score at a minimum.","PeriodicalId":233598,"journal":{"name":"2018 International Conference on Applied Information Technology and Innovation (ICAITI)","volume":"13 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Applied Information Technology and Innovation (ICAITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAITI.2018.8686725","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
This paper presents an implementation of hate speech identification task for text data written in Indonesian language. There are some studies purposed for similar problem, but all of them use classical machine learning approach, whose heavily depends on the feature engineering. Switching the domain of data set means that the feature engineering should be redone. To address this issue, this preliminary research proposes another method based on deep learning approach which needs no feature engineering and is also adaptive to the varying context. Using data sets sourced from Twitter posts, the proposed method gives better result of 94.5% F1-score at a minimum.