{"title":"基于深度学习的印尼语文本仇恨言论识别:初步研究","authors":"Erryan Sazany, I. Budi","doi":"10.1109/ICAITI.2018.8686725","DOIUrl":null,"url":null,"abstract":"This paper presents an implementation of hate speech identification task for text data written in Indonesian language. There are some studies purposed for similar problem, but all of them use classical machine learning approach, whose heavily depends on the feature engineering. Switching the domain of data set means that the feature engineering should be redone. To address this issue, this preliminary research proposes another method based on deep learning approach which needs no feature engineering and is also adaptive to the varying context. Using data sets sourced from Twitter posts, the proposed method gives better result of 94.5% F1-score at a minimum.","PeriodicalId":233598,"journal":{"name":"2018 International Conference on Applied Information Technology and Innovation (ICAITI)","volume":"13 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Deep Learning-Based Implementation of Hate Speech Identification on Texts in Indonesian: Preliminary Study\",\"authors\":\"Erryan Sazany, I. Budi\",\"doi\":\"10.1109/ICAITI.2018.8686725\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an implementation of hate speech identification task for text data written in Indonesian language. There are some studies purposed for similar problem, but all of them use classical machine learning approach, whose heavily depends on the feature engineering. Switching the domain of data set means that the feature engineering should be redone. To address this issue, this preliminary research proposes another method based on deep learning approach which needs no feature engineering and is also adaptive to the varying context. Using data sets sourced from Twitter posts, the proposed method gives better result of 94.5% F1-score at a minimum.\",\"PeriodicalId\":233598,\"journal\":{\"name\":\"2018 International Conference on Applied Information Technology and Innovation (ICAITI)\",\"volume\":\"13 6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Applied Information Technology and Innovation (ICAITI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAITI.2018.8686725\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Applied Information Technology and Innovation (ICAITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAITI.2018.8686725","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Learning-Based Implementation of Hate Speech Identification on Texts in Indonesian: Preliminary Study
This paper presents an implementation of hate speech identification task for text data written in Indonesian language. There are some studies purposed for similar problem, but all of them use classical machine learning approach, whose heavily depends on the feature engineering. Switching the domain of data set means that the feature engineering should be redone. To address this issue, this preliminary research proposes another method based on deep learning approach which needs no feature engineering and is also adaptive to the varying context. Using data sets sourced from Twitter posts, the proposed method gives better result of 94.5% F1-score at a minimum.