Dian Isnaeni Nurul Afra, Agung Santosa, Radhiyatul Fajri, N. Hidayati, Elvira Nurfadhilah, Siska Pebiana, Lyla Ruslana Aini, Harnum A. Prafitia, Yosi Sahreza, Junanto Prihantoro, Gunarso, Andi Djalal Latief, M. T. Uliniansyah, Hammam Riza
{"title":"Developing Sentiment Analysis of Indonesian Social Media Based on Convolutional Neural Network for Smarter Society","authors":"Dian Isnaeni Nurul Afra, Agung Santosa, Radhiyatul Fajri, N. Hidayati, Elvira Nurfadhilah, Siska Pebiana, Lyla Ruslana Aini, Harnum A. Prafitia, Yosi Sahreza, Junanto Prihantoro, Gunarso, Andi Djalal Latief, M. T. Uliniansyah, Hammam Riza","doi":"10.1109/ICISS55894.2022.9915148","DOIUrl":null,"url":null,"abstract":"The need for mining the public's opinion on specific national development issues can be attained by sentiment analysis. There has been much research on sentiment analysis in Indonesian, but there is room for improvement. In this study, we performed experimentation by comparing previously tested convolutional neural network (CNN) architecture with our proposal utilizing a pre-trained word-embedded model: Word2Vec and GloVe that proved to be having better accuracy. The experiment used the SmSA dataset from IndoBERTweet of 11000 sentences. Utilization of the pretrained Word2Vec and GloVe model that was created from 16.2 million BRIN monolingual text corpus could represent a text in a better way. The word-embedding model can capture the semantic and syntactic meaning of a word so that they can conceive knowledge that may be used to raise classification performances. The experimental results show a better F1 score value of 80.06%, which increases by approximately 2.24%.","PeriodicalId":125054,"journal":{"name":"2022 International Conference on ICT for Smart Society (ICISS)","volume":"206 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on ICT for Smart Society (ICISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISS55894.2022.9915148","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The need for mining the public's opinion on specific national development issues can be attained by sentiment analysis. There has been much research on sentiment analysis in Indonesian, but there is room for improvement. In this study, we performed experimentation by comparing previously tested convolutional neural network (CNN) architecture with our proposal utilizing a pre-trained word-embedded model: Word2Vec and GloVe that proved to be having better accuracy. The experiment used the SmSA dataset from IndoBERTweet of 11000 sentences. Utilization of the pretrained Word2Vec and GloVe model that was created from 16.2 million BRIN monolingual text corpus could represent a text in a better way. The word-embedding model can capture the semantic and syntactic meaning of a word so that they can conceive knowledge that may be used to raise classification performances. The experimental results show a better F1 score value of 80.06%, which increases by approximately 2.24%.