D. Munandar, Andria Arisal, D. Riswantini, A. Rozie
{"title":"Text Classification for Sentiment Prediction of Social Media Dataset using Multichannel Convolution Neural Network","authors":"D. Munandar, Andria Arisal, D. Riswantini, A. Rozie","doi":"10.1109/IC3INA.2018.8629522","DOIUrl":null,"url":null,"abstract":"Text classification of social media documents requires a specified model for a particular dataset. We investigated the public opinion and sentiment from Indonesian twitter toward the employment problem. To predict the sentiment about this topic, we collected a total of 75,126 tweets which correlate with employment conversation for the training and testing process. We analyzed the collected tweets by applying convolution neural networks (CNN) model which expanded from the standard one-dimensional deep learning model to a multichannel model. Applying multiple parallel CNN processes, the documents were processed using different CNN topologies by varying word embedding dimension, kernel size, number of hidden layers or nodes, and also different n-grams. Experiment results with three-dimension channel have 0.0172 loss error and 0.9917 accuracy. CNN model with multichannel configuration provides maximum accuracy and small loss error in training and testing process of model development.","PeriodicalId":179466,"journal":{"name":"2018 International Conference on Computer, Control, Informatics and its Applications (IC3INA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Computer, Control, Informatics and its Applications (IC3INA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3INA.2018.8629522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Text classification of social media documents requires a specified model for a particular dataset. We investigated the public opinion and sentiment from Indonesian twitter toward the employment problem. To predict the sentiment about this topic, we collected a total of 75,126 tweets which correlate with employment conversation for the training and testing process. We analyzed the collected tweets by applying convolution neural networks (CNN) model which expanded from the standard one-dimensional deep learning model to a multichannel model. Applying multiple parallel CNN processes, the documents were processed using different CNN topologies by varying word embedding dimension, kernel size, number of hidden layers or nodes, and also different n-grams. Experiment results with three-dimension channel have 0.0172 loss error and 0.9917 accuracy. CNN model with multichannel configuration provides maximum accuracy and small loss error in training and testing process of model development.