{"title":"Sentiment analysis on microblog data based on word embedding and fusion techniques","authors":"Ahmet Hayran, M. Sert","doi":"10.1109/SIU.2017.7960519","DOIUrl":null,"url":null,"abstract":"People often use social platforms to state their views and desires. Twitter is one of the most popular microblog service used for this purpose. In this study, we propose a new approach for automatically classifying the sentiment of microblog messages. The proposed approach is based on utilizing robust feature representation and fusion. We make use of word embedding technique as the feature representation and the Support Vector Machine as the classifier. In our approach, we first calculate statistical measures from word embedding representations and fuse them using different combinations. Learning is performed using these fused features and tested on the Turkish tweet dataset. Results show that the proposed approach significantly reduces the dimension of tweet representation and enhances sentiment classification accuracy. Best performance is attained by the proposed Dvot fusion technique with an accuracy of %80.05.","PeriodicalId":217576,"journal":{"name":"2017 25th Signal Processing and Communications Applications Conference (SIU)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 25th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU.2017.7960519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16
Abstract
People often use social platforms to state their views and desires. Twitter is one of the most popular microblog service used for this purpose. In this study, we propose a new approach for automatically classifying the sentiment of microblog messages. The proposed approach is based on utilizing robust feature representation and fusion. We make use of word embedding technique as the feature representation and the Support Vector Machine as the classifier. In our approach, we first calculate statistical measures from word embedding representations and fuse them using different combinations. Learning is performed using these fused features and tested on the Turkish tweet dataset. Results show that the proposed approach significantly reduces the dimension of tweet representation and enhances sentiment classification accuracy. Best performance is attained by the proposed Dvot fusion technique with an accuracy of %80.05.