{"title":"基于词嵌入和融合技术的微博数据情感分析","authors":"Ahmet Hayran, M. Sert","doi":"10.1109/SIU.2017.7960519","DOIUrl":null,"url":null,"abstract":"People often use social platforms to state their views and desires. Twitter is one of the most popular microblog service used for this purpose. In this study, we propose a new approach for automatically classifying the sentiment of microblog messages. The proposed approach is based on utilizing robust feature representation and fusion. We make use of word embedding technique as the feature representation and the Support Vector Machine as the classifier. In our approach, we first calculate statistical measures from word embedding representations and fuse them using different combinations. Learning is performed using these fused features and tested on the Turkish tweet dataset. Results show that the proposed approach significantly reduces the dimension of tweet representation and enhances sentiment classification accuracy. Best performance is attained by the proposed Dvot fusion technique with an accuracy of %80.05.","PeriodicalId":217576,"journal":{"name":"2017 25th Signal Processing and Communications Applications Conference (SIU)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"Sentiment analysis on microblog data based on word embedding and fusion techniques\",\"authors\":\"Ahmet Hayran, M. Sert\",\"doi\":\"10.1109/SIU.2017.7960519\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"People often use social platforms to state their views and desires. Twitter is one of the most popular microblog service used for this purpose. In this study, we propose a new approach for automatically classifying the sentiment of microblog messages. The proposed approach is based on utilizing robust feature representation and fusion. We make use of word embedding technique as the feature representation and the Support Vector Machine as the classifier. In our approach, we first calculate statistical measures from word embedding representations and fuse them using different combinations. Learning is performed using these fused features and tested on the Turkish tweet dataset. Results show that the proposed approach significantly reduces the dimension of tweet representation and enhances sentiment classification accuracy. Best performance is attained by the proposed Dvot fusion technique with an accuracy of %80.05.\",\"PeriodicalId\":217576,\"journal\":{\"name\":\"2017 25th Signal Processing and Communications Applications Conference (SIU)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 25th Signal Processing and Communications Applications Conference (SIU)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIU.2017.7960519\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 25th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU.2017.7960519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sentiment analysis on microblog data based on word embedding and fusion techniques
People often use social platforms to state their views and desires. Twitter is one of the most popular microblog service used for this purpose. In this study, we propose a new approach for automatically classifying the sentiment of microblog messages. The proposed approach is based on utilizing robust feature representation and fusion. We make use of word embedding technique as the feature representation and the Support Vector Machine as the classifier. In our approach, we first calculate statistical measures from word embedding representations and fuse them using different combinations. Learning is performed using these fused features and tested on the Turkish tweet dataset. Results show that the proposed approach significantly reduces the dimension of tweet representation and enhances sentiment classification accuracy. Best performance is attained by the proposed Dvot fusion technique with an accuracy of %80.05.