{"title":"Interpolative self-training approach for sentiment analysis","authors":"S. Aghababaei, M. Makrehchi","doi":"10.1109/BESC.2016.7804475","DOIUrl":null,"url":null,"abstract":"Sentiment analysis has become one of the fundamental research areas with an objective of estimating the polarity of text documents. While sentiment analysis requires rich training resources, the number of available labeled documents is limited. The proposed interpolative self-training model is an extension of self-training as one of the most common semi-supervised learning algorithms. The proposed method is based on enlarging learning documents by interpolating data in both the training and the test phase. The method also includes a weighting strategy for data selection in each iteration. The method is evaluated using four Twitter datasets for the task of sentiment analysis. The results indicate that the proposed self-training model successfully outperforms the baseline and the standard self-training approach.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BESC.2016.7804475","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Sentiment analysis has become one of the fundamental research areas with an objective of estimating the polarity of text documents. While sentiment analysis requires rich training resources, the number of available labeled documents is limited. The proposed interpolative self-training model is an extension of self-training as one of the most common semi-supervised learning algorithms. The proposed method is based on enlarging learning documents by interpolating data in both the training and the test phase. The method also includes a weighting strategy for data selection in each iteration. The method is evaluated using four Twitter datasets for the task of sentiment analysis. The results indicate that the proposed self-training model successfully outperforms the baseline and the standard self-training approach.