Interpolative self-training approach for sentiment analysis

2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC) Pub Date : 2016-11-01 DOI:10.1109/BESC.2016.7804475

S. Aghababaei, M. Makrehchi

引用次数: 5

Abstract

Sentiment analysis has become one of the fundamental research areas with an objective of estimating the polarity of text documents. While sentiment analysis requires rich training resources, the number of available labeled documents is limited. The proposed interpolative self-training model is an extension of self-training as one of the most common semi-supervised learning algorithms. The proposed method is based on enlarging learning documents by interpolating data in both the training and the test phase. The method also includes a weighting strategy for data selection in each iteration. The method is evaluated using four Twitter datasets for the task of sentiment analysis. The results indicate that the proposed self-training model successfully outperforms the baseline and the standard self-training approach.

查看原文本刊更多论文

情感分析的插值自我训练方法

情感分析以估计文本文档的极性为目的，已成为基础研究领域之一。虽然情感分析需要丰富的训练资源，但可用的标记文档数量有限。本文提出的插值自训练模型是自训练的扩展，是最常见的半监督学习算法之一。该方法通过在训练和测试阶段插入数据来扩大学习文档。该方法还包括在每次迭代中选择数据的加权策略。该方法使用四个Twitter数据集进行情感分析任务的评估。结果表明，所提出的自训练模型成功地优于基线和标准自训练方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)

自引率

0.00%

发文量