From Opinion Lexicons to Sentiment Classification of Tweets and Vice Versa: A Transfer Learning Approach

2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI) Pub Date : 2016-10-01 DOI:10.1109/WI.2016.29

Felipe Bravo-Marquez, E. Frank, B. Pfahringer

{"title":"From Opinion Lexicons to Sentiment Classification of Tweets and Vice Versa: A Transfer Learning Approach","authors":"Felipe Bravo-Marquez, E. Frank, B. Pfahringer","doi":"10.1109/WI.2016.29","DOIUrl":null,"url":null,"abstract":"Message-level and word-level polarity classification are two popular tasks in Twitter sentiment analysis. They have been commonly addressed by training supervised models from labelled data. The main limitation of these models is the high cost of data annotation. Transferring existing labels from a related problem domain is one possible solution for this problem. In this paper, we propose a simple model for transferring sentiment labels from words to tweets and vice versa by representing both tweets and words using feature vectors residing in the same feature space. Tweets are represented by standard NLP features such as unigrams and part-of-speech tags. Words are represented by averaging the vectors of the tweets in which they occur. We evaluate our approach in two transfer learning problems: 1) training a tweet-level polarity classifier from a polarity lexicon, and 2) inducing a polarity lexicon from a collection of polarity-annotated tweets. Our results show that the proposed approach can successfully classify words and tweets after transfer.","PeriodicalId":6513,"journal":{"name":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"52 1","pages":"145-152"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI.2016.29","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

Abstract

Message-level and word-level polarity classification are two popular tasks in Twitter sentiment analysis. They have been commonly addressed by training supervised models from labelled data. The main limitation of these models is the high cost of data annotation. Transferring existing labels from a related problem domain is one possible solution for this problem. In this paper, we propose a simple model for transferring sentiment labels from words to tweets and vice versa by representing both tweets and words using feature vectors residing in the same feature space. Tweets are represented by standard NLP features such as unigrams and part-of-speech tags. Words are represented by averaging the vectors of the tweets in which they occur. We evaluate our approach in two transfer learning problems: 1) training a tweet-level polarity classifier from a polarity lexicon, and 2) inducing a polarity lexicon from a collection of polarity-annotated tweets. Our results show that the proposed approach can successfully classify words and tweets after transfer.

查看原文本刊更多论文

从观点词汇到推文的情感分类，反之亦然:一种迁移学习方法

消息级和词级极性分类是Twitter情感分析中的两个常用任务。它们通常通过从标记数据中训练监督模型来解决。这些模型的主要限制是数据注释的高成本。从相关问题领域转移现有标签是解决此问题的一种可能方法。在本文中，我们提出了一个简单的模型，通过使用驻留在相同特征空间中的特征向量表示tweet和单词，将情感标签从单词转移到tweet，反之亦然。推文由标准的NLP特征表示，如单字符和词性标记。单词是通过对它们出现的tweet的向量进行平均来表示的。我们在两个迁移学习问题中评估了我们的方法:1)从极性词典中训练推文级极性分类器，以及2)从极性注释的推文集合中诱导极性词典。实验结果表明，该方法可以成功地对迁移后的词和推文进行分类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI)

自引率

0.00%

发文量