Jorge E. Camargo, Vladimir Vargas-Calderón, Nelson Vargas, Liliana Calderón-Benavides
{"title":"Sentiment polarity classification of tweets using a extended dictionary","authors":"Jorge E. Camargo, Vladimir Vargas-Calderón, Nelson Vargas, Liliana Calderón-Benavides","doi":"10.4114/INTARTIF.VOL21ISS62PP1-12","DOIUrl":null,"url":null,"abstract":"With the purpose of classifying text based on its sentiment polarity (positive or negative), we proposed an extension of a 68,000 tweets corpus through the inclusion of word definitions from a dictionary of the Real Academia Espa\\~{n}ola de la Lengua (RAE). A set of 28,000 combinations of 6 Word2Vec and support vector machine parameters were considered in order to evaluate how positively would affect the inclusion of a RAE's dictionary definitions classification performance. We found that such a corpus extension significantly improve the classification accuracy. Therefore, we conclude that the inclusion of a RAE's dictionary increases the semantic relations learned by Word2Vec allowing a better classification accuracy.","PeriodicalId":176050,"journal":{"name":"Inteligencia Artif.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Inteligencia Artif.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4114/INTARTIF.VOL21ISS62PP1-12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
With the purpose of classifying text based on its sentiment polarity (positive or negative), we proposed an extension of a 68,000 tweets corpus through the inclusion of word definitions from a dictionary of the Real Academia Espa\~{n}ola de la Lengua (RAE). A set of 28,000 combinations of 6 Word2Vec and support vector machine parameters were considered in order to evaluate how positively would affect the inclusion of a RAE's dictionary definitions classification performance. We found that such a corpus extension significantly improve the classification accuracy. Therefore, we conclude that the inclusion of a RAE's dictionary increases the semantic relations learned by Word2Vec allowing a better classification accuracy.
为了根据文本的情感极性(积极或消极)对其进行分类,我们提出了一个68,000条推文语料库的扩展,该语料库包含了Real Academia Espa\ {n}ola de la Lengua (RAE)字典中的单词定义。考虑了6个Word2Vec和支持向量机参数的28000个组合,以评估将如何积极地影响RAE的字典定义分类性能。我们发现这样的语料库扩展显著提高了分类精度。因此,我们得出结论,包含RAE的字典增加了Word2Vec学习的语义关系,从而提高了分类精度。