Eissa Alshari, A. Azman, S. Doraisamy, N. Mustapha, Mostafa Alkeshr
Published in: 2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP)
Publication date: 2018-03-01
DOI: 10.1109/INFRKM.2018.8464775 (https://doi.org/10.1109/INFRKM.2018.8464775)
Citations: 38 (Semantic Scholar)
Effective Method for Sentiment Lexical Dictionary Enrichment Based on Word2Vec for Sentiment Analysis
Recently, many researchers have shown interest in using lexical dictionaries for sentiment analysis. SentiWordNet is the most widely used sentiment lexicon for determining the polarity of texts. However, a large number of terms in the corpus vocabulary are absent from SentiWordNet owing to the curse of dimensionality, which limits the performance of sentiment analysis. This paper proposes a method that enlarges the set of opinion words by learning the polarity of the non-opinion words in the vocabulary based on SentiWordNet. The effectiveness of the method is evaluated on the Internet Movie Review Dataset. The results are promising, showing that the proposed Senti2Vec method can be more effective than SentiWordNet as a sentiment lexical resource.
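The abstract does not spell out how polarity is learned for out-of-lexicon words, so the following is only a minimal sketch of one plausible reading, not the authors' Senti2Vec procedure. It assumes that each unknown word inherits a similarity-weighted average of the polarities of its nearest seed words in embedding space. The toy vectors stand in for trained Word2Vec embeddings, the seed scores stand in for SentiWordNet polarities, and the names `infer_polarity`, `enrich`, and the parameter `k` are hypothetical.

```python
import math

# Toy 3-d vectors standing in for trained Word2Vec embeddings (made-up values).
EMBEDDINGS = {
    "good": [0.9, 0.1, 0.0],
    "great": [0.8, 0.2, 0.1],
    "bad": [-0.9, 0.1, 0.0],
    "awful": [-0.8, 0.2, 0.1],
    "superb": [0.85, 0.15, 0.05],     # not in the seed lexicon
    "dreadful": [-0.85, 0.15, 0.05],  # not in the seed lexicon
}

# Seed polarities in [-1, 1] standing in for SentiWordNet scores (made-up values).
SEED_LEXICON = {"good": 0.7, "great": 0.8, "bad": -0.7, "awful": -0.8}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def infer_polarity(word, embeddings, lexicon, k=2):
    """Assumed rule: similarity-weighted mean polarity of the k nearest seed words."""
    if word in lexicon:
        return lexicon[word]
    ranked = sorted(
        ((cosine(embeddings[word], embeddings[seed]), seed)
         for seed in lexicon if seed in embeddings),
        reverse=True,
    )[:k]
    # Ignore neighbours that point away from the word (non-positive similarity).
    neighbours = [(sim, seed) for sim, seed in ranked if sim > 0]
    total = sum(sim for sim, _ in neighbours)
    if total == 0:
        return 0.0  # no usable neighbour: treat the word as neutral
    return sum(sim * lexicon[seed] for sim, seed in neighbours) / total


def enrich(embeddings, lexicon, k=2):
    """Return a copy of the lexicon extended with every embedded word."""
    enriched = dict(lexicon)
    for word in embeddings:
        if word not in lexicon:
            enriched[word] = infer_polarity(word, embeddings, lexicon, k)
    return enriched


enriched = enrich(EMBEDDINGS, SEED_LEXICON)
for word in ("superb", "dreadful"):
    print(word, round(enriched[word], 3))
```

In practice the embeddings would come from a Word2Vec model trained on the target corpus and the seed scores from SentiWordNet. Nearest-neighbour averaging is just one way to transfer polarity; a supervised regressor trained on the seed words would be another.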