Authors: Heba M. Ismail, Nazar Zaki, B. Belkhouche
Venue: 2016 3rd International Conference on Soft Computing & Machine Intelligence (ISCMI)
Publication date: 2016-11-01
DOI: 10.1109/ISCMI.2016.56 (https://doi.org/10.1109/ISCMI.2016.56)
Using Custom Fuzzy Thesaurus to Incorporate Semantic and Reduce Data Sparsity for Twitter Sentiment Analysis
Considerable research effort has been devoted to Twitter sentiment analysis in recent years. Given Twitter's informal writing style, users can draw on an endless variety of sound vocabulary, slogans, emoticons and special characters to express an opinion in a maximum of 140 characters. The resulting sparsity makes training machine learning classifiers on Twitter data highly challenging. In this work we propose replacing Twitter slogans with their sentiment and incorporating a fuzzy thesaurus into Twitter sentiment classification, in order to incorporate semantics and to address the sparsity problem. The experimental results show that the proposed method consistently outperforms the baselines as well as some methods in the literature.
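The abstract does not give implementation details, so the following Python sketch is only one plausible reading of the two ideas it names: sentiment replacement and a fuzzy thesaurus. Everything in it is an assumption made for illustration, not the authors' method: the `SENTIMENT_TOKENS` lexicon, the function names, and the choice of a Jaccard-style co-occurrence score as the fuzzy membership degree.

```python
from collections import Counter
from itertools import combinations

# Hypothetical lexicon mapping informal terms to sentiment tokens.
# The entries are illustrative and do not come from the paper.
SENTIMENT_TOKENS = {":)": "POSITIVE", ":(": "NEGATIVE",
                    "lol": "POSITIVE", "ugh": "NEGATIVE"}

def replace_sentiment_terms(tweet):
    """Lowercase and split a tweet, collapsing known informal terms
    into a shared sentiment token to shrink the vocabulary."""
    return [SENTIMENT_TOKENS.get(tok, tok) for tok in tweet.lower().split()]

def build_fuzzy_thesaurus(docs):
    """Return {(a, b): degree} where degree is the Jaccard overlap of the
    sets of documents containing a and b (an assumed membership function)."""
    doc_freq = Counter()
    pair_freq = Counter()
    for tokens in docs:
        vocab = sorted(set(tokens))
        doc_freq.update(vocab)
        pair_freq.update(combinations(vocab, 2))
    thesaurus = {}
    for (a, b), n_ab in pair_freq.items():
        degree = n_ab / (doc_freq[a] + doc_freq[b] - n_ab)
        thesaurus[(a, b)] = degree
        thesaurus[(b, a)] = degree
    return thesaurus

def expand_features(tokens, thesaurus, vocabulary):
    """Weight each vocabulary term: 1.0 if present in the tweet, otherwise
    its strongest fuzzy relation to any term that is present."""
    present = set(tokens)
    weights = {}
    for term in vocabulary:
        if term in present:
            weights[term] = 1.0
        else:
            weights[term] = max(
                (thesaurus.get((term, t), 0.0) for t in present),
                default=0.0,
            )
    return weights

# Toy corpus, invented for this example.
docs = [replace_sentiment_terms(t)
        for t in ("good movie :)", "great movie lol", "bad movie :(")]
thesaurus = build_fuzzy_thesaurus(docs)
features = expand_features(["great"], thesaurus, ["great", "POSITIVE", "good"])
```

In this toy run, a tweet containing only "great" still receives a nonzero weight for the `POSITIVE` feature because the two terms co-occur in the corpus. That is the sense in which a thesaurus of this kind could make sparse feature vectors denser.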