{"title":"Building a Chinese Slang Sentiment Lexicon Using Online Crowdsourcing Dictionaries","authors":"Binjun Jiang","doi":"10.1109/CONF-SPML54095.2021.00026","DOIUrl":null,"url":null,"abstract":"Microblogging platforms are now one of the most popular means of social media in China. Carrying sentiment analysis on those platforms can provide valuable insights for various uses. However, the heavy use of Internet slang in microblog contexts and the lack of slang vocabulary in sentiment lexicons make it problematic. Aimed at this issue, we propose a method to build a comprehensive sentiment lexicon for Chinese internet slang. We leverage online sources to acquire a list of slang words first. Then, a method based on SO-PMI (Semantic Orientation from Pointwise Mutual Information) is used to assign the sentiment polarity to each word. By Utilizing online sources, the slang lexicon has comprehensive coverage of internet slang. The sentiment categorization method based on SO-PMI guarantees the sentiment polarity we acquire from microblog flatforms is compatible with the same microblog context the lexicon aimed to analyze.","PeriodicalId":415094,"journal":{"name":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONF-SPML54095.2021.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Microblogging platforms are now one of the most popular means of social media in China. Carrying sentiment analysis on those platforms can provide valuable insights for various uses. However, the heavy use of Internet slang in microblog contexts and the lack of slang vocabulary in sentiment lexicons make it problematic. Aimed at this issue, we propose a method to build a comprehensive sentiment lexicon for Chinese internet slang. We leverage online sources to acquire a list of slang words first. Then, a method based on SO-PMI (Semantic Orientation from Pointwise Mutual Information) is used to assign the sentiment polarity to each word. By Utilizing online sources, the slang lexicon has comprehensive coverage of internet slang. The sentiment categorization method based on SO-PMI guarantees the sentiment polarity we acquire from microblog flatforms is compatible with the same microblog context the lexicon aimed to analyze.