{"title":"A Review on Sentiment Analysis for Code-Mix Chinese and English Text on Social Media","authors":"Kong Hua Lim, T. Lim","doi":"10.56453/icdxa.2020.1001","DOIUrl":null,"url":null,"abstract":"Social media is rich with opinions. Millions of people shared their thoughts on products, services and events on Social Media Sites (SMS). Digital marketers extract and analyse content from SMS so that they know how best to promote their products or services to potential buyers. Government can get feedback from citizens about policies they have implemented. Works here reviews numerous sentiment analysis research works that study code-mix posts and comments that were expressed in formal and informal languages with a code-mix of Chinese and English or English and Hindi. Research in code-mix English and Hindi sentiment analysis are reviewed to provide some insights for application in code-mix Chinese and English. Raw data collected will be pre-processed into structured representation. Works here will discuss sentiment analysis that adopts lexicon approach, machine learning and combination of both. Works here will highlight translation and non-translation approaches used to analyse code-mix text. Discussion about propose solution for further exploration is discussion in a section. Critical remarks and a concluding section will be presented at the end of the paper. Keywords: code-mix, machine learning, lexicon","PeriodicalId":216696,"journal":{"name":"Conference Proceedings: International Conference on Digital Transformation and Applications (ICDXA 2020)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Proceedings: International Conference on Digital Transformation and Applications (ICDXA 2020)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56453/icdxa.2020.1001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Social media is rich with opinions. Millions of people shared their thoughts on products, services and events on Social Media Sites (SMS). Digital marketers extract and analyse content from SMS so that they know how best to promote their products or services to potential buyers. Government can get feedback from citizens about policies they have implemented. Works here reviews numerous sentiment analysis research works that study code-mix posts and comments that were expressed in formal and informal languages with a code-mix of Chinese and English or English and Hindi. Research in code-mix English and Hindi sentiment analysis are reviewed to provide some insights for application in code-mix Chinese and English. Raw data collected will be pre-processed into structured representation. Works here will discuss sentiment analysis that adopts lexicon approach, machine learning and combination of both. Works here will highlight translation and non-translation approaches used to analyse code-mix text. Discussion about propose solution for further exploration is discussion in a section. Critical remarks and a concluding section will be presented at the end of the paper. Keywords: code-mix, machine learning, lexicon