Shaiful Bakhtiar bin Rodzman, Mohammad Hanif Rashid, N. K. Ismail, Nurazzah Abd Rahman, S. A. Aljunid, Hayati Abd Rahman
{"title":"Experiment with Lexicon Based Techniques on Domain-Specific Malay Document Sentiment Analysis","authors":"Shaiful Bakhtiar bin Rodzman, Mohammad Hanif Rashid, N. K. Ismail, Nurazzah Abd Rahman, S. A. Aljunid, Hayati Abd Rahman","doi":"10.1109/ISCAIE.2019.8743942","DOIUrl":null,"url":null,"abstract":"The nature of Sentiment Analysis (SA) mostly is generated by human beings. They expressed their emotion in writing or expressing their feeling via social media or blog. The Advancement of Internet and the increasing number of users in social media are the factors on why the sentiment analysis gaining its popularity in Malay languages. This research aims to implement the Sentiment Analysis on Malay language documents and propose a lexicon-based technique for Malay based sentiment analysis on specific domain such as Song, Politic and Product to find the best SA classifier on the Domain-Specific Malay Document Sentiment Analysis. Analysis of the evaluation result is based on the comparison of expert evaluation, Lexicon-based evaluation’s result and Naïve Bayes SA classification’s result, which is Naïve Bayes represent Machine Learning approach in this study. The result shows Lexicon-based Classification has outperformed Naïve Bayes SA classification in overall 3 topics which are Song, Politic and Product in average of 70% compared to 50% average for Naïve Bayes. For the future works, the researcher would like to improve in the particular area such as Sentiment Analysis based on the Malay dialect, increase the data in the dictionary and applying phrase level for better and optimum results.","PeriodicalId":369098,"journal":{"name":"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCAIE.2019.8743942","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The nature of Sentiment Analysis (SA) mostly is generated by human beings. They expressed their emotion in writing or expressing their feeling via social media or blog. The Advancement of Internet and the increasing number of users in social media are the factors on why the sentiment analysis gaining its popularity in Malay languages. This research aims to implement the Sentiment Analysis on Malay language documents and propose a lexicon-based technique for Malay based sentiment analysis on specific domain such as Song, Politic and Product to find the best SA classifier on the Domain-Specific Malay Document Sentiment Analysis. Analysis of the evaluation result is based on the comparison of expert evaluation, Lexicon-based evaluation’s result and Naïve Bayes SA classification’s result, which is Naïve Bayes represent Machine Learning approach in this study. The result shows Lexicon-based Classification has outperformed Naïve Bayes SA classification in overall 3 topics which are Song, Politic and Product in average of 70% compared to 50% average for Naïve Bayes. For the future works, the researcher would like to improve in the particular area such as Sentiment Analysis based on the Malay dialect, increase the data in the dictionary and applying phrase level for better and optimum results.