Dhanika Jeihan Aguinta, P. P. Adikara, R. Wihandika
{"title":"基于naïve贝叶斯分类器和Twitter上基于规则的意见目标检测的雅加达捷运情感分析","authors":"Dhanika Jeihan Aguinta, P. P. Adikara, R. Wihandika","doi":"10.1145/3427423.3427437","DOIUrl":null,"url":null,"abstract":"Jakarta Mass Rapid Transit (MRT) is a national project in Indonesia that has been operated since the beginning of 2019 for phase 1 and still under further development. People's opinions posted on Twitter concerning Jakarta MRT could become an evaluation material, viewed from the sentiment score and object of the opinion. To analyze the sentiment score, opinion sentiment classified using the naive Bayes classifier, followed by rule-based target opinion detection. The classification process is using bag of words (BoW) features and lexicon-based features. In the weighting process of the Lexicon-based features and detection of the target opinion, POS tagging is employed to get the word class. In determining the target opinion object, the POS tagging result is used to do chunking that has specific rules, specific to noun-phrase (NP) tags. Therefore, the obtained sentiment class and object become the target in the opinion. Using Naïve Bayes with the bag of words features and lexicon-based, we achieve precision 0,92, recall 1,0, f-measure 0,95, and accuracy 0,92. The results of the rule-based target opinion detection are 0.78, 0.85, 0.79, and 0.75 for precision, recall, f-measure, and accuracy, respectively.","PeriodicalId":120194,"journal":{"name":"Proceedings of the 5th International Conference on Sustainable Information Engineering and Technology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Sentiment analysis of mass rapid transit jakarta using naïve bayes classifier and rule-based opinion target detection on Twitter\",\"authors\":\"Dhanika Jeihan Aguinta, P. P. Adikara, R. Wihandika\",\"doi\":\"10.1145/3427423.3427437\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Jakarta Mass Rapid Transit (MRT) is a national project in Indonesia that has been operated since the beginning of 2019 for phase 1 and still under further development. People's opinions posted on Twitter concerning Jakarta MRT could become an evaluation material, viewed from the sentiment score and object of the opinion. To analyze the sentiment score, opinion sentiment classified using the naive Bayes classifier, followed by rule-based target opinion detection. The classification process is using bag of words (BoW) features and lexicon-based features. In the weighting process of the Lexicon-based features and detection of the target opinion, POS tagging is employed to get the word class. In determining the target opinion object, the POS tagging result is used to do chunking that has specific rules, specific to noun-phrase (NP) tags. Therefore, the obtained sentiment class and object become the target in the opinion. Using Naïve Bayes with the bag of words features and lexicon-based, we achieve precision 0,92, recall 1,0, f-measure 0,95, and accuracy 0,92. The results of the rule-based target opinion detection are 0.78, 0.85, 0.79, and 0.75 for precision, recall, f-measure, and accuracy, respectively.\",\"PeriodicalId\":120194,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Sustainable Information Engineering and Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Sustainable Information Engineering and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3427423.3427437\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Sustainable Information Engineering and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3427423.3427437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sentiment analysis of mass rapid transit jakarta using naïve bayes classifier and rule-based opinion target detection on Twitter
Jakarta Mass Rapid Transit (MRT) is a national project in Indonesia that has been operated since the beginning of 2019 for phase 1 and still under further development. People's opinions posted on Twitter concerning Jakarta MRT could become an evaluation material, viewed from the sentiment score and object of the opinion. To analyze the sentiment score, opinion sentiment classified using the naive Bayes classifier, followed by rule-based target opinion detection. The classification process is using bag of words (BoW) features and lexicon-based features. In the weighting process of the Lexicon-based features and detection of the target opinion, POS tagging is employed to get the word class. In determining the target opinion object, the POS tagging result is used to do chunking that has specific rules, specific to noun-phrase (NP) tags. Therefore, the obtained sentiment class and object become the target in the opinion. Using Naïve Bayes with the bag of words features and lexicon-based, we achieve precision 0,92, recall 1,0, f-measure 0,95, and accuracy 0,92. The results of the rule-based target opinion detection are 0.78, 0.85, 0.79, and 0.75 for precision, recall, f-measure, and accuracy, respectively.