Qiryn Adriana Kharul Zaman, Wan Nur Syahidah Wan Yusoff, Qistina Batrisyia Azman Shah
{"title":"关于马来西亚兴趣所在的情感分析","authors":"Qiryn Adriana Kharul Zaman, Wan Nur Syahidah Wan Yusoff, Qistina Batrisyia Azman Shah","doi":"10.37934/araset.43.1.5465","DOIUrl":null,"url":null,"abstract":"This study focuses on utilizing machine learning methods for sentiment analysis to identify positive and negative comments regarding Malaysian Places of Interest (PoI). The data was collected from Twitter using social media monitoring software and organized into tables. Pre-processing techniques and Natural Language Processing (NLP) methods were applied to handle missing values and prepare the text data for analysis. The dataset was then split into training and testing sets, and three supervised learning algorithms which are Support Vector Machine, Random Forest, and Naive Bayes were employed to evaluate the sentiment analysis models. The performance of each model was compared, and it was found that Support Vector Machine achieved the highest accuracy, recall score, F1 score, and precision score. This study demonstrates the potential to extend sentiment analysis to analyze sentiments expressed in texts written in the Malay language by utilizing the Malaya corpus. Additionally, visual dashboards can be created to present the findings and provide recommendations based on the insights gathered from the sentiment analysis of PoI feedback.","PeriodicalId":506443,"journal":{"name":"Journal of Advanced Research in Applied Sciences and Engineering Technology","volume":"30 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment Analysis on The Place of Interest in Malaysia\",\"authors\":\"Qiryn Adriana Kharul Zaman, Wan Nur Syahidah Wan Yusoff, Qistina Batrisyia Azman Shah\",\"doi\":\"10.37934/araset.43.1.5465\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study focuses on utilizing machine learning methods for sentiment analysis to identify positive and negative comments regarding Malaysian Places of Interest (PoI). The data was collected from Twitter using social media monitoring software and organized into tables. Pre-processing techniques and Natural Language Processing (NLP) methods were applied to handle missing values and prepare the text data for analysis. The dataset was then split into training and testing sets, and three supervised learning algorithms which are Support Vector Machine, Random Forest, and Naive Bayes were employed to evaluate the sentiment analysis models. The performance of each model was compared, and it was found that Support Vector Machine achieved the highest accuracy, recall score, F1 score, and precision score. This study demonstrates the potential to extend sentiment analysis to analyze sentiments expressed in texts written in the Malay language by utilizing the Malaya corpus. Additionally, visual dashboards can be created to present the findings and provide recommendations based on the insights gathered from the sentiment analysis of PoI feedback.\",\"PeriodicalId\":506443,\"journal\":{\"name\":\"Journal of Advanced Research in Applied Sciences and Engineering Technology\",\"volume\":\"30 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Advanced Research in Applied Sciences and Engineering Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37934/araset.43.1.5465\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advanced Research in Applied Sciences and Engineering Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37934/araset.43.1.5465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本研究的重点是利用机器学习方法进行情感分析,以识别有关马来西亚景点(PoI)的正面和负面评论。数据通过社交媒体监测软件从 Twitter 收集并整理成表格。预处理技术和自然语言处理(NLP)方法用于处理缺失值,并为分析准备文本数据。然后将数据集分成训练集和测试集,并采用支持向量机、随机森林和奈维贝叶斯三种监督学习算法来评估情感分析模型。对每个模型的性能进行比较后发现,支持向量机的准确率、召回分数、F1 分数和精确度分数都是最高的。这项研究表明,通过利用马来亚语语料库,可以将情感分析扩展到分析马来语文本中表达的情感。此外,还可以创建可视化仪表板来展示研究结果,并根据从 PoI 反馈的情感分析中收集到的见解提供建议。
Sentiment Analysis on The Place of Interest in Malaysia
This study focuses on utilizing machine learning methods for sentiment analysis to identify positive and negative comments regarding Malaysian Places of Interest (PoI). The data was collected from Twitter using social media monitoring software and organized into tables. Pre-processing techniques and Natural Language Processing (NLP) methods were applied to handle missing values and prepare the text data for analysis. The dataset was then split into training and testing sets, and three supervised learning algorithms which are Support Vector Machine, Random Forest, and Naive Bayes were employed to evaluate the sentiment analysis models. The performance of each model was compared, and it was found that Support Vector Machine achieved the highest accuracy, recall score, F1 score, and precision score. This study demonstrates the potential to extend sentiment analysis to analyze sentiments expressed in texts written in the Malay language by utilizing the Malaya corpus. Additionally, visual dashboards can be created to present the findings and provide recommendations based on the insights gathered from the sentiment analysis of PoI feedback.