N. M. Sabri, Jasmine Nor Azman Norman, Norulhidayah Isa, Ummu Fatihah Mohd Bahrin
{"title":"Sentiment Analysis On Covid-19 Outbreak Awareness Using Naïve Bayes Algorithm","authors":"N. M. Sabri, Jasmine Nor Azman Norman, Norulhidayah Isa, Ummu Fatihah Mohd Bahrin","doi":"10.1109/IVIT55443.2022.10033379","DOIUrl":null,"url":null,"abstract":"Sentiment analysis has gained much attention nowadays among the researchers especially during the Covid-19 pandemic. Due to the increasing volume of data coming from the social media platforms, researchers have been using sentiment analysis to analyse topics regarding commercial products, daily issues among the society and also to detect important events from the community. Since the social media users are consisting of the community, content that are shared could also be used to detect possible situational hazard such as the outbreak of Covid-19 in advanced. The result from the sentiment analysis could be beneficial to government organizations in order to contain the outbreaks and public health crisis related to Covid-19. The objective of this research is to explore Naive Bayes algorithm for the sentiment analysis on the Covid-19 outbreak awareness based on Twitter data. In this research, the data were collected during the Malaysia's second lock down, which was between the months of April to June 2021 using the Twitter API Tweepy. After the pre-processing and feature extraction stages, the data have been divided into the training and testing dataset for the Naive Bayes sentiment classification. The result has shown that Naive Bayes has been able to generate high performance with more than 90% accuracy for this classification problem. Future work would include the improvement of data preprocessing, more balance of dataset, enhancement of the algorithm and also comparing the performance with other well-known classification algorithms.","PeriodicalId":325667,"journal":{"name":"2022 International Visualization, Informatics and Technology Conference (IVIT)","volume":"76 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Visualization, Informatics and Technology Conference (IVIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IVIT55443.2022.10033379","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sentiment analysis has gained much attention nowadays among the researchers especially during the Covid-19 pandemic. Due to the increasing volume of data coming from the social media platforms, researchers have been using sentiment analysis to analyse topics regarding commercial products, daily issues among the society and also to detect important events from the community. Since the social media users are consisting of the community, content that are shared could also be used to detect possible situational hazard such as the outbreak of Covid-19 in advanced. The result from the sentiment analysis could be beneficial to government organizations in order to contain the outbreaks and public health crisis related to Covid-19. The objective of this research is to explore Naive Bayes algorithm for the sentiment analysis on the Covid-19 outbreak awareness based on Twitter data. In this research, the data were collected during the Malaysia's second lock down, which was between the months of April to June 2021 using the Twitter API Tweepy. After the pre-processing and feature extraction stages, the data have been divided into the training and testing dataset for the Naive Bayes sentiment classification. The result has shown that Naive Bayes has been able to generate high performance with more than 90% accuracy for this classification problem. Future work would include the improvement of data preprocessing, more balance of dataset, enhancement of the algorithm and also comparing the performance with other well-known classification algorithms.
尤其是在新冠肺炎疫情期间,情绪分析受到了研究人员的广泛关注。由于来自社交媒体平台的数据量不断增加,研究人员一直在使用情感分析来分析有关商业产品的主题,社会中的日常问题以及从社区中发现重要事件。由于社交媒体用户是由社区组成的,因此分享的内容也可以用于提前发现新冠疫情等可能出现的情况危险。情绪分析的结果可能对政府机构有利,以便控制与新冠肺炎相关的疫情和公共卫生危机。本研究的目的是探索基于Twitter数据的新冠肺炎疫情意识情绪分析的朴素贝叶斯算法。在这项研究中,数据是在马来西亚第二次封锁期间收集的,这是在2021年4月至6月期间使用Twitter API Tweepy收集的。经过预处理和特征提取阶段,将数据分为朴素贝叶斯情感分类的训练和测试数据集。结果表明,对于这个分类问题,朴素贝叶斯已经能够产生超过90%准确率的高性能。未来的工作将包括改进数据预处理,更加平衡数据集,增强算法,并与其他知名分类算法进行性能比较。