Sentiment Analysis On Covid-19 Outbreak Awareness Using Naïve Bayes Algorithm

N. M. Sabri, Jasmine Nor Azman Norman, Norulhidayah Isa, Ummu Fatihah Mohd Bahrin
{"title":"Sentiment Analysis On Covid-19 Outbreak Awareness Using Naïve Bayes Algorithm","authors":"N. M. Sabri, Jasmine Nor Azman Norman, Norulhidayah Isa, Ummu Fatihah Mohd Bahrin","doi":"10.1109/IVIT55443.2022.10033379","DOIUrl":null,"url":null,"abstract":"Sentiment analysis has gained much attention nowadays among the researchers especially during the Covid-19 pandemic. Due to the increasing volume of data coming from the social media platforms, researchers have been using sentiment analysis to analyse topics regarding commercial products, daily issues among the society and also to detect important events from the community. Since the social media users are consisting of the community, content that are shared could also be used to detect possible situational hazard such as the outbreak of Covid-19 in advanced. The result from the sentiment analysis could be beneficial to government organizations in order to contain the outbreaks and public health crisis related to Covid-19. The objective of this research is to explore Naive Bayes algorithm for the sentiment analysis on the Covid-19 outbreak awareness based on Twitter data. In this research, the data were collected during the Malaysia's second lock down, which was between the months of April to June 2021 using the Twitter API Tweepy. After the pre-processing and feature extraction stages, the data have been divided into the training and testing dataset for the Naive Bayes sentiment classification. The result has shown that Naive Bayes has been able to generate high performance with more than 90% accuracy for this classification problem. Future work would include the improvement of data preprocessing, more balance of dataset, enhancement of the algorithm and also comparing the performance with other well-known classification algorithms.","PeriodicalId":325667,"journal":{"name":"2022 International Visualization, Informatics and Technology Conference (IVIT)","volume":"76 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Visualization, Informatics and Technology Conference (IVIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IVIT55443.2022.10033379","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Sentiment analysis has gained much attention nowadays among the researchers especially during the Covid-19 pandemic. Due to the increasing volume of data coming from the social media platforms, researchers have been using sentiment analysis to analyse topics regarding commercial products, daily issues among the society and also to detect important events from the community. Since the social media users are consisting of the community, content that are shared could also be used to detect possible situational hazard such as the outbreak of Covid-19 in advanced. The result from the sentiment analysis could be beneficial to government organizations in order to contain the outbreaks and public health crisis related to Covid-19. The objective of this research is to explore Naive Bayes algorithm for the sentiment analysis on the Covid-19 outbreak awareness based on Twitter data. In this research, the data were collected during the Malaysia's second lock down, which was between the months of April to June 2021 using the Twitter API Tweepy. After the pre-processing and feature extraction stages, the data have been divided into the training and testing dataset for the Naive Bayes sentiment classification. The result has shown that Naive Bayes has been able to generate high performance with more than 90% accuracy for this classification problem. Future work would include the improvement of data preprocessing, more balance of dataset, enhancement of the algorithm and also comparing the performance with other well-known classification algorithms.
基于Naïve贝叶斯算法的Covid-19疫情感知情绪分析
尤其是在新冠肺炎疫情期间,情绪分析受到了研究人员的广泛关注。由于来自社交媒体平台的数据量不断增加,研究人员一直在使用情感分析来分析有关商业产品的主题,社会中的日常问题以及从社区中发现重要事件。由于社交媒体用户是由社区组成的,因此分享的内容也可以用于提前发现新冠疫情等可能出现的情况危险。情绪分析的结果可能对政府机构有利,以便控制与新冠肺炎相关的疫情和公共卫生危机。本研究的目的是探索基于Twitter数据的新冠肺炎疫情意识情绪分析的朴素贝叶斯算法。在这项研究中,数据是在马来西亚第二次封锁期间收集的,这是在2021年4月至6月期间使用Twitter API Tweepy收集的。经过预处理和特征提取阶段,将数据分为朴素贝叶斯情感分类的训练和测试数据集。结果表明,对于这个分类问题,朴素贝叶斯已经能够产生超过90%准确率的高性能。未来的工作将包括改进数据预处理,更加平衡数据集,增强算法,并与其他知名分类算法进行性能比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信