Economic Impact due Covid-19 Pandemic: Sentiment Analysis on Twitter Using Naïve Bayes Classifier and Support Vector Machine

Q3 Decision Sciences
Qurrotul Aini, Raffie Rizky Fauzi, Eva Khudzaeva
{"title":"Economic Impact due Covid-19 Pandemic: Sentiment Analysis on Twitter Using Naïve Bayes Classifier and Support Vector Machine","authors":"Qurrotul Aini, Raffie Rizky Fauzi, Eva Khudzaeva","doi":"10.30630/joiv.7.3.1474","DOIUrl":null,"url":null,"abstract":"Covid-19 is an outbreak caused by severe acute respiratory syndrome. Covid-19 first appeared in Indonesia on March 2, 2020, with two confirmed cases and increased to 1285 cases in 30 provinces. One of the impacts of the Covid-19 pandemic is on the economic aspect, which has experienced a drastic decline in income. This study aims to classify public opinion to determine the level of public sentiment on the economic impact of the Covid-19 pandemic and to identify parameters that influence the accuracy of the sentiment analysis classification model. The methods used in this current research are Lexicon, Support Vector Machine (SVM), and Naive Bayes Classifier (NBC). First, Lexicon is used for scoring and labeling the preprocessed data. Second, SVM is used to classify the sentiment, then find the best accuracy using linear, radial, polynomial, and sigmoid kernels. Third, NBC is used to classify sentiment as a comparison method. The results indicated that 255 tweet data consisted of 44 positive tweets (17.25%), 46 neutral tweets (18.04%), and 165 negative tweets (64.71%). Therefore, it can be inferred that the economic impact on the Indonesian people due to the Covid-19 pandemic has a high negative sentiment value. In the performance, SVM yielded a better accuracy of 100%, precision, recall, and F-measure are 1. This study proves that selecting the kernel type and applying underfitting can improve the accuracy of SVM. Also, SVM can perform well on a small amount of training data.","PeriodicalId":32468,"journal":{"name":"JOIV International Journal on Informatics Visualization","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JOIV International Journal on Informatics Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30630/joiv.7.3.1474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Decision Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Covid-19 is an outbreak caused by severe acute respiratory syndrome. Covid-19 first appeared in Indonesia on March 2, 2020, with two confirmed cases and increased to 1285 cases in 30 provinces. One of the impacts of the Covid-19 pandemic is on the economic aspect, which has experienced a drastic decline in income. This study aims to classify public opinion to determine the level of public sentiment on the economic impact of the Covid-19 pandemic and to identify parameters that influence the accuracy of the sentiment analysis classification model. The methods used in this current research are Lexicon, Support Vector Machine (SVM), and Naive Bayes Classifier (NBC). First, Lexicon is used for scoring and labeling the preprocessed data. Second, SVM is used to classify the sentiment, then find the best accuracy using linear, radial, polynomial, and sigmoid kernels. Third, NBC is used to classify sentiment as a comparison method. The results indicated that 255 tweet data consisted of 44 positive tweets (17.25%), 46 neutral tweets (18.04%), and 165 negative tweets (64.71%). Therefore, it can be inferred that the economic impact on the Indonesian people due to the Covid-19 pandemic has a high negative sentiment value. In the performance, SVM yielded a better accuracy of 100%, precision, recall, and F-measure are 1. This study proves that selecting the kernel type and applying underfitting can improve the accuracy of SVM. Also, SVM can perform well on a small amount of training data.
Covid-19大流行对经济的影响:使用Naïve贝叶斯分类器和支持向量机对Twitter的情绪分析
Covid-19是由严重急性呼吸系统综合征引起的疫情。新冠肺炎于2020年3月2日首次在印度尼西亚出现,确诊2例,并在30个省份增加到1285例。2019冠状病毒病大流行的影响之一是经济方面,收入急剧下降。本研究旨在对公众意见进行分类,以确定公众对Covid-19大流行经济影响的情绪水平,并确定影响情绪分析分类模型准确性的参数。目前研究中使用的方法有Lexicon、支持向量机(SVM)和朴素贝叶斯分类器(NBC)。首先,使用Lexicon对预处理数据进行评分和标记。其次,使用支持向量机对情感进行分类,然后使用线性、径向、多项式和s型核找到最佳精度。第三,将NBC作为一种比较方法对情绪进行分类。结果表明,255条推文数据中,正面推文44条(17.25%),中性推文46条(18.04%),负面推文165条(64.71%)。因此,可以推断,新冠肺炎疫情对印尼民众的经济影响具有很高的负面情绪值。在性能上,SVM的准确率为100%,精密度、召回率和F-measure均为1。研究证明,选择核类型并应用欠拟合可以提高支持向量机的准确率。同时,SVM在少量的训练数据上也有很好的表现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
JOIV International Journal on Informatics Visualization
JOIV International Journal on Informatics Visualization Decision Sciences-Information Systems and Management
CiteScore
1.40
自引率
0.00%
发文量
100
审稿时长
16 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信