Improved Identification of Negative Tweets related to Covid-19 Vaccination by Mitigating Class Imbalance

Naman Bhoj, Mayank Khari, Bishwajeet K. Pandey
{"title":"Improved Identification of Negative Tweets related to Covid-19 Vaccination by Mitigating Class Imbalance","authors":"Naman Bhoj, Mayank Khari, Bishwajeet K. Pandey","doi":"10.1109/CICN51697.2021.9574664","DOIUrl":null,"url":null,"abstract":"With an exponential rise in the number of cases of Covid-19, researchers have been painstakingly focused towards developing an effective vaccine. Consequently, there has been ongoing discussion about the vaccine on the social media platform filled with positive and negative sentiments. In this paper, we narrow down our research space by focusing on only identifying tweets imparting negative sentiment towards vaccines on social media. This identification model holds vital importance for government and medical agencies as it can help them analyse the possible reasons or causes behind the negative sentiment via tweets. Empirical results of the experiments conducted in this paper indicated that Support Vector Machine is best suited to identify negative tweets on a balanced dataset with the highest F1-Score of 87.179, and K-Nearest Neighbour shows the highest improvement after mitigating class imbalance using Edited Nearest Neighbour, which indicates the class dependency of distance based methods.","PeriodicalId":224313,"journal":{"name":"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICN51697.2021.9574664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

With an exponential rise in the number of cases of Covid-19, researchers have been painstakingly focused towards developing an effective vaccine. Consequently, there has been ongoing discussion about the vaccine on the social media platform filled with positive and negative sentiments. In this paper, we narrow down our research space by focusing on only identifying tweets imparting negative sentiment towards vaccines on social media. This identification model holds vital importance for government and medical agencies as it can help them analyse the possible reasons or causes behind the negative sentiment via tweets. Empirical results of the experiments conducted in this paper indicated that Support Vector Machine is best suited to identify negative tweets on a balanced dataset with the highest F1-Score of 87.179, and K-Nearest Neighbour shows the highest improvement after mitigating class imbalance using Edited Nearest Neighbour, which indicates the class dependency of distance based methods.
通过减轻类别不平衡改进与Covid-19疫苗接种相关的负面推文识别
随着新冠肺炎病例数量呈指数级增长,研究人员一直在努力开发有效的疫苗。因此,在社交媒体平台上,关于疫苗的讨论一直在进行,褒贬不一。在本文中,我们通过只关注识别社交媒体上对疫苗产生负面情绪的推文来缩小我们的研究空间。这种识别模型对于政府和医疗机构来说至关重要,因为它可以帮助他们分析推文负面情绪背后可能的原因或原因。本文实验的实证结果表明,支持向量机最适合在平衡数据集上识别负面推文,其f1得分最高为87.179,而k近邻在使用编辑近邻缓解类不平衡后表现出最高的改进,这表明基于距离的方法具有类依赖性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信