Impact of the COVID-19 pandemic on the expression of emotions in social media

Debabrata Ghosh
{"title":"Impact of the COVID-19 pandemic on the expression of emotions in social media","authors":"Debabrata Ghosh","doi":"10.22367/mcdm.2020.15.02","DOIUrl":null,"url":null,"abstract":"In the age of social media, every second thousands of messages are exchanged. Analyzing those unstructured data to find out specific emotions is a challenging task. Analysis of emotions involves evaluation and classification of text into emotion classes such as Happy, Sad, Anger, Disgust, Fear, Surprise, as defined by emotion dimensional models which are described in the theory of psychology (www 1; Russell, 2005). The main goal of this paper is to cover the COVID-19 pandemic situation in India and its impact on human emotions. As people very often express their state of the mind through social media, analyzing and tracking their emotions can be very effective for government and local authorities to take required measures. We have analyzed different machine learning classification models, such as Naïve Bayes, Support Vector Machine, Random Forest Classifier, Decision Tree and Logistic Regression with 10-fold cross validation to find out top ML models for emotion classification. After tuning the Hyperparameter, we got Logistic regression as the best suited model with accuracy 77% with the given datasets. We worked on algorithm based supervised ML technique to get the expected result. Although multiple studies were conducted earlier along the same lines, none of them performed comparative study among different ML techniques or hyperparameter tuning to optimize the results. Besides, this study has been done on the dataset of the most recent COVID-19 pandemic situation, which is itself unique. We captured Twitter data for a duration of 45 days with hashtag #COVID19India OR #COVID19 and analyzed the data using Logistic Regression to find out how the emotion changed over time based on certain social factors. Keywords: classification, COVID-19, emotion, emotion analysis, Naïve Bayes, Pandemic, Random Forest, SVM.","PeriodicalId":259350,"journal":{"name":"Multiple Criteria Decision Making","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multiple Criteria Decision Making","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22367/mcdm.2020.15.02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In the age of social media, every second thousands of messages are exchanged. Analyzing those unstructured data to find out specific emotions is a challenging task. Analysis of emotions involves evaluation and classification of text into emotion classes such as Happy, Sad, Anger, Disgust, Fear, Surprise, as defined by emotion dimensional models which are described in the theory of psychology (www 1; Russell, 2005). The main goal of this paper is to cover the COVID-19 pandemic situation in India and its impact on human emotions. As people very often express their state of the mind through social media, analyzing and tracking their emotions can be very effective for government and local authorities to take required measures. We have analyzed different machine learning classification models, such as Naïve Bayes, Support Vector Machine, Random Forest Classifier, Decision Tree and Logistic Regression with 10-fold cross validation to find out top ML models for emotion classification. After tuning the Hyperparameter, we got Logistic regression as the best suited model with accuracy 77% with the given datasets. We worked on algorithm based supervised ML technique to get the expected result. Although multiple studies were conducted earlier along the same lines, none of them performed comparative study among different ML techniques or hyperparameter tuning to optimize the results. Besides, this study has been done on the dataset of the most recent COVID-19 pandemic situation, which is itself unique. We captured Twitter data for a duration of 45 days with hashtag #COVID19India OR #COVID19 and analyzed the data using Logistic Regression to find out how the emotion changed over time based on certain social factors. Keywords: classification, COVID-19, emotion, emotion analysis, Naïve Bayes, Pandemic, Random Forest, SVM.
COVID-19大流行对社交媒体情绪表达的影响
在社交媒体时代,每秒钟都有数千条信息被交换。分析这些非结构化数据以找出特定的情绪是一项具有挑战性的任务。情绪分析包括根据心理学理论中描述的情绪维度模型对文本进行评估和分类,如快乐、悲伤、愤怒、厌恶、恐惧、惊讶等情绪类别。罗素,2005)。本文的主要目标是报道2019冠状病毒病在印度的流行情况及其对人类情绪的影响。由于人们经常通过社交媒体表达自己的心理状态,分析和跟踪他们的情绪可以非常有效地帮助政府和地方当局采取必要的措施。我们分析了不同的机器学习分类模型,如Naïve贝叶斯,支持向量机,随机森林分类器,决策树和逻辑回归,并进行了10倍交叉验证,以找出最适合情感分类的ML模型。在调整了超参数之后,我们得到了逻辑回归作为给定数据集的最适合模型,准确率为77%。我们研究了基于算法的监督式机器学习技术,以获得预期的结果。尽管之前沿着同一路线进行了多项研究,但没有一项研究在不同的ML技术或超参数调优之间进行比较研究以优化结果。此外,本研究是在最新的COVID-19大流行情况数据集上进行的,这本身就是独特的。我们用# covid - 19 india或# covid - 19标签捕获了45天的推特数据,并使用逻辑回归分析了数据,以找出基于某些社会因素的情绪如何随着时间的推移而变化。关键词:分类,COVID-19,情绪,情绪分析,Naïve贝叶斯,流行病,随机森林,支持向量机。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.50
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信