从Twitter数据研究和分析COVID-19传播的情感分析

Qanita Bani Baker, Ayah Abu Aqouleh, Ola Altiti
{"title":"从Twitter数据研究和分析COVID-19传播的情感分析","authors":"Qanita Bani Baker, Ayah Abu Aqouleh, Ola Altiti","doi":"10.1109/SNAMS53716.2021.9731855","DOIUrl":null,"url":null,"abstract":"Coronavirus, known as COVID-19, rapidly spread on a wide scale in a short time consequently. World Health Organization (WHO) classified it as a global pandemic. Social networks news becomes a valuable resource for massive amounts of data and news about the epidemic in which news is deliberating every day. Twitter is one of these networks which is a popular platform that contains rich information and currently it repre-sents a rich resource of data about COVID-19. In this research, we study and analyze the spreading of the COVID-19 epidemic based on the location and dates using datasets from Twitter. Moreover, the study has done by performing sentiment analysis and making a correlation study between confirmed cases in a set of countries and the sentiment's polarity value including negative and positive as well as a correlation between the number of confirmed cases and number of tweets per country. Also, we have experimented with several machine learning classifiers including Naive base, Support Vector Machine, and Logistic Regression as well as RoBERTa model to predict the sentiment analysis on the dataset. The experimental results show that Logistic Regression outperforms other classifiers with an accuracy of 0.86%, thus, machine learning techniques could be used to study the sentiment of tweets which gives reasonable results.","PeriodicalId":387260,"journal":{"name":"2021 Eighth International Conference on Social Network Analysis, Management and Security (SNAMS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Sentimental Analysis for Studying and Analyzing the Spreading of COVID-19 from Twitter Data\",\"authors\":\"Qanita Bani Baker, Ayah Abu Aqouleh, Ola Altiti\",\"doi\":\"10.1109/SNAMS53716.2021.9731855\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Coronavirus, known as COVID-19, rapidly spread on a wide scale in a short time consequently. World Health Organization (WHO) classified it as a global pandemic. Social networks news becomes a valuable resource for massive amounts of data and news about the epidemic in which news is deliberating every day. Twitter is one of these networks which is a popular platform that contains rich information and currently it repre-sents a rich resource of data about COVID-19. In this research, we study and analyze the spreading of the COVID-19 epidemic based on the location and dates using datasets from Twitter. Moreover, the study has done by performing sentiment analysis and making a correlation study between confirmed cases in a set of countries and the sentiment's polarity value including negative and positive as well as a correlation between the number of confirmed cases and number of tweets per country. Also, we have experimented with several machine learning classifiers including Naive base, Support Vector Machine, and Logistic Regression as well as RoBERTa model to predict the sentiment analysis on the dataset. The experimental results show that Logistic Regression outperforms other classifiers with an accuracy of 0.86%, thus, machine learning techniques could be used to study the sentiment of tweets which gives reasonable results.\",\"PeriodicalId\":387260,\"journal\":{\"name\":\"2021 Eighth International Conference on Social Network Analysis, Management and Security (SNAMS)\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Eighth International Conference on Social Network Analysis, Management and Security (SNAMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SNAMS53716.2021.9731855\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Eighth International Conference on Social Network Analysis, Management and Security (SNAMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNAMS53716.2021.9731855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

因此,冠状病毒(COVID-19)在短时间内迅速大规模传播。世界卫生组织(WHO)将其列为全球流行病。社交网络新闻成为大量数据和疫情新闻的宝贵资源,新闻每天都在审议。推特是这些网络之一,它是一个包含丰富信息的流行平台,目前它代表了关于COVID-19的丰富数据资源。在本研究中,我们使用来自Twitter的数据集,基于位置和日期来研究和分析COVID-19流行病的传播。此外,本研究通过进行情绪分析,对一组国家的确诊病例与情绪的极性值(负极性和正极性)以及每个国家的确诊病例数与推文数之间的相关性进行了研究。此外,我们还试验了几种机器学习分类器,包括朴素基、支持向量机和逻辑回归,以及RoBERTa模型来预测数据集上的情感分析。实验结果表明,Logistic回归的准确率为0.86%,优于其他分类器,因此可以使用机器学习技术来研究推文的情绪,并给出合理的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sentimental Analysis for Studying and Analyzing the Spreading of COVID-19 from Twitter Data
Coronavirus, known as COVID-19, rapidly spread on a wide scale in a short time consequently. World Health Organization (WHO) classified it as a global pandemic. Social networks news becomes a valuable resource for massive amounts of data and news about the epidemic in which news is deliberating every day. Twitter is one of these networks which is a popular platform that contains rich information and currently it repre-sents a rich resource of data about COVID-19. In this research, we study and analyze the spreading of the COVID-19 epidemic based on the location and dates using datasets from Twitter. Moreover, the study has done by performing sentiment analysis and making a correlation study between confirmed cases in a set of countries and the sentiment's polarity value including negative and positive as well as a correlation between the number of confirmed cases and number of tweets per country. Also, we have experimented with several machine learning classifiers including Naive base, Support Vector Machine, and Logistic Regression as well as RoBERTa model to predict the sentiment analysis on the dataset. The experimental results show that Logistic Regression outperforms other classifiers with an accuracy of 0.86%, thus, machine learning techniques could be used to study the sentiment of tweets which gives reasonable results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信