Temporal Relationship between Daily Reports of COVID-19 Infections and Related GDELT and Tweet Mentions

Q3 Social Sciences
Innocensia Owuor, Hartwig H. Hochmair
{"title":"Temporal Relationship between Daily Reports of COVID-19 Infections and Related GDELT and Tweet Mentions","authors":"Innocensia Owuor, Hartwig H. Hochmair","doi":"10.3390/geographies3030031","DOIUrl":null,"url":null,"abstract":"Social media platforms are valuable data sources in the study of public reactions to events such as natural disasters and epidemics. This research assesses for selected countries around the globe the time lag between daily reports of COVID-19 cases and GDELT (Global Database of Events, Language, and Tone) and Twitter (X) COVID-19 mentions between February 2020 and April 2021 using time series analysis. Results show that GDELT articles and tweets preceded COVID-19 infections in Australia, Brazil, France, Greece, India, Italy, the U.S., Canada, Germany, and the U.K., while for Poland and the Philippines, tweets preceded and GDELT articles lagged behind COVID-19 disease incidences, respectively. This shows that the application of social media and news data for surveillance and management of pandemics needs to be assessed on a case-by-case basis for different countries. It also points towards the applicability of time series data analysis for only a limited number of countries due to strict data requirements (e.g., stationarity). A deviation from generally observed lag patterns in a country, i.e., periods with low COVID-19 infections but unusually high numbers of COVID-19-related GDELT articles or tweets, signals an anomaly. We use the seasonal hybrid extreme Studentized deviate test to detect such anomalies. This is followed by text analysis of news headlines from NewsBank and Google on the date of these anomalies to determine the probable event causing an anomaly, which includes elections, holidays, and protests.","PeriodicalId":38507,"journal":{"name":"Human Geographies","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Geographies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/geographies3030031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 1

Abstract

Social media platforms are valuable data sources in the study of public reactions to events such as natural disasters and epidemics. This research assesses for selected countries around the globe the time lag between daily reports of COVID-19 cases and GDELT (Global Database of Events, Language, and Tone) and Twitter (X) COVID-19 mentions between February 2020 and April 2021 using time series analysis. Results show that GDELT articles and tweets preceded COVID-19 infections in Australia, Brazil, France, Greece, India, Italy, the U.S., Canada, Germany, and the U.K., while for Poland and the Philippines, tweets preceded and GDELT articles lagged behind COVID-19 disease incidences, respectively. This shows that the application of social media and news data for surveillance and management of pandemics needs to be assessed on a case-by-case basis for different countries. It also points towards the applicability of time series data analysis for only a limited number of countries due to strict data requirements (e.g., stationarity). A deviation from generally observed lag patterns in a country, i.e., periods with low COVID-19 infections but unusually high numbers of COVID-19-related GDELT articles or tweets, signals an anomaly. We use the seasonal hybrid extreme Studentized deviate test to detect such anomalies. This is followed by text analysis of news headlines from NewsBank and Google on the date of these anomalies to determine the probable event causing an anomaly, which includes elections, holidays, and protests.
每日COVID-19感染报告与相关GDELT和推特提及的时间关系
在研究公众对自然灾害和流行病等事件的反应时,社交媒体平台是宝贵的数据来源。本研究使用时间序列分析评估了全球选定国家在2020年2月至2021年4月期间每日COVID-19病例报告与GDELT(全球事件、语言和语气数据库)和Twitter (X) COVID-19提及之间的时间差。结果显示,在澳大利亚、巴西、法国、希腊、印度、意大利、美国、加拿大、德国和英国,GDELT文章和推文分别先于COVID-19疾病发病率,而在波兰和菲律宾,推文和GDELT文章分别滞后于COVID-19疾病发病率。这表明,在监测和管理大流行病方面应用社交媒体和新闻数据需要根据不同国家的具体情况进行评估。它还指出,由于严格的数据要求(例如,平稳性),时间序列数据分析只适用于有限数量的国家。一国与普遍观察到的滞后模式发生偏差,即在COVID-19感染率较低但与COVID-19相关的GDELT文章或推文数量异常高的时期,表明存在异常现象。我们使用季节性混合极端学生偏差检验来检测这些异常。接下来是对NewsBank和Google在这些异常日期的新闻标题进行文本分析,以确定导致异常的可能事件,包括选举、假日和抗议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Human Geographies
Human Geographies Social Sciences-Geography, Planning and Development
CiteScore
1.10
自引率
0.00%
发文量
7
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信