Using Digital Humanities for Understanding COVID-19: Lessons from Digital History about earlier Coronavirus Pandemic

T. Jurić
{"title":"Using Digital Humanities for Understanding COVID-19: Lessons from Digital History about earlier Coronavirus Pandemic","authors":"T. Jurić","doi":"10.1101/2022.02.02.22270333","DOIUrl":null,"url":null,"abstract":"Background: At the time of the COVID-19 epidemic, it is useful to look at what lessons (digital) history can give us about the past pandemics and dealing with them. We show that the Google Ngram (GNV) can discover hidden patterns in history and, therefore, can be used as a window into history. By using the approach of Digital Humanities, we analysed the epidemiological literature on the development of the Russian flu pandemic for hints on how the COVID-19 might develop in the following years. Objective: Our study is searching for evidence that the COVID-19 is not a unique phenomenon in human history. We are testing the hypothesis that the flu-like illness that caused loss of taste and smell in the late 19th century (Russian flu) was caused by a coronavirus. We are aware that it is difficult to formulate a hypothesis for a microbiological aetiology of a pandemic that occurred 133 years ago. But differentiating an influenza virus infection from a COVID-19 patient purely on the clinical ground is difficult for a physician because the symptoms overlap. The most crucial observation of similarities between the Russian flu pandemic and COVID-19 is the loss of smell and taste (anosmia and ageusia). The objective was to calculate the ratio of increasing to decreasing trends in the changes in frequencies of the selected words representing symptoms of the Russian flu and COVID-19. Methods: The primary methodological concept of our approach is to analyse the ratio of increasing to decreasing trends in the changes in frequencies of the selected words representing symptoms of the Russian flu and COVID-19 with the Google NGram analytical tool. Initially, keywords were chosen that are specific and common for the Russian flu and COVID-19. We show the graphic display on the Y-axis what percentage of words in the selected corpus of books (collective memory) over the years (X-axis) make up the word. To standardise the data, we requested the data from 1800 to 2019 in English, German and Russian (to 2012) book corpora and focused on the ten years before, during and after the outbreak of the Russian flu. We compared this frequency index with non-epidemic periods to test the model analytical potential and prove the signification of the results. Results: The COVID-19 is not a unique phenomenon because the Russian flu was probably the coronavirus infection. Results show that all the three analysed book corpora (including newspapers and magazines) show the increase in the mention of the symptoms loss of smell and loss of taste during the Russian flu (1889-1891), which are today undoubtedly proven to be key symptoms of COVID-19. In the English corpus, the frequency rose from 0.0000040433 % in 1880 to 0.0000047123 % in 1889. The frequency fell sharply after the pandemic stopped in 1900 (0.0000033861%). In the Russian corpus, the frequency rises from 0 % in 1880 to 0.0000004682 % in 1889 and decreased rapidly after the pandemic (1900 = 0.0000011834 %). In the German corpus, the frequency rose from 0.0000014463 % in 1880 to 0.0000018015 % in 1889 and decreased also rapidly after the pandemic (1900 = 0.0000016600 %). According to our analysis of historical records with the approach of GNV, 1) the natural length of a pandemic is two to five years; 2) the pandemic stops on their own; 3) the viruses weaken over time; 4) the so-called herd immunity is not necessary to stop the pandemic; 5) history has shown that a significant crisis does not need to occur after the COVID-19 pandemic. Conclusion: According to our study, the Google Books Ngram Viewer (GNV) gives a clear evidence of the influence that social changes have on word frequency. The results of this study open a discussion on the usefulness of the Google Ngram insights possibilities into past socio-cultural development, i.e. epidemics and pandemics that can serve as lessons for today. We showed hidden patterns of conceptual trends in history and their relationships with current development in the case of the pandemic COVID-19. The benefit of this method could help complement historical medical records, which are often woefully incomplete. However, this method comes with severe limitations and can be useful only under cautious handling and testing. Despite the numerous indications we have shown, we are aware that this thesis still cannot be confirmed and that it is necessary to require further historical and medical research. Keywords: Google Ngram, Big Data, epidemic, COVID-19, Russian flu, Digital Humanities","PeriodicalId":197899,"journal":{"name":"Athens Journal of Τechnology & Engineering","volume":"03 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Athens Journal of Τechnology & Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2022.02.02.22270333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Background: At the time of the COVID-19 epidemic, it is useful to look at what lessons (digital) history can give us about the past pandemics and dealing with them. We show that the Google Ngram (GNV) can discover hidden patterns in history and, therefore, can be used as a window into history. By using the approach of Digital Humanities, we analysed the epidemiological literature on the development of the Russian flu pandemic for hints on how the COVID-19 might develop in the following years. Objective: Our study is searching for evidence that the COVID-19 is not a unique phenomenon in human history. We are testing the hypothesis that the flu-like illness that caused loss of taste and smell in the late 19th century (Russian flu) was caused by a coronavirus. We are aware that it is difficult to formulate a hypothesis for a microbiological aetiology of a pandemic that occurred 133 years ago. But differentiating an influenza virus infection from a COVID-19 patient purely on the clinical ground is difficult for a physician because the symptoms overlap. The most crucial observation of similarities between the Russian flu pandemic and COVID-19 is the loss of smell and taste (anosmia and ageusia). The objective was to calculate the ratio of increasing to decreasing trends in the changes in frequencies of the selected words representing symptoms of the Russian flu and COVID-19. Methods: The primary methodological concept of our approach is to analyse the ratio of increasing to decreasing trends in the changes in frequencies of the selected words representing symptoms of the Russian flu and COVID-19 with the Google NGram analytical tool. Initially, keywords were chosen that are specific and common for the Russian flu and COVID-19. We show the graphic display on the Y-axis what percentage of words in the selected corpus of books (collective memory) over the years (X-axis) make up the word. To standardise the data, we requested the data from 1800 to 2019 in English, German and Russian (to 2012) book corpora and focused on the ten years before, during and after the outbreak of the Russian flu. We compared this frequency index with non-epidemic periods to test the model analytical potential and prove the signification of the results. Results: The COVID-19 is not a unique phenomenon because the Russian flu was probably the coronavirus infection. Results show that all the three analysed book corpora (including newspapers and magazines) show the increase in the mention of the symptoms loss of smell and loss of taste during the Russian flu (1889-1891), which are today undoubtedly proven to be key symptoms of COVID-19. In the English corpus, the frequency rose from 0.0000040433 % in 1880 to 0.0000047123 % in 1889. The frequency fell sharply after the pandemic stopped in 1900 (0.0000033861%). In the Russian corpus, the frequency rises from 0 % in 1880 to 0.0000004682 % in 1889 and decreased rapidly after the pandemic (1900 = 0.0000011834 %). In the German corpus, the frequency rose from 0.0000014463 % in 1880 to 0.0000018015 % in 1889 and decreased also rapidly after the pandemic (1900 = 0.0000016600 %). According to our analysis of historical records with the approach of GNV, 1) the natural length of a pandemic is two to five years; 2) the pandemic stops on their own; 3) the viruses weaken over time; 4) the so-called herd immunity is not necessary to stop the pandemic; 5) history has shown that a significant crisis does not need to occur after the COVID-19 pandemic. Conclusion: According to our study, the Google Books Ngram Viewer (GNV) gives a clear evidence of the influence that social changes have on word frequency. The results of this study open a discussion on the usefulness of the Google Ngram insights possibilities into past socio-cultural development, i.e. epidemics and pandemics that can serve as lessons for today. We showed hidden patterns of conceptual trends in history and their relationships with current development in the case of the pandemic COVID-19. The benefit of this method could help complement historical medical records, which are often woefully incomplete. However, this method comes with severe limitations and can be useful only under cautious handling and testing. Despite the numerous indications we have shown, we are aware that this thesis still cannot be confirmed and that it is necessary to require further historical and medical research. Keywords: Google Ngram, Big Data, epidemic, COVID-19, Russian flu, Digital Humanities
利用数字人文科学理解COVID-19:早期冠状病毒大流行的数字历史教训
背景:在2019冠状病毒病流行之际,回顾历史(数字)可以给我们提供哪些关于过去大流行的经验教训并加以应对,是有益的。我们表明Google Ngram (GNV)可以发现历史中隐藏的模式,因此可以用作历史的窗口。通过使用数字人文学科的方法,我们分析了有关俄罗斯流感大流行发展的流行病学文献,以寻找2019冠状病毒病在未来几年可能如何发展的线索。目的:我们的研究是寻找证据,证明COVID-19不是人类历史上的一个独特现象。我们正在测试一种假设,即19世纪末导致味觉和嗅觉丧失的流感样疾病(俄罗斯流感)是由冠状病毒引起的。我们意识到,很难对133年前发生的一场大流行病的微生物病因作出假设。但是,单纯从临床角度区分流感病毒感染和COVID-19患者对医生来说很困难,因为症状重叠。关于俄罗斯流感大流行与COVID-19之间的相似之处,最重要的观察结果是嗅觉和味觉的丧失(嗅觉缺失和衰老)。目的是计算代表俄罗斯流感和COVID-19症状的选定单词频率变化的增减趋势的比率。方法:本方法的主要方法学概念是使用Google NGram分析工具分析代表俄罗斯流感和COVID-19症状的选定单词频率变化的增减趋势之比。最初,选择了针对俄罗斯流感和COVID-19的特定和常见关键词。我们在y轴上显示图形,显示多年来选定的书籍语料库(集体记忆)中组成单词的单词百分比(x轴)。为了使数据标准化,我们要求从1800年到2019年的英语、德语和俄语(到2012年)图书语料库的数据,并重点关注俄罗斯流感爆发前、期间和之后的十年。我们将该频率指数与非流行病时期进行了比较,以检验模型的分析潜力并证明结果的意义。结果:俄罗斯流感很可能是冠状病毒感染,COVID-19不是一种独特的现象。结果表明,分析的所有三种图书语料库(包括报纸和杂志)都显示,在俄罗斯流感(1889-1891年)期间,嗅觉丧失和味觉丧失的症状被提及的次数有所增加,这些症状今天无疑被证明是COVID-19的关键症状。在英语语料库中,频率从1880年的0.0000040433%上升到1889年的0.0000047123%。在1900年大流行停止后,频率急剧下降(0.0000033861%)。在俄语语料库中,发病率从1880年的0%上升到1889年的0.0000004682%,并在大流行后迅速下降(1900年= 0.0000011834%)。在德语语料库中,发病率从1880年的0.0000014463%上升到1889年的0.0000018015%,大流行后也迅速下降(1900 = 0.0000016600%)。根据我们用GNV方法对历史记录的分析,1)大流行的自然长度为2至5年;2)疫情自行停止;3)病毒随着时间的推移而减弱;4)所谓的群体免疫不是阻止大流行所必需的;5)历史表明,在2019冠状病毒病大流行之后,并不一定会发生重大危机。结论:根据我们的研究,Google Books Ngram Viewer (GNV)给出了社会变化对词频影响的明确证据。这项研究的结果开启了关于Google Ngram洞察过去社会文化发展可能性的有用性的讨论,即流行病和流行病可以作为今天的教训。我们以COVID-19大流行为例,揭示了历史上概念趋势的隐藏模式及其与当前发展的关系。这种方法的好处是可以帮助补充历史医疗记录,这些记录往往是不完整的。然而,这种方法有严重的局限性,只有在谨慎处理和测试的情况下才能使用。尽管我们已经表明了许多迹象,但我们意识到,这一论点仍然无法得到证实,需要进一步的历史和医学研究。关键词:Google Ngram,大数据,疫情,COVID-19,俄罗斯流感,数字人文
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信