Chenghan Wen, Deema Alnuhait, Shanghu Liu, Wenda Zhou, Jun Zhang
{"title":"COVID-19危机期间美国社交媒体仇恨言论的实证研究","authors":"Chenghan Wen, Deema Alnuhait, Shanghu Liu, Wenda Zhou, Jun Zhang","doi":"10.56028/iajhss.1.2.1","DOIUrl":null,"url":null,"abstract":"As the COVID-19 outbreak, hate speech on social media towards Chinese and other Asian groups has encouraged \"Sinophobia\". To capture the situation of hate speech on Twitter, we hydrated the tweets discussing COVID-19, written in English language and have location within the USA. These tweets hydrated from Twitter API in span of 5 months (153 days). We have obtained 543,943 tweets in which we identify 40,579 Hate Speech occurrences. We categorized and analyzed them according to Pysentimiento model which is based on BERT models and, Latent Dirichlet Allocation Model (LDA). The results indicate that there are substantial associations between the increased amount of hate speech and the increased rate of deaths due to COVID-19,increased rate of new COVID-19 cases, and negative tests rate.","PeriodicalId":216977,"journal":{"name":"International Academic Journal of Humanities and Social Sciences","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Empirical Study of Hate Speech in Social Media During COVID-19 Crisis in the United States\",\"authors\":\"Chenghan Wen, Deema Alnuhait, Shanghu Liu, Wenda Zhou, Jun Zhang\",\"doi\":\"10.56028/iajhss.1.2.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the COVID-19 outbreak, hate speech on social media towards Chinese and other Asian groups has encouraged \\\"Sinophobia\\\". To capture the situation of hate speech on Twitter, we hydrated the tweets discussing COVID-19, written in English language and have location within the USA. These tweets hydrated from Twitter API in span of 5 months (153 days). We have obtained 543,943 tweets in which we identify 40,579 Hate Speech occurrences. We categorized and analyzed them according to Pysentimiento model which is based on BERT models and, Latent Dirichlet Allocation Model (LDA). The results indicate that there are substantial associations between the increased amount of hate speech and the increased rate of deaths due to COVID-19,increased rate of new COVID-19 cases, and negative tests rate.\",\"PeriodicalId\":216977,\"journal\":{\"name\":\"International Academic Journal of Humanities and Social Sciences\",\"volume\":\"69 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Academic Journal of Humanities and Social Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.56028/iajhss.1.2.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Academic Journal of Humanities and Social Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56028/iajhss.1.2.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
随着新冠肺炎疫情的爆发,社交媒体上针对中国和其他亚洲群体的仇恨言论助长了“恐华症”。为了捕捉推特上仇恨言论的情况,我们对讨论COVID-19的推文进行了补水,这些推文用英语写成,位于美国境内。这些推文是在5个月(153天)的时间里从Twitter API中获取的。我们已经获得了543,943条推文,其中我们确定了40,579个仇恨言论。我们根据基于BERT模型的Pysentimiento模型和Latent Dirichlet Allocation model (LDA)对它们进行分类和分析。结果表明,仇恨言论数量的增加与COVID-19死亡率、新病例率和阴性检测率的增加存在实质性关联。
Empirical Study of Hate Speech in Social Media During COVID-19 Crisis in the United States
As the COVID-19 outbreak, hate speech on social media towards Chinese and other Asian groups has encouraged "Sinophobia". To capture the situation of hate speech on Twitter, we hydrated the tweets discussing COVID-19, written in English language and have location within the USA. These tweets hydrated from Twitter API in span of 5 months (153 days). We have obtained 543,943 tweets in which we identify 40,579 Hate Speech occurrences. We categorized and analyzed them according to Pysentimiento model which is based on BERT models and, Latent Dirichlet Allocation Model (LDA). The results indicate that there are substantial associations between the increased amount of hate speech and the increased rate of deaths due to COVID-19,increased rate of new COVID-19 cases, and negative tests rate.