Chenghan Wen, Deema Alnuhait, Shanghu Liu, Wenda Zhou, Jun Zhang
{"title":"Empirical Study of Hate Speech in Social Media During COVID-19 Crisis in the United States","authors":"Chenghan Wen, Deema Alnuhait, Shanghu Liu, Wenda Zhou, Jun Zhang","doi":"10.56028/iajhss.1.2.1","DOIUrl":null,"url":null,"abstract":"As the COVID-19 outbreak, hate speech on social media towards Chinese and other Asian groups has encouraged \"Sinophobia\". To capture the situation of hate speech on Twitter, we hydrated the tweets discussing COVID-19, written in English language and have location within the USA. These tweets hydrated from Twitter API in span of 5 months (153 days). We have obtained 543,943 tweets in which we identify 40,579 Hate Speech occurrences. We categorized and analyzed them according to Pysentimiento model which is based on BERT models and, Latent Dirichlet Allocation Model (LDA). The results indicate that there are substantial associations between the increased amount of hate speech and the increased rate of deaths due to COVID-19,increased rate of new COVID-19 cases, and negative tests rate.","PeriodicalId":216977,"journal":{"name":"International Academic Journal of Humanities and Social Sciences","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Academic Journal of Humanities and Social Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56028/iajhss.1.2.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As the COVID-19 outbreak, hate speech on social media towards Chinese and other Asian groups has encouraged "Sinophobia". To capture the situation of hate speech on Twitter, we hydrated the tweets discussing COVID-19, written in English language and have location within the USA. These tweets hydrated from Twitter API in span of 5 months (153 days). We have obtained 543,943 tweets in which we identify 40,579 Hate Speech occurrences. We categorized and analyzed them according to Pysentimiento model which is based on BERT models and, Latent Dirichlet Allocation Model (LDA). The results indicate that there are substantial associations between the increased amount of hate speech and the increased rate of deaths due to COVID-19,increased rate of new COVID-19 cases, and negative tests rate.