Uzair Shah, Alaa A. Abd-alrazaq, Jens Schneider, Mowafa J Househ, Zubair Shah
{"title":"Twitters’ Concerns and Opinions About the COVID-19 Booster Shots: Infoveillance Study","authors":"Uzair Shah, Alaa A. Abd-alrazaq, Jens Schneider, Mowafa J Househ, Zubair Shah","doi":"10.1080/15398285.2022.2106404","DOIUrl":null,"url":null,"abstract":"Abstract Objective: This study aimed to categorize and analyze the public response toward third/booster shots of COVID-19 on Twitter. Methods: We downloaded the COVID-19 vaccine booster shots related Tweets using the Twitter API. The collected Tweets were pre-processed to prepare them for analysis by (1) removing non-English language tweets, retweets, emojis, emoticons, non-printable characters, the punctuation marks, and the prepositions, (2) anonymizing the identity of the users, and (3) normalizing various forms of the same words. We used the state-of-the-art BertTopic modeling library to identify the most popular topics. Results: Of 165,048 Tweets collected, 36,908 Tweets were analyzed in this study. From these tweets, we identified 9 topics, which were about Biden administration, Pfizer & BioNTech, Moderna, Johnson & Johnson, eligibility for booster shots, side effects, Donald Trump, variants of the Novel Coronavirus, and conspiracy theory & propaganda. The mean of sentiment was positive in all topics. The lowest and highest mean of sentiments were for the Donald Trump topic (0.0097) and the Johnson & Johnson topic (0.1294), respectively. Conclusions: The topics identified in this study not only accurately reflect the contemporary COVID-19 discussion, but also the high degree of politicization in the USA. While the latter might be a result of our rejection of non-English tweets, it is reassuring to see our fully automated, unsupervised pipeline reliably extract such global features in the data at scale. We, therefore, believe that the methodology presented in this study is mature and useful for other infoveillance studies on a wide variety of topics.","PeriodicalId":44184,"journal":{"name":"Journal of Consumer Health on the Internet","volume":null,"pages":null},"PeriodicalIF":0.4000,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Consumer Health on the Internet","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/15398285.2022.2106404","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 1
Abstract
Abstract Objective: This study aimed to categorize and analyze the public response toward third/booster shots of COVID-19 on Twitter. Methods: We downloaded the COVID-19 vaccine booster shots related Tweets using the Twitter API. The collected Tweets were pre-processed to prepare them for analysis by (1) removing non-English language tweets, retweets, emojis, emoticons, non-printable characters, the punctuation marks, and the prepositions, (2) anonymizing the identity of the users, and (3) normalizing various forms of the same words. We used the state-of-the-art BertTopic modeling library to identify the most popular topics. Results: Of 165,048 Tweets collected, 36,908 Tweets were analyzed in this study. From these tweets, we identified 9 topics, which were about Biden administration, Pfizer & BioNTech, Moderna, Johnson & Johnson, eligibility for booster shots, side effects, Donald Trump, variants of the Novel Coronavirus, and conspiracy theory & propaganda. The mean of sentiment was positive in all topics. The lowest and highest mean of sentiments were for the Donald Trump topic (0.0097) and the Johnson & Johnson topic (0.1294), respectively. Conclusions: The topics identified in this study not only accurately reflect the contemporary COVID-19 discussion, but also the high degree of politicization in the USA. While the latter might be a result of our rejection of non-English tweets, it is reassuring to see our fully automated, unsupervised pipeline reliably extract such global features in the data at scale. We, therefore, believe that the methodology presented in this study is mature and useful for other infoveillance studies on a wide variety of topics.
期刊介绍:
The Journal of Consumer Health on the Internet is the only professional peer-reviewed journal devoted to locating consumer health information via the Internet. In this journal librarians and health information providers describe programs and services aimed at helping patients and the general public find the health information they need. From the Editor: "Studies have shown that health information is one of the major reasons that people worldwide access the Internet. As the amount of health information on the Web increases exponentially, it becomes critical that librarians-including public and medical librarians-be knowledgeable about what is available online and be able to direct users to reliable, accurate, quality information."