Leveraging natural language processing and geospatial time series model to analyze COVID-19 vaccination sentiment dynamics on Tweets.

IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES

JAMIA Open Pub Date : 2023-07-01 DOI:10.1093/jamiaopen/ooad023

Jiancheng Ye, Jiarui Hai, Zidan Wang, Chumei Wei, Jiacheng Song

{"title":"Leveraging natural language processing and geospatial time series model to analyze COVID-19 vaccination sentiment dynamics on Tweets.","authors":"Jiancheng Ye, Jiarui Hai, Zidan Wang, Chumei Wei, Jiacheng Song","doi":"10.1093/jamiaopen/ooad023","DOIUrl":null,"url":null,"abstract":"Objective: To develop and apply a natural language processing (NLP)-based approach to analyze public sentiments on social media and their geographic pattern in the United States toward coronavirus disease 2019 (COVID-19) vaccination. We also aim to provide insights to facilitate the understanding of the public attitudes and concerns regarding COVID-19 vaccination.Methods: We collected Tweet posts by the residents in the United States after the dissemination of the COVID-19 vaccine. We performed sentiment analysis based on the Bidirectional Encoder Representations from Transformers (BERT) and qualitative content analysis. Time series models were leveraged to describe sentiment trends. Key topics were analyzed longitudinally and geospatially.Results: A total of 3 198 686 Tweets related to COVID-19 vaccination were extracted from January 2021 to February 2022. 2 358 783 Tweets were identified to contain clear opinions, among which 824 755 (35.0%) expressed negative opinions towards vaccination while 1 534 028 (65.0%) demonstrated positive opinions. The accuracy of the BERT model was 79.67%. The key hashtag-based topics include Pfizer, breaking, wearamask, and smartnews. The sentiment towards vaccination across the states showed manifest variability. Key barriers to vaccination include mistrust, hesitancy, safety concern, misinformation, and inequity.Conclusion: We found that opinions toward the COVID-19 vaccination varied across different places and over time. This study demonstrates the potential of an analytical pipeline, which integrates NLP-enabled modeling, time series, and geospatial analyses of social media data. Such analyses could enable real-time assessment, at scale, of public confidence and trust in COVID-19 vaccination, help address the concerns of vaccine skeptics, and provide support for developing tailored policies and communication strategies to maximize uptake.","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"6 2","pages":"ooad023"},"PeriodicalIF":2.5000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10097455/pdf/","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooad023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 6

Abstract

Objective: To develop and apply a natural language processing (NLP)-based approach to analyze public sentiments on social media and their geographic pattern in the United States toward coronavirus disease 2019 (COVID-19) vaccination. We also aim to provide insights to facilitate the understanding of the public attitudes and concerns regarding COVID-19 vaccination.

Methods: We collected Tweet posts by the residents in the United States after the dissemination of the COVID-19 vaccine. We performed sentiment analysis based on the Bidirectional Encoder Representations from Transformers (BERT) and qualitative content analysis. Time series models were leveraged to describe sentiment trends. Key topics were analyzed longitudinally and geospatially.

Results: A total of 3 198 686 Tweets related to COVID-19 vaccination were extracted from January 2021 to February 2022. 2 358 783 Tweets were identified to contain clear opinions, among which 824 755 (35.0%) expressed negative opinions towards vaccination while 1 534 028 (65.0%) demonstrated positive opinions. The accuracy of the BERT model was 79.67%. The key hashtag-based topics include Pfizer, breaking, wearamask, and smartnews. The sentiment towards vaccination across the states showed manifest variability. Key barriers to vaccination include mistrust, hesitancy, safety concern, misinformation, and inequity.

Conclusion: We found that opinions toward the COVID-19 vaccination varied across different places and over time. This study demonstrates the potential of an analytical pipeline, which integrates NLP-enabled modeling, time series, and geospatial analyses of social media data. Such analyses could enable real-time assessment, at scale, of public confidence and trust in COVID-19 vaccination, help address the concerns of vaccine skeptics, and provide support for developing tailored policies and communication strategies to maximize uptake.

Abstract Image

查看原文本刊更多论文

利用自然语言处理和地理空间时间序列模型分析推特上COVID-19疫苗接种情绪动态。

目的:开发并应用基于自然语言处理(NLP)的方法，分析美国社交媒体上公众对2019冠状病毒病(COVID-19)疫苗接种的情绪及其地理分布。我们还旨在提供见解，以促进了解公众对COVID-19疫苗接种的态度和关切。方法:收集COVID-19疫苗传播后美国居民的推文。我们基于变形金刚的双向编码器表示(BERT)和定性内容分析进行了情感分析。利用时间序列模型来描述情绪趋势。重点课题在纵向和地理空间上进行了分析。结果:从2021年1月至2022年2月，共提取了3 198686条与COVID-19疫苗接种相关的推文。共有2 358 783条推文表达了明确的意见，其中824 755条(35.0%)对疫苗接种持否定意见，1 534 028条(65.0%)持肯定意见。BERT模型的准确率为79.67%。关键的话题包括辉瑞、突发事件、戴面具和智能新闻。各州对疫苗接种的态度表现出明显的差异。接种疫苗的主要障碍包括不信任、犹豫、安全担忧、错误信息和不公平。结论:我们发现，不同地区和不同时期对COVID-19疫苗接种的看法存在差异。这项研究展示了分析管道的潜力，它集成了支持nlp的建模、时间序列和社交媒体数据的地理空间分析。这种分析可以大规模实时评估公众对COVID-19疫苗接种的信心和信任，有助于解决疫苗怀疑论者的担忧，并为制定量身定制的政策和沟通战略提供支持，以最大限度地利用疫苗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JAMIA Open Medicine-Health Informatics

CiteScore

4.10

自引率

4.80%

发文量

102

审稿时长

16 weeks