Using machine-assisted topic analysis to expedite thematic analysis of free-text data: Exemplar investigation of factors influencing health behaviours and wellbeing during the COVID-19 pandemic

IF 2.5 2区 心理学 Q1 PSYCHOLOGY, CLINICAL
Emma Ward, Felix Naughton, Pippa Belderson, Trisevgeni Papakonstantinou, Ben Ainsworth, Sarah Hanson, Caitlin Notley, Paulina Bondaronek
{"title":"Using machine-assisted topic analysis to expedite thematic analysis of free-text data: Exemplar investigation of factors influencing health behaviours and wellbeing during the COVID-19 pandemic","authors":"Emma Ward,&nbsp;Felix Naughton,&nbsp;Pippa Belderson,&nbsp;Trisevgeni Papakonstantinou,&nbsp;Ben Ainsworth,&nbsp;Sarah Hanson,&nbsp;Caitlin Notley,&nbsp;Paulina Bondaronek","doi":"10.1111/bjhp.70017","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Objectives</h3>\n \n <p>Investigate the use of machine learning to expedite thematic analysis of qualitative data concerning factors that influenced health behaviours and wellbeing during the COVID-19 pandemic.</p>\n </section>\n \n <section>\n \n <h3> Design</h3>\n \n <p>Qualitative investigation using Machine-Assisted Topic Analysis (MATA) of free-text data collected from a prospective cohort.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Free-text survey data (2177 responses from 762 participants) of influences on health behaviours and wellbeing were collected among UK participants recruited online, using Qualtrics at 3, 6, 12 and 24 months after the COVID-19 pandemic started. MATA, which employs structural topic modelling (STM), was used (in R) to discern latent topics within the responses. Two researchers independently labelled topics and collaboratively organized them into themes, with ‘sense checking’ from two additional researchers. Plots and rankings were generated, showing change in topic prevalence by time. Total researcher time to complete analysis was collated.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Fifteen STM-generated topics were labelled and integrated into six themes: the influences of and impacts on (1) health behaviours, (2) physical health (3) mood and (4) how these interacted, partly moderated by (5) external influences of control and (6) reflections on wellbeing and personal growth. Topic prevalence varied meaningfully over time, aligning with changes in the pandemic context. Themes were generated (excluding write-up) with 20 h combined researcher time.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>MATA shows promise as a resource-saving method for thematic analysis of large qualitative datasets whilst maintaining researcher control and insight. Findings show the interconnection between health behaviours, physical health and wellbeing over the pandemic, and the influence of control and reflective processes.</p>\n </section>\n </div>","PeriodicalId":48161,"journal":{"name":"British Journal of Health Psychology","volume":"30 3","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bjhp.70017","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Health Psychology","FirstCategoryId":"102","ListUrlMain":"https://bpspsychub.onlinelibrary.wiley.com/doi/10.1111/bjhp.70017","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, CLINICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives

Investigate the use of machine learning to expedite thematic analysis of qualitative data concerning factors that influenced health behaviours and wellbeing during the COVID-19 pandemic.

Design

Qualitative investigation using Machine-Assisted Topic Analysis (MATA) of free-text data collected from a prospective cohort.

Methods

Free-text survey data (2177 responses from 762 participants) of influences on health behaviours and wellbeing were collected among UK participants recruited online, using Qualtrics at 3, 6, 12 and 24 months after the COVID-19 pandemic started. MATA, which employs structural topic modelling (STM), was used (in R) to discern latent topics within the responses. Two researchers independently labelled topics and collaboratively organized them into themes, with ‘sense checking’ from two additional researchers. Plots and rankings were generated, showing change in topic prevalence by time. Total researcher time to complete analysis was collated.

Results

Fifteen STM-generated topics were labelled and integrated into six themes: the influences of and impacts on (1) health behaviours, (2) physical health (3) mood and (4) how these interacted, partly moderated by (5) external influences of control and (6) reflections on wellbeing and personal growth. Topic prevalence varied meaningfully over time, aligning with changes in the pandemic context. Themes were generated (excluding write-up) with 20 h combined researcher time.

Conclusions

MATA shows promise as a resource-saving method for thematic analysis of large qualitative datasets whilst maintaining researcher control and insight. Findings show the interconnection between health behaviours, physical health and wellbeing over the pandemic, and the influence of control and reflective processes.

Abstract Image

Abstract Image

Abstract Image

利用机器辅助主题分析加快自由文本数据的主题分析:COVID-19大流行期间影响健康行为和福祉因素的范例调查
目的研究使用机器学习来加快对COVID-19大流行期间影响健康行为和福祉因素的定性数据的专题分析。设计使用机器辅助主题分析(MATA)对从前瞻性队列中收集的自由文本数据进行定性调查。方法在2019冠状病毒病大流行开始后的3、6、12和24个月,使用Qualtrics收集在线招募的英国参与者的健康行为和幸福感影响的自由文本调查数据(来自762名参与者的2177份回复)。采用结构主题建模(STM)的MATA被用于(在R中)识别响应中的潜在主题。两名研究人员独立标记主题,并合作将其组织成主题,另外两名研究人员进行“感觉检查”。生成图表和排名,显示话题流行度随时间的变化。整理研究人员完成分析的总时间。结果15个stm生成的主题被标记并整合为六个主题:对(1)健康行为的影响和影响,(2)身体健康,(3)情绪和(4)这些因素如何相互作用,部分由(5)控制的外部影响和(6)对健康和个人成长的反思来调节。随着时间的推移,主题流行率发生了有意义的变化,与大流行背景的变化保持一致。主题生成(不包括撰写)总共需要20小时的研究时间。结论:MATA有望成为大型定性数据集专题分析的资源节约方法,同时保持研究人员的控制和洞察力。调查结果显示,大流行期间的健康行为、身体健康和福祉,以及控制和反思过程的影响之间存在相互联系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
British Journal of Health Psychology
British Journal of Health Psychology PSYCHOLOGY, CLINICAL-
CiteScore
14.10
自引率
1.30%
发文量
58
期刊介绍: The focus of the British Journal of Health Psychology is to publish original research on various aspects of psychology that are related to health, health-related behavior, and illness throughout a person's life. The journal specifically seeks articles that are based on health psychology theory or discuss theoretical matters within the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信