Emma Ward, Felix Naughton, Pippa Belderson, Trisevgeni Papakonstantinou, Ben Ainsworth, Sarah Hanson, Caitlin Notley, Paulina Bondaronek
{"title":"利用机器辅助主题分析加快自由文本数据的主题分析:COVID-19大流行期间影响健康行为和福祉因素的范例调查","authors":"Emma Ward, Felix Naughton, Pippa Belderson, Trisevgeni Papakonstantinou, Ben Ainsworth, Sarah Hanson, Caitlin Notley, Paulina Bondaronek","doi":"10.1111/bjhp.70017","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Objectives</h3>\n \n <p>Investigate the use of machine learning to expedite thematic analysis of qualitative data concerning factors that influenced health behaviours and wellbeing during the COVID-19 pandemic.</p>\n </section>\n \n <section>\n \n <h3> Design</h3>\n \n <p>Qualitative investigation using Machine-Assisted Topic Analysis (MATA) of free-text data collected from a prospective cohort.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Free-text survey data (2177 responses from 762 participants) of influences on health behaviours and wellbeing were collected among UK participants recruited online, using Qualtrics at 3, 6, 12 and 24 months after the COVID-19 pandemic started. MATA, which employs structural topic modelling (STM), was used (in R) to discern latent topics within the responses. Two researchers independently labelled topics and collaboratively organized them into themes, with ‘sense checking’ from two additional researchers. Plots and rankings were generated, showing change in topic prevalence by time. Total researcher time to complete analysis was collated.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Fifteen STM-generated topics were labelled and integrated into six themes: the influences of and impacts on (1) health behaviours, (2) physical health (3) mood and (4) how these interacted, partly moderated by (5) external influences of control and (6) reflections on wellbeing and personal growth. Topic prevalence varied meaningfully over time, aligning with changes in the pandemic context. Themes were generated (excluding write-up) with 20 h combined researcher time.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>MATA shows promise as a resource-saving method for thematic analysis of large qualitative datasets whilst maintaining researcher control and insight. Findings show the interconnection between health behaviours, physical health and wellbeing over the pandemic, and the influence of control and reflective processes.</p>\n </section>\n </div>","PeriodicalId":48161,"journal":{"name":"British Journal of Health Psychology","volume":"30 3","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bjhp.70017","citationCount":"0","resultStr":"{\"title\":\"Using machine-assisted topic analysis to expedite thematic analysis of free-text data: Exemplar investigation of factors influencing health behaviours and wellbeing during the COVID-19 pandemic\",\"authors\":\"Emma Ward, Felix Naughton, Pippa Belderson, Trisevgeni Papakonstantinou, Ben Ainsworth, Sarah Hanson, Caitlin Notley, Paulina Bondaronek\",\"doi\":\"10.1111/bjhp.70017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Objectives</h3>\\n \\n <p>Investigate the use of machine learning to expedite thematic analysis of qualitative data concerning factors that influenced health behaviours and wellbeing during the COVID-19 pandemic.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Design</h3>\\n \\n <p>Qualitative investigation using Machine-Assisted Topic Analysis (MATA) of free-text data collected from a prospective cohort.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>Free-text survey data (2177 responses from 762 participants) of influences on health behaviours and wellbeing were collected among UK participants recruited online, using Qualtrics at 3, 6, 12 and 24 months after the COVID-19 pandemic started. MATA, which employs structural topic modelling (STM), was used (in R) to discern latent topics within the responses. Two researchers independently labelled topics and collaboratively organized them into themes, with ‘sense checking’ from two additional researchers. Plots and rankings were generated, showing change in topic prevalence by time. Total researcher time to complete analysis was collated.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>Fifteen STM-generated topics were labelled and integrated into six themes: the influences of and impacts on (1) health behaviours, (2) physical health (3) mood and (4) how these interacted, partly moderated by (5) external influences of control and (6) reflections on wellbeing and personal growth. Topic prevalence varied meaningfully over time, aligning with changes in the pandemic context. Themes were generated (excluding write-up) with 20 h combined researcher time.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions</h3>\\n \\n <p>MATA shows promise as a resource-saving method for thematic analysis of large qualitative datasets whilst maintaining researcher control and insight. Findings show the interconnection between health behaviours, physical health and wellbeing over the pandemic, and the influence of control and reflective processes.</p>\\n </section>\\n </div>\",\"PeriodicalId\":48161,\"journal\":{\"name\":\"British Journal of Health Psychology\",\"volume\":\"30 3\",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bjhp.70017\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"British Journal of Health Psychology\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://bpspsychub.onlinelibrary.wiley.com/doi/10.1111/bjhp.70017\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, CLINICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Health Psychology","FirstCategoryId":"102","ListUrlMain":"https://bpspsychub.onlinelibrary.wiley.com/doi/10.1111/bjhp.70017","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, CLINICAL","Score":null,"Total":0}
Using machine-assisted topic analysis to expedite thematic analysis of free-text data: Exemplar investigation of factors influencing health behaviours and wellbeing during the COVID-19 pandemic
Objectives
Investigate the use of machine learning to expedite thematic analysis of qualitative data concerning factors that influenced health behaviours and wellbeing during the COVID-19 pandemic.
Design
Qualitative investigation using Machine-Assisted Topic Analysis (MATA) of free-text data collected from a prospective cohort.
Methods
Free-text survey data (2177 responses from 762 participants) of influences on health behaviours and wellbeing were collected among UK participants recruited online, using Qualtrics at 3, 6, 12 and 24 months after the COVID-19 pandemic started. MATA, which employs structural topic modelling (STM), was used (in R) to discern latent topics within the responses. Two researchers independently labelled topics and collaboratively organized them into themes, with ‘sense checking’ from two additional researchers. Plots and rankings were generated, showing change in topic prevalence by time. Total researcher time to complete analysis was collated.
Results
Fifteen STM-generated topics were labelled and integrated into six themes: the influences of and impacts on (1) health behaviours, (2) physical health (3) mood and (4) how these interacted, partly moderated by (5) external influences of control and (6) reflections on wellbeing and personal growth. Topic prevalence varied meaningfully over time, aligning with changes in the pandemic context. Themes were generated (excluding write-up) with 20 h combined researcher time.
Conclusions
MATA shows promise as a resource-saving method for thematic analysis of large qualitative datasets whilst maintaining researcher control and insight. Findings show the interconnection between health behaviours, physical health and wellbeing over the pandemic, and the influence of control and reflective processes.
期刊介绍:
The focus of the British Journal of Health Psychology is to publish original research on various aspects of psychology that are related to health, health-related behavior, and illness throughout a person's life. The journal specifically seeks articles that are based on health psychology theory or discuss theoretical matters within the field.