在孕产妇健康研究中用于专题分析的生成人工智能：使用大型语言模型编写半结构化访谈

IF 3.6 2区心理学 Q1 PSYCHOLOGY, APPLIED

Applied psychology. Health and well-being Pub Date : 2025-05-16 DOI:10.1111/aphw.70038

Shan Qiao, Xingyu Fang, Junbo Wang, Ran Zhang, Xiaoming Li, Yuhao Kang

{"title":"在孕产妇健康研究中用于专题分析的生成人工智能：使用大型语言模型编写半结构化访谈","authors":"Shan Qiao, Xingyu Fang, Junbo Wang, Ran Zhang, Xiaoming Li, Yuhao Kang","doi":"10.1111/aphw.70038","DOIUrl":null,"url":null,"abstract":"Study Objectives: The coding of semistructured interview transcripts is a critical step for thematic analysis of qualitative data. However, the coding process is often labor-intensive and time-consuming. The emergence of generative artificial intelligence (GenAI) presents new opportunities to enhance the efficiency of qualitative coding. This study proposed a computational pipeline using GenAI to automatically extract themes from interview transcripts. Methods: Using transcripts from interviews conducted with maternity care providers in South Carolina, we leveraged ChatGPT for inductive coding to generate codes from interview transcripts without a predetermined coding scheme. Structured prompts were designed to instruct ChatGPT to generate and summarize codes. The performance of GenAI was evaluated by comparing the AI-generated codes with those generated manually. Results: GenAI demonstrated promise in detecting and summarizing codes from interview transcripts. ChatGPT exhibited an overall accuracy exceeding 80% in inductive coding. More impressively, GenAI reduced the time required for coding by 81%. Discussion: GenAI models are capable of efficiently processing language datasets and performing multi-level semantic identification. However, challenges such as inaccuracy, systematic biases, and privacy concerns must be acknowledged and addressed. Future research should focus on refining these models to enhance reliability and address inherent limitations associated with their application in qualitative research.","PeriodicalId":8127,"journal":{"name":"Applied psychology. Health and well-being","volume":"17 3","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/aphw.70038","citationCount":"0","resultStr":"{\"title\":\"Generative AI for thematic analysis in a maternal health study: coding semistructured interviews using large language models\",\"authors\":\"Shan Qiao, Xingyu Fang, Junbo Wang, Ran Zhang, Xiaoming Li, Yuhao Kang\",\"doi\":\"10.1111/aphw.70038\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Study Objectives: The coding of semistructured interview transcripts is a critical step for thematic analysis of qualitative data. However, the coding process is often labor-intensive and time-consuming. The emergence of generative artificial intelligence (GenAI) presents new opportunities to enhance the efficiency of qualitative coding. This study proposed a computational pipeline using GenAI to automatically extract themes from interview transcripts. Methods: Using transcripts from interviews conducted with maternity care providers in South Carolina, we leveraged ChatGPT for inductive coding to generate codes from interview transcripts without a predetermined coding scheme. Structured prompts were designed to instruct ChatGPT to generate and summarize codes. The performance of GenAI was evaluated by comparing the AI-generated codes with those generated manually. Results: GenAI demonstrated promise in detecting and summarizing codes from interview transcripts. ChatGPT exhibited an overall accuracy exceeding 80% in inductive coding. More impressively, GenAI reduced the time required for coding by 81%. Discussion: GenAI models are capable of efficiently processing language datasets and performing multi-level semantic identification. However, challenges such as inaccuracy, systematic biases, and privacy concerns must be acknowledged and addressed. Future research should focus on refining these models to enhance reliability and address inherent limitations associated with their application in qualitative research.\",\"PeriodicalId\":8127,\"journal\":{\"name\":\"Applied psychology. Health and well-being\",\"volume\":\"17 3\",\"pages\":\"\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/aphw.70038\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied psychology. Health and well-being\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://iaap-journals.onlinelibrary.wiley.com/doi/10.1111/aphw.70038\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied psychology. Health and well-being","FirstCategoryId":"102","ListUrlMain":"https://iaap-journals.onlinelibrary.wiley.com/doi/10.1111/aphw.70038","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, APPLIED","Score":null,"Total":0}

引用次数: 0

摘要

研究目的：半结构化访谈记录的编码是对定性数据进行专题分析的关键步骤。然而，编码过程通常是劳动密集型和耗时的。生成式人工智能（GenAI）的出现为提高定性编码的效率提供了新的机会。本研究提出了一个使用GenAI的计算管道，用于从访谈记录中自动提取主题。方法：使用与南卡罗来纳的产科护理提供者进行的访谈记录，我们利用ChatGPT进行归纳编码，在没有预定编码方案的情况下从访谈记录中生成代码。结构化提示被设计用来指导ChatGPT生成和总结代码。通过比较人工智能生成的代码和人工生成的代码来评估GenAI的性能。结果：GenAI在从采访记录中检测和总结代码方面表现出了希望。ChatGPT在归纳编码中显示出超过80%的总体准确率。更令人印象深刻的是，GenAI将编码所需的时间减少了81%。讨论：GenAI模型能够有效地处理语言数据集并执行多层次语义识别。然而，诸如不准确、系统性偏见和隐私问题等挑战必须得到承认和解决。未来的研究应侧重于改进这些模型，以提高可靠性，并解决其在定性研究中应用的固有局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Generative AI for thematic analysis in a maternal health study: coding semistructured interviews using large language models

查看原文本刊更多论文

Generative AI for thematic analysis in a maternal health study: coding semistructured interviews using large language models

Study Objectives: The coding of semistructured interview transcripts is a critical step for thematic analysis of qualitative data. However, the coding process is often labor-intensive and time-consuming. The emergence of generative artificial intelligence (GenAI) presents new opportunities to enhance the efficiency of qualitative coding. This study proposed a computational pipeline using GenAI to automatically extract themes from interview transcripts. Methods: Using transcripts from interviews conducted with maternity care providers in South Carolina, we leveraged ChatGPT for inductive coding to generate codes from interview transcripts without a predetermined coding scheme. Structured prompts were designed to instruct ChatGPT to generate and summarize codes. The performance of GenAI was evaluated by comparing the AI-generated codes with those generated manually. Results: GenAI demonstrated promise in detecting and summarizing codes from interview transcripts. ChatGPT exhibited an overall accuracy exceeding 80% in inductive coding. More impressively, GenAI reduced the time required for coding by 81%. Discussion: GenAI models are capable of efficiently processing language datasets and performing multi-level semantic identification. However, challenges such as inaccuracy, systematic biases, and privacy concerns must be acknowledged and addressed. Future research should focus on refining these models to enhance reliability and address inherent limitations associated with their application in qualitative research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied psychology. Health and well-being PSYCHOLOGY, APPLIED-

CiteScore

12.10

自引率

2.90%

发文量

期刊介绍： Applied Psychology: Health and Well-Being is a triannual peer-reviewed academic journal published by Wiley-Blackwell on behalf of the International Association of Applied Psychology. It was established in 2009 and covers applied psychology topics such as clinical psychology, counseling, cross-cultural psychology, and environmental psychology.