Artificial Intelligence to Support Qualitative Data Analysis: Promises, Approaches, Pitfalls.

IF 5.3 2区教育学 Q1 EDUCATION, SCIENTIFIC DISCIPLINES

Academic Medicine Pub Date : 2025-06-24 DOI:10.1097/ACM.0000000000006134

David A Cook, Shiphra Ginsburg, Adam P Sawatsky, Ayelet Kuper, Jonathan D D'Angelo

{"title":"Artificial Intelligence to Support Qualitative Data Analysis: Promises, Approaches, Pitfalls.","authors":"David A Cook, Shiphra Ginsburg, Adam P Sawatsky, Ayelet Kuper, Jonathan D D'Angelo","doi":"10.1097/ACM.0000000000006134","DOIUrl":null,"url":null,"abstract":"<p><strong>Abstract: </strong>How can artificial intelligence (AI) be used to support qualitative data analysis (QDA)? To address this question, the authors conducted 3 scholarly activities. First, they used a readily available large language model, ChatGPT-4, to analyze 3 existing narrative datasets (February 2024). ChatGPT generated accurate brief summaries; for all other attempted tasks the initial prompt failed to produce desired results. After iterative prompt engineering, some tasks (e.g., keyword counting, summarization) were successful, whereas others (e.g., thematic analysis, keyword highlighting, word tree diagram, cross-theme insights) never generated satisfactory results. Second, the authors conducted a brief scoping review of AI-supported QDA (through May 2024). They identified 130 articles (104 original research, 26 nonresearch) of which 64 were published in 2023 or 2024. Seventy studies inductively analyzed data for themes, 39 used keyword detection, 30 applied a coding rubric, 28 used sentiment analysis, and 13 applied discourse analysis. Seventy-five used unsupervised learning (e.g., transformers, other neural networks). Third, building on these experiences and drawing from additional literature, the authors examined the potential capabilities, shortcomings, dangers, and ethical repercussions of AI-supported QDA. They note that AI has been used for QDA for more than 25 years. AI-supported QDA approaches include inductive and deductive coding, thematic analysis, computational grounded theory, discourse analysis, analysis of large datasets, preanalysis transcription and translation, and suggestions for study planning and interpretation. Concerns include the imperative of a \"human in the loop\" for data collection and analysis, the need for researchers to understand the technology, the risk of unsophisticated analyses, inevitable influences on workforce, and apprehensions regarding data privacy and security. Reflexivity should embrace both strengths and weaknesses of AI-supported QDA. The authors conclude that AI has a long history of supporting QDA through widely varied methods. Evolving technologies make AI-supported QDA more accessible and introduce both promises and pitfalls.</p>","PeriodicalId":50929,"journal":{"name":"Academic Medicine","volume":" ","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Academic Medicine","FirstCategoryId":"95","ListUrlMain":"https://doi.org/10.1097/ACM.0000000000006134","RegionNum":2,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION, SCIENTIFIC DISCIPLINES","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract: How can artificial intelligence (AI) be used to support qualitative data analysis (QDA)? To address this question, the authors conducted 3 scholarly activities. First, they used a readily available large language model, ChatGPT-4, to analyze 3 existing narrative datasets (February 2024). ChatGPT generated accurate brief summaries; for all other attempted tasks the initial prompt failed to produce desired results. After iterative prompt engineering, some tasks (e.g., keyword counting, summarization) were successful, whereas others (e.g., thematic analysis, keyword highlighting, word tree diagram, cross-theme insights) never generated satisfactory results. Second, the authors conducted a brief scoping review of AI-supported QDA (through May 2024). They identified 130 articles (104 original research, 26 nonresearch) of which 64 were published in 2023 or 2024. Seventy studies inductively analyzed data for themes, 39 used keyword detection, 30 applied a coding rubric, 28 used sentiment analysis, and 13 applied discourse analysis. Seventy-five used unsupervised learning (e.g., transformers, other neural networks). Third, building on these experiences and drawing from additional literature, the authors examined the potential capabilities, shortcomings, dangers, and ethical repercussions of AI-supported QDA. They note that AI has been used for QDA for more than 25 years. AI-supported QDA approaches include inductive and deductive coding, thematic analysis, computational grounded theory, discourse analysis, analysis of large datasets, preanalysis transcription and translation, and suggestions for study planning and interpretation. Concerns include the imperative of a "human in the loop" for data collection and analysis, the need for researchers to understand the technology, the risk of unsophisticated analyses, inevitable influences on workforce, and apprehensions regarding data privacy and security. Reflexivity should embrace both strengths and weaknesses of AI-supported QDA. The authors conclude that AI has a long history of supporting QDA through widely varied methods. Evolving technologies make AI-supported QDA more accessible and introduce both promises and pitfalls.

查看原文本刊更多论文

支持定性数据分析的人工智能：承诺、方法和缺陷。

摘要：人工智能如何支持定性数据分析（QDA）？为了解决这个问题，作者进行了3次学术活动。首先，他们使用了一个现成的大型语言模型，ChatGPT-4，来分析3个现有的叙事数据集（2024年2月）。ChatGPT生成准确的简短摘要；对于所有其他尝试的任务，初始提示无法产生期望的结果。在迭代的提示工程之后，一些任务（如关键字计数、总结）是成功的，而另一些任务（如主题分析、关键字高亮、词树图、跨主题洞察）则没有产生令人满意的结果。其次，作者对人工智能支持的QDA进行了简要的范围审查（到2024年5月）。他们确定了130篇文章（104篇原创研究，26篇非研究），其中64篇发表于2023年或2024年。70项研究对主题数据进行归纳分析，39项研究使用关键字检测，30项研究使用编码标题，28项研究使用情感分析，13项研究使用话语分析。75个使用无监督学习（例如，变压器，其他神经网络）。第三，基于这些经验并借鉴其他文献，作者研究了人工智能支持的QDA的潜在能力、缺点、危险和伦理影响。他们指出，人工智能用于QDA已经超过25年了。人工智能支持的QDA方法包括归纳和演绎编码、主题分析、计算基础理论、话语分析、大数据集分析、预分析转录和翻译，以及学习计划和解释建议。人们的担忧包括数据收集和分析的“人在循环中”的必要性，研究人员了解技术的必要性，简单分析的风险，对劳动力的不可避免的影响，以及对数据隐私和安全的担忧。反身性应该包括人工智能支持的QDA的优点和缺点。作者得出结论，人工智能通过各种各样的方法支持QDA有着悠久的历史。不断发展的技术使人工智能支持的QDA更容易获得，并引入了承诺和陷阱。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Academic Medicine 医学-卫生保健

CiteScore

7.80

自引率

9.50%

发文量

982

审稿时长

3-6 weeks

期刊介绍： Academic Medicine, the official peer-reviewed journal of the Association of American Medical Colleges, acts as an international forum for exchanging ideas, information, and strategies to address the significant challenges in academic medicine. The journal covers areas such as research, education, clinical care, community collaboration, and leadership, with a commitment to serving the public interest.