Natural Language Processing Enhanced Qualitative Methods: An Opportunity to Improve Health Outcomes

IF 3.9 2区 社会学 Q1 SOCIAL SCIENCES, INTERDISCIPLINARY
R. David Parker, Karen Mancini, Marissa D. Abram
{"title":"Natural Language Processing Enhanced Qualitative Methods: An Opportunity to Improve Health Outcomes","authors":"R. David Parker, Karen Mancini, Marissa D. Abram","doi":"10.1177/16094069231214144","DOIUrl":null,"url":null,"abstract":"Background Electronic health systems contain large amounts of unstructured data (UD) which are often unanalyzed due to the time and costs involved. Unanalyzed data creates missed opportunities to improve health outcomes. Natural language processing (NLP) is the foundation of generative artificial intelligence (GAI), which is the basis for large language models, such as ChatGPT. NLP and GAI are machine learning methods that analyze large amounts of data in a short time at minimal cost. The ability of NLP to conduct qualitative analyses is increasing, yet the results can lack context and nuance in their findings, requiring human intervention. Methods Our study compared outcomes, time, and costs of a previously published qualitative study. Our approach partnered an NLP model and a qualitative researcher (NLP+). UD from behavioral health patients were analyzed using NLP and a Latent Dirichlet allocation to identify the topics using probability of word coherence scores. The topics were then analyzed by a qualitative researcher, translated into themes, and compared with the original findings. Results The NLP + method results aligned with the original, qualitative derived themes. Our model also identified two additional themes which were not originally detected. The NLP + method required 6 hours of labor, 3 minutes for transcription, and a transcription cost of $1.17. The original, qualitative researcher only method required more than 36 hours ($2,250) of time and $1,100 for transcription. Conclusions While natural language processing analyzes voluminous amounts of data in seconds, context and nuance in human language are regularly missed. Combining a qualitative researcher with NLP + could be deployed in many settings, reducing time and costs, and improving context. Until large language models are more prevalent, a human interaction can help translate the patient experience by contextualizing data rich in social determinant indicators which may otherwise go unanalyzed.","PeriodicalId":48220,"journal":{"name":"International Journal of Qualitative Methods","volume":"27 1","pages":"0"},"PeriodicalIF":3.9000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Qualitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/16094069231214144","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL SCIENCES, INTERDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Background Electronic health systems contain large amounts of unstructured data (UD) which are often unanalyzed due to the time and costs involved. Unanalyzed data creates missed opportunities to improve health outcomes. Natural language processing (NLP) is the foundation of generative artificial intelligence (GAI), which is the basis for large language models, such as ChatGPT. NLP and GAI are machine learning methods that analyze large amounts of data in a short time at minimal cost. The ability of NLP to conduct qualitative analyses is increasing, yet the results can lack context and nuance in their findings, requiring human intervention. Methods Our study compared outcomes, time, and costs of a previously published qualitative study. Our approach partnered an NLP model and a qualitative researcher (NLP+). UD from behavioral health patients were analyzed using NLP and a Latent Dirichlet allocation to identify the topics using probability of word coherence scores. The topics were then analyzed by a qualitative researcher, translated into themes, and compared with the original findings. Results The NLP + method results aligned with the original, qualitative derived themes. Our model also identified two additional themes which were not originally detected. The NLP + method required 6 hours of labor, 3 minutes for transcription, and a transcription cost of $1.17. The original, qualitative researcher only method required more than 36 hours ($2,250) of time and $1,100 for transcription. Conclusions While natural language processing analyzes voluminous amounts of data in seconds, context and nuance in human language are regularly missed. Combining a qualitative researcher with NLP + could be deployed in many settings, reducing time and costs, and improving context. Until large language models are more prevalent, a human interaction can help translate the patient experience by contextualizing data rich in social determinant indicators which may otherwise go unanalyzed.
自然语言处理增强定性方法:改善健康结果的机会
电子卫生系统包含大量的非结构化数据(UD),由于涉及时间和成本,这些数据通常无法进行分析。未经分析的数据导致错失改善健康结果的机会。自然语言处理(NLP)是生成式人工智能(GAI)的基础,GAI是ChatGPT等大型语言模型的基础。NLP和GAI是机器学习方法,可以在短时间内以最小的成本分析大量数据。NLP进行定性分析的能力正在增强,但结果可能缺乏背景和细微差别,需要人工干预。方法本研究比较了先前发表的一项定性研究的结果、时间和成本。我们的方法与NLP模型和定性研究人员(NLP+)合作。使用NLP和Latent Dirichlet分配来分析行为健康患者的UD,以使用单词连贯得分的概率来识别主题。然后由定性研究人员分析这些主题,将其转化为主题,并与原始研究结果进行比较。结果NLP +方法的结果与原始的定性衍生主题一致。我们的模型还确定了最初未检测到的另外两个主题。NLP +方法需要6小时的人工,3分钟的转录时间,转录成本为1.17美元。最初的定性研究方法需要超过36小时(2250美元)的时间和1100美元的转录费用。当自然语言处理在几秒钟内分析大量数据时,人类语言中的上下文和细微差别经常被遗漏。将定性研究人员与NLP +相结合可以在许多环境中部署,减少时间和成本,并改善环境。在大型语言模型更加普遍之前,人类互动可以通过将丰富的社会决定指标数据置于背景中来帮助翻译患者体验,否则这些数据可能无法分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Qualitative Methods
International Journal of Qualitative Methods SOCIAL SCIENCES, INTERDISCIPLINARY-
CiteScore
6.90
自引率
11.10%
发文量
139
审稿时长
12 weeks
期刊介绍: Journal Highlights Impact Factor: 5.4 Ranked 5/110 in Social Sciences, Interdisciplinary – SSCI Indexed In: Clarivate Analytics: Social Science Citation Index, the Directory of Open Access Journals (DOAJ), and Scopus Launched In: 2002 Publication is subject to payment of an article processing charge (APC) Submit here International Journal of Qualitative Methods (IJQM) is a peer-reviewed open access journal which focuses on methodological advances, innovations, and insights in qualitative or mixed methods studies. Please see the Aims and Scope tab for further information.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信