Natural Language Processing for Digital Health in the Era of Large Language Models.

Yearbook of Medical Informatics | Pub Date: 2024-08-01 | Epub Date: 2025-04-08 | DOI: 10.1055/s-0044-1800750
Abeed Sarker, Rui Zhang, Yanshan Wang, Yunyu Xiao, Sudeshna Das, Dalton Schutte, David Oniani, Qianqian Xie, Hua Xu
{"title":"大语言模型时代数字健康的自然语言处理。","authors":"Abeed Sarker, Rui Zhang, Yanshan Wang, Yunyu Xiao, Sudeshna Das, Dalton Schutte, David Oniani, Qianqian Xie, Hua Xu","doi":"10.1055/s-0044-1800750","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Large language models (LLMs) are revolutionizing the natural language pro-cessing (NLP) landscape within healthcare, prompting the need to synthesize the latest ad-vancements and their diverse medical applications. We attempt to summarize the current state of research in this rapidly evolving space.</p><p><strong>Methods: </strong>We conducted a review of the most recent studies on biomedical NLP facilitated by LLMs, sourcing literature from PubMed, the Association for Computational Linguistics Anthology, IEEE Explore, and Google Scholar (the latter particularly for preprints). Given the ongoing exponential growth in LLM-related publications, our survey was inherently selective. We attempted to abstract key findings in terms of (i) LLMs customized for medical texts, and (ii) the type of medical text being leveraged by LLMs, namely medical literature, electronic health records (EHRs), and social media. In addition to technical details, we touch upon topics such as privacy, bias, interpretability, and equitability.</p><p><strong>Results: </strong>We observed that while general-purpose LLMs (e.g., GPT-4) are most popular, there is a growing trend in training or customizing open-source LLMs for specific biomedi-cal texts and tasks. Several promising open-source LLMs are currently available, and appli-cations involving EHRs and biomedical literature are more prominent relative to noisier data sources such as social media. For supervised classification and named entity recogni-tion tasks, traditional (encoder only) transformer-based models still outperform new-age LLMs, and the latter are typically suited for few-shot settings and generative tasks such as summarization. There is still a paucity of research on evaluation, bias, privacy, reproduci-bility, and equitability of LLMs.</p><p><strong>Conclusions: </strong>LLMs have the potential to transform NLP tasks within the broader medical domain. While technical progress continues, biomedical application focused research must prioritize aspects not necessarily related to performance such as task-oriented evaluation, bias, and equitable use.</p>","PeriodicalId":40027,"journal":{"name":"Yearbook of medical informatics","volume":"33 1","pages":"229-240"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020548/pdf/","citationCount":"0","resultStr":"{\"title\":\"Natural Language Processing for Digital Health in the Era of Large Language Models.\",\"authors\":\"Abeed Sarker, Rui Zhang, Yanshan Wang, Yunyu Xiao, Sudeshna Das, Dalton Schutte, David Oniani, Qianqian Xie, Hua Xu\",\"doi\":\"10.1055/s-0044-1800750\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Large language models (LLMs) are revolutionizing the natural language pro-cessing (NLP) landscape within healthcare, prompting the need to synthesize the latest ad-vancements and their diverse medical applications. 
We attempt to summarize the current state of research in this rapidly evolving space.</p><p><strong>Methods: </strong>We conducted a review of the most recent studies on biomedical NLP facilitated by LLMs, sourcing literature from PubMed, the Association for Computational Linguistics Anthology, IEEE Explore, and Google Scholar (the latter particularly for preprints). Given the ongoing exponential growth in LLM-related publications, our survey was inherently selective. We attempted to abstract key findings in terms of (i) LLMs customized for medical texts, and (ii) the type of medical text being leveraged by LLMs, namely medical literature, electronic health records (EHRs), and social media. In addition to technical details, we touch upon topics such as privacy, bias, interpretability, and equitability.</p><p><strong>Results: </strong>We observed that while general-purpose LLMs (e.g., GPT-4) are most popular, there is a growing trend in training or customizing open-source LLMs for specific biomedi-cal texts and tasks. Several promising open-source LLMs are currently available, and appli-cations involving EHRs and biomedical literature are more prominent relative to noisier data sources such as social media. For supervised classification and named entity recogni-tion tasks, traditional (encoder only) transformer-based models still outperform new-age LLMs, and the latter are typically suited for few-shot settings and generative tasks such as summarization. There is still a paucity of research on evaluation, bias, privacy, reproduci-bility, and equitability of LLMs.</p><p><strong>Conclusions: </strong>LLMs have the potential to transform NLP tasks within the broader medical domain. While technical progress continues, biomedical application focused research must prioritize aspects not necessarily related to performance such as task-oriented evaluation, bias, and equitable use.</p>\",\"PeriodicalId\":40027,\"journal\":{\"name\":\"Yearbook of medical informatics\",\"volume\":\"33 1\",\"pages\":\"229-240\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020548/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Yearbook of medical informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1055/s-0044-1800750\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/8 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Yearbook of medical informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1055/s-0044-1800750","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/8 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract


Objectives: Large language models (LLMs) are revolutionizing the natural language processing (NLP) landscape within healthcare, prompting the need to synthesize the latest advancements and their diverse medical applications. We attempt to summarize the current state of research in this rapidly evolving space.

Methods: We conducted a review of the most recent studies on biomedical NLP facilitated by LLMs, sourcing literature from PubMed, the Association for Computational Linguistics Anthology, IEEE Xplore, and Google Scholar (the latter particularly for preprints). Given the ongoing exponential growth in LLM-related publications, our survey was inherently selective. We attempted to abstract key findings in terms of (i) LLMs customized for medical texts, and (ii) the type of medical text being leveraged by LLMs, namely medical literature, electronic health records (EHRs), and social media. In addition to technical details, we touch upon topics such as privacy, bias, interpretability, and equitability.
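The review does not publish its exact search strings, but as a hypothetical illustration of how such a literature search might be scripted, the sketch below queries PubMed through NCBI's E-utilities using Biopython's Entrez module; the query term, result limit, and e-mail address are all illustrative assumptions rather than the authors' actual search strategy.

```python
from Bio import Entrez

# NCBI requires a contact e-mail with every E-utilities request (placeholder here).
Entrez.email = "you@example.com"

# Illustrative query only; not the search strategy used in the review.
query = (
    '("large language model"[Title/Abstract] OR LLM[Title/Abstract]) '
    'AND "natural language processing"[Title/Abstract]'
)

# Search PubMed for matching records, newest first.
handle = Entrez.esearch(db="pubmed", term=query, retmax=50, sort="date")
record = Entrez.read(handle)
handle.close()
pmids = record["IdList"]

# Retrieve plain-text abstracts for the matching PMIDs.
handle = Entrez.efetch(db="pubmed", id=",".join(pmids), rettype="abstract", retmode="text")
print(handle.read())
handle.close()
```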

Results: We observed that while general-purpose LLMs (e.g., GPT-4) are most popular, there is a growing trend in training or customizing open-source LLMs for specific biomedical texts and tasks. Several promising open-source LLMs are currently available, and applications involving EHRs and biomedical literature are more prominent relative to noisier data sources such as social media. For supervised classification and named entity recognition tasks, traditional (encoder-only) transformer-based models still outperform new-age LLMs, and the latter are typically suited for few-shot settings and generative tasks such as summarization. There is still a paucity of research on evaluation, bias, privacy, reproducibility, and equitability of LLMs.
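To make the encoder-only versus generative distinction concrete, here is a minimal sketch, assuming the Hugging Face transformers library: a fine-tuned encoder-only checkpoint handles named entity recognition as supervised token classification, while a generative LLM is driven with a few-shot prompt for summarization. The model checkpoint and clinical snippets are hypothetical and are not taken from the paper.

```python
from transformers import pipeline

# (i) Supervised NER with an encoder-only transformer (the setting where such
# models still outperform generative LLMs, per the review). The checkpoint is
# an illustrative general-domain NER model, not one named in the paper.
ner = pipeline(
    "token-classification",
    model="dslim/bert-base-NER",
    aggregation_strategy="simple",
)
print(ner("Patient was started on metformin 500 mg for type 2 diabetes."))

# (ii) Generative LLM used few-shot for summarization: assemble a prompt with a
# couple of worked examples and send it to whichever LLM endpoint is in use
# (e.g., GPT-4 or an open-source instruction-tuned model).
few_shot_prompt = """Summarize the clinical note in one sentence.

Note: 67-year-old male admitted with chest pain; troponin negative; discharged on aspirin.
Summary: Older man with non-cardiac chest pain, discharged on aspirin.

Note: 45-year-old female with poorly controlled type 2 diabetes; metformin dose increased.
Summary: Middle-aged woman with uncontrolled diabetes; metformin escalated.

Note: {note}
Summary:"""
print(few_shot_prompt.format(note="82-year-old admitted with pneumonia, treated with ceftriaxone, improving."))
```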

Conclusions: LLMs have the potential to transform NLP tasks within the broader medical domain. While technical progress continues, biomedical application-focused research must prioritize aspects not necessarily related to performance, such as task-oriented evaluation, bias, and equitable use.

Source journal
Yearbook of Medical Informatics (Medicine, all)
CiteScore: 4.10
Self-citation rate: 0.00%
Articles per year: 20
About the journal: Published by the International Medical Informatics Association, this annual publication includes the best papers in medical informatics from around the world.