Consensus on the Potential of Large Language Models in Healthcare: Insights from a Delphi Survey in Korea.


Healthcare Informatics Research, 31(2), 146-155. Pub Date: 2025-04-01. Epub Date: 2025-04-30. DOI: 10.4258/hir.2025.31.2.146

Ah-Ram Sul, Seihee Kim


Abstract

Objectives: Given the rapidly growing expectations for large language models (LLMs) in healthcare, this study systematically collected perspectives from Korean experts on the potential benefits and risks of LLMs, aiming to promote their safe and effective utilization.

Methods: A web-based mini-Delphi survey was conducted from August 27 to October 14, 2024, with 20 selected panelists. The expert questionnaire comprised 84 judgment items across five domains: potential applications, benefits, risks, reliability requirements, and safe usage. These items were developed through a literature review and expert consultation. Participants rated their agreement or perceived importance on a 5-point scale. Items meeting predefined thresholds (content validity ratio ≥0.49, degree of convergence ≤0.50, and degree of consensus ≥0.75) were prioritized.
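The three thresholds can be illustrated with a small sketch. The abstract gives only the cut-off values, not the formulas, so the definitions below are assumptions based on common Delphi practice: Lawshe's content validity ratio (with ratings of 4 or 5 counted as agreement), degree of convergence as half the interquartile range, and degree of consensus as one minus the interquartile range divided by the median. The function names and the sample ratings are hypothetical.

```python
from statistics import quantiles

# Thresholds as stated in the abstract.
CVR_MIN = 0.49
CONVERGENCE_MAX = 0.50
CONSENSUS_MIN = 0.75


def content_validity_ratio(ratings, agree_min=4):
    """Lawshe's CVR (assumed formula): (n_agree - N/2) / (N/2).

    Counting ratings >= 4 on the 5-point scale as agreement is an assumption.
    """
    n = len(ratings)
    n_agree = sum(r >= agree_min for r in ratings)
    return (n_agree - n / 2) / (n / 2)


def quartiles(ratings):
    """Return (Q1, median, Q3) using inclusive linear interpolation."""
    q1, med, q3 = quantiles(ratings, n=4, method="inclusive")
    return q1, med, q3


def degree_of_convergence(ratings):
    """Assumed formula: (Q3 - Q1) / 2; smaller means tighter agreement."""
    q1, _, q3 = quartiles(ratings)
    return (q3 - q1) / 2


def degree_of_consensus(ratings):
    """Assumed formula: 1 - (Q3 - Q1) / median; larger means stronger consensus."""
    q1, med, q3 = quartiles(ratings)
    return 1 - (q3 - q1) / med


def meets_thresholds(ratings):
    """True when an item passes all three cut-offs from the abstract."""
    return (
        content_validity_ratio(ratings) >= CVR_MIN
        and degree_of_convergence(ratings) <= CONVERGENCE_MAX
        and degree_of_consensus(ratings) >= CONSENSUS_MIN
    )


# Hypothetical ratings from 16 panelists for a single item.
example_item = [5] * 10 + [4] * 5 + [3]
print(content_validity_ratio(example_item))
print(degree_of_convergence(example_item))
print(degree_of_consensus(example_item))
print(meets_thresholds(example_item))
```

Other quartile conventions would give slightly different values near the cut-offs, so the choice of `method="inclusive"` is itself an assumption rather than something the study reports.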

Results: Seventeen participants (85%) responded to the first round, and 16 participants (80%) completed the second round. Consensus was achieved on several potential applications, benefits, and reliability requirements for the use of LLMs in healthcare. However, significant heterogeneity was found regarding perceptions of associated risks and criteria for safe usage of LLMs. Of the 84 total items, 52 met the criteria for statistical validity, confirming the diversity of expert opinions.
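As a quick arithmetic check, the reported response rates follow from the 20-member panel, and the same calculation gives the share of items that met the criteria, a figure the abstract does not state directly. The helper name `percent` is hypothetical.

```python
PANEL_SIZE = 20   # selected panelists
TOTAL_ITEMS = 84  # judgment items in the questionnaire


def percent(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


round1_rate = percent(17, PANEL_SIZE)       # first-round respondents
round2_rate = percent(16, PANEL_SIZE)       # second-round completers
valid_item_share = percent(52, TOTAL_ITEMS) # items meeting the criteria

print(round1_rate, round2_rate, valid_item_share)
```

Under this calculation roughly 62% of the items met the criteria, leaving 32 of the 84 that did not.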

Conclusions: Experts reached a consensus on certain aspects of LLM utilization in healthcare. Nonetheless, notable differences remained concerning risks and requirements for safe implementation, highlighting the need for further investigation. This study provides foundational insights to guide future research and inform policy development for the responsible introduction of LLMs into the healthcare field.

