Shahabeddin Abhari, Yasna Afshari, Farhad Fatehi, Hosna Salmani, Ali Garavand, Dmytro Chumachenko, Somayyeh Zakerabasali, Plinio P Morita
{"title":"Exploring ChatGPT in clinical inquiry: a scoping review of characteristics, applications, challenges, and evaluation.","authors":"Shahabeddin Abhari, Yasna Afshari, Farhad Fatehi, Hosna Salmani, Ali Garavand, Dmytro Chumachenko, Somayyeh Zakerabasali, Plinio P Morita","doi":"10.1097/MS9.0000000000002716","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Recent advancements in generative AI, exemplified by ChatGPT, hold promise for healthcare applications such as decision-making support, education, and patient engagement. However, rigorous evaluation is crucial to ensure reliability and safety in clinical contexts. This scoping review explores ChatGPT's role in clinical inquiry, focusing on its characteristics, applications, challenges, and evaluation.</p><p><strong>Methods: </strong>This review, conducted in 2023, followed PRISMA-ScR guidelines (Supplemental Digital Content 1, http://links.lww.com/MS9/A636). Searches were performed across PubMed, Scopus, IEEE, Web of Science, Cochrane, and Google Scholar using relevant keywords. The review explored ChatGPT's effectiveness in various medical domains, evaluation methods, target users, and comparisons with other AI models. Data synthesis and analysis incorporated both quantitative and qualitative approaches.</p><p><strong>Results: </strong>Analysis of 41 academic studies highlights ChatGPT's potential in medical education, patient care, and decision support, though performance varies by medical specialty and linguistic context. GPT-3.5, frequently referenced in 26 studies, demonstrated adaptability across diverse scenarios. Challenges include limited access to official answer keys and inconsistent performance, underscoring the need for ongoing refinement. Evaluation methods, including expert comparisons and statistical analyses, provided significant insights into ChatGPT's efficacy. The identification of target users, such as medical educators and nonexpert clinicians, illustrates its broad applicability.</p><p><strong>Conclusion: </strong>ChatGPT shows significant potential in enhancing clinical practice and medical education. Nevertheless, continuous refinement is essential for its successful integration into healthcare, aiming to improve patient care outcomes, and address the evolving needs of the medical community.</p>","PeriodicalId":8025,"journal":{"name":"Annals of Medicine and Surgery","volume":"86 12","pages":"7094-7104"},"PeriodicalIF":1.7000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623824/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Medicine and Surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/MS9.0000000000002716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Recent advancements in generative AI, exemplified by ChatGPT, hold promise for healthcare applications such as decision-making support, education, and patient engagement. However, rigorous evaluation is crucial to ensure reliability and safety in clinical contexts. This scoping review explores ChatGPT's role in clinical inquiry, focusing on its characteristics, applications, challenges, and evaluation.
Methods: This review, conducted in 2023, followed PRISMA-ScR guidelines (Supplemental Digital Content 1, http://links.lww.com/MS9/A636). Searches were performed across PubMed, Scopus, IEEE, Web of Science, Cochrane, and Google Scholar using relevant keywords. The review explored ChatGPT's effectiveness in various medical domains, evaluation methods, target users, and comparisons with other AI models. Data synthesis and analysis incorporated both quantitative and qualitative approaches.
Results: Analysis of 41 academic studies highlights ChatGPT's potential in medical education, patient care, and decision support, though performance varies by medical specialty and linguistic context. GPT-3.5, frequently referenced in 26 studies, demonstrated adaptability across diverse scenarios. Challenges include limited access to official answer keys and inconsistent performance, underscoring the need for ongoing refinement. Evaluation methods, including expert comparisons and statistical analyses, provided significant insights into ChatGPT's efficacy. The identification of target users, such as medical educators and nonexpert clinicians, illustrates its broad applicability.
Conclusion: ChatGPT shows significant potential in enhancing clinical practice and medical education. Nevertheless, continuous refinement is essential for its successful integration into healthcare, aiming to improve patient care outcomes, and address the evolving needs of the medical community.
简介:以ChatGPT为例,生成式人工智能的最新进展为医疗保健应用(如决策支持、教育和患者参与)带来了希望。然而,严格的评估对于确保临床环境中的可靠性和安全性至关重要。这篇综述探讨了ChatGPT在临床研究中的作用,重点是它的特点、应用、挑战和评估。方法:本综述于2023年进行,遵循PRISMA-ScR指南(补充数字内容1,http://links.lww.com/MS9/A636)。使用相关关键词在PubMed、Scopus、IEEE、Web of Science、Cochrane和b谷歌Scholar上进行搜索。该综述探讨了ChatGPT在各种医疗领域、评估方法、目标用户以及与其他人工智能模型的比较中的有效性。数据综合和分析结合了定量和定性方法。结果:对41项学术研究的分析强调了ChatGPT在医学教育、患者护理和决策支持方面的潜力,尽管其表现因医学专业和语言背景而异。GPT-3.5在26项研究中经常被引用,显示出在不同情况下的适应性。挑战包括访问官方答案密钥的受限和不一致的性能,强调需要不断改进。评估方法,包括专家比较和统计分析,为ChatGPT的疗效提供了重要的见解。确定目标用户,如医学教育工作者和非专家临床医生,说明了其广泛的适用性。结论:ChatGPT在加强临床实践和医学教育方面具有重要的应用潜力。然而,持续改进对于其成功整合到医疗保健中至关重要,旨在改善患者护理结果,并满足医学界不断变化的需求。