Evaluation of a Large Language Model's Ability to Assist in an Orthopedic Hand Clinic.

Impact factor: 1.8 (JCR Q2, Orthopedics)
HAND. Publication date: 2025-09-01. Epub date: 2024-06-22. DOI: 10.1177/15589447241257643
Travis Kotzur, Aaron Singh, John Parker, Blaire Peterson, Brian Sager, Ryan Rose, Fred Corley, Christina Brady
HAND, pages 900-909. Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11571334/pdf/
Citations: 0

Abstract

Background: Advancements in artificial intelligence technology, such as OpenAI's large language model, ChatGPT, could transform medicine through applications in a clinical setting. This study aimed to assess the utility of ChatGPT as a clinical assistant in an orthopedic hand clinic.

Methods: Nine clinical vignettes describing common and uncommon hand pathologies were constructed and reviewed by 4 fellowship-trained orthopedic hand surgeons and an orthopedic resident. ChatGPT was given these vignettes and asked to generate a differential diagnosis, propose a workup plan, and provide treatment options for its top differential. Responses were graded for accuracy, and their overall utility was scored on a 5-point Likert scale.

Results: ChatGPT correctly diagnosed 7 of 9 cases, an overall accuracy rate of 78%. It was less reliable with more complex pathologies and failed to identify an intentionally incorrect presentation. On the 5-point Likert scale, ChatGPT scored 3.8 ± 1.4 for correct diagnosis, 3.4 ± 1.4 for helpfulness in guiding patient management, 4.1 ± 1.0 for appropriate workup for the actual diagnosis, 4.3 ± 0.8 for an appropriate recommended treatment plan for the diagnosis, and 4.4 ± 0.8 for the helpfulness of treatment options in managing patients.
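
As a rough illustration of the arithmetic behind these figures, the sketch below recomputes the accuracy rate from the reported counts (7 of 9) and shows how a "mean ± SD" Likert summary could be produced. The function names are hypothetical, the example ratings are invented for illustration and are not the study's data, and the use of the sample standard deviation is an assumption, since the abstract does not state which form was used.

```python
from statistics import mean, stdev


def accuracy_percent(correct: int, total: int) -> float:
    """Return diagnostic accuracy as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * correct / total


def summarize_likert(ratings: list[int]) -> tuple[float, float]:
    """Return (mean, sample standard deviation) for 1-5 Likert ratings.

    Assumption: sample SD (n - 1 denominator); the abstract does not say.
    """
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("Likert ratings must lie between 1 and 5")
    return mean(ratings), stdev(ratings)


# Counts reported in the abstract: 7 correct diagnoses out of 9 vignettes.
acc = accuracy_percent(7, 9)
print(f"Diagnostic accuracy: {acc:.0f}%")

# HYPOTHETICAL ratings, for illustration only (not taken from the study).
example_ratings = [4, 5, 3, 4, 4]
m, sd = summarize_likert(example_ratings)
print(f"Example Likert summary: {m:.1f} ± {sd:.1f}")
```

With only nine vignettes, a single case changes the accuracy rate by about 11 percentage points, which is worth keeping in mind when reading the 78% figure.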

Conclusion: ChatGPT successfully diagnosed most of the conditions; however, the overall utility of its advice was variable. While it performed well in recommending treatments, it had difficulty providing appropriate diagnoses for uncommon pathologies. In addition, it failed to identify an obvious error in the presenting pathology.

Source journal
HAND (Medicine-Surgery)
CiteScore: 3.30
Self-citation rate: 0.00%
Articles published: 209
Journal description: HAND is the official journal of the American Association for Hand Surgery, a peer-reviewed journal featuring articles by clinicians worldwide that present current research and clinical work in hand surgery. It covers all aspects of hand and upper extremity surgery and the postoperative care and rehabilitation of the hand.