Experimenting With the New Frontier: Artificial Intelligence-Powered Chat Bots in Hand Surgery.

HAND (Impact Factor 1.8, JCR Q2, Orthopedics)
Pub Date: 2025-07-01 · Epub Date: 2024-03-25 · DOI: 10.1177/15589447241238372
Zayd M Al Rawi, Benjamin J Kirby, Peter A Albrecht, Julia A V Nuelle, Daniel A London
Journal: HAND, pages 795-800. Publication type: Journal Article. Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11571578/pdf/
Citations: 0

Abstract

Background: Increased utilization of artificial intelligence (AI)-driven search and large language models by the lay and medical community requires us to evaluate the accuracy of AI responses to common hand surgery questions. We hypothesized that the answers to most hand surgery questions posed to an AI large language model would be correct.

Methods: Using the topics covered in Green's Operative Hand Surgery 8th Edition as a guide, 56 hand surgery questions were compiled and posed to ChatGPT (OpenAI, San Francisco, CA). Two attending hand surgeons then independently reviewed ChatGPT's answers for response accuracy, completeness, and usefulness. A Cohen's kappa analysis was performed to assess interrater agreement.

Results: An average of 45 of the 56 questions posed to ChatGPT were deemed correct (80%), 39 responses were deemed useful (70%), and 32 responses were deemed complete (57%) by the reviewers. Kappa analysis demonstrated "fair to moderate" agreement between the two raters. Reviewers disagreed on 11 questions regarding correctness, 16 questions regarding usefulness, and 19 questions regarding completeness.

Conclusions: Large language models have the potential to both positively and negatively impact patient perceptions and guide referral patterns based on the accuracy, completeness, and usefulness of their responses. While most responses fit these criteria, more precise responses are needed to ensure patient safety and avoid misinformation. Individual hand surgeons and surgical societies must understand these technologies and interface with the companies developing them to provide our patients with the best possible care.
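The interrater analysis named in the Methods can be illustrated with a minimal sketch of Cohen's kappa for two raters. The ratings below are hypothetical and are not the study's data; the function names `cohens_kappa` and `landis_koch_label` are illustrative, and the use of the Landis and Koch descriptive bands is an assumption, since the abstract reports "fair to moderate" agreement without naming the scale it used.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who labelled the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed agreement: share of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1 - expected)


def landis_koch_label(kappa):
    """Descriptive band for kappa (Landis and Koch scale, assumed here)."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"),
                         (0.60, "moderate"), (0.80, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical ratings for 10 answers (1 = deemed correct, 0 = not correct).
rater_a = [1, 1, 1, 1, 1, 1, 1, 0, 0, 0]
rater_b = [1, 1, 1, 1, 1, 1, 0, 1, 0, 0]

kappa = cohens_kappa(rater_a, rater_b)
print(f"kappa = {kappa:.3f} ({landis_koch_label(kappa)})")
```

In this made-up example the raters agree on 8 of 10 answers (observed agreement 0.80), while their marginal frequencies give a chance agreement of 0.58, so kappa = (0.80 - 0.58) / (1 - 0.58) ≈ 0.52. This shows why raw agreement can look high while kappa stays only moderate when most items fall into one category.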

Source journal
HAND (Medicine-Surgery)
CiteScore: 3.30
Self-citation rate: 0.00%
Articles published: 209
Journal description: HAND is the official journal of the American Association for Hand Surgery and is a peer-reviewed journal featuring articles written by clinicians worldwide presenting current research and clinical work in the field of hand surgery. It features articles related to all aspects of hand and upper extremity surgery and the postoperative care and rehabilitation of the hand.