Rachel N Rohrich, Karen R Li, Christian X Lava, Isabel Snee, Sami Alahmadi, Richard C Youn, John S Steinberg, Jayson M Atves, Christopher E Attinger, Karen K Evans
Title: Consulting the Digital Doctor: Efficacy of ChatGPT-3.5 in Answering Questions Related to Diabetic Foot Ulcer Care
DOI: 10.1097/ASW.0000000000000317
Journal: Advances in Skin & Wound Care (Q3, Dermatology; impact factor 1.7)
Publication date: 2025-06-18
Citations: 0
Abstract
Background: Diabetic foot ulcer (DFU) care is a challenge in reconstructive surgery. Artificial intelligence (AI) tools represent a new resource for patients with DFUs to seek information.
Objective: To evaluate the efficacy of ChatGPT-3.5 in responding to frequently asked questions related to DFU care.
Methods: Researchers posed 11 DFU care questions to ChatGPT-3.5 in December 2023. Questions were divided into topic categories of wound care, concerning symptoms, and surgical management. Four plastic surgeons in the authors' wound care department evaluated responses on a 10-point Likert-type scale for accuracy, comprehensiveness, and danger, in addition to providing qualitative feedback. Readability was assessed using 10 readability indexes.
Results: ChatGPT-3.5 answered questions with a mean accuracy of 8.7±0.3, comprehensiveness of 8.0±0.7, and danger of 2.2±0.6. ChatGPT-3.5 answered at a mean grade level of 11.9±1.8. Physician reviewers complimented the simplicity of the responses (n=11/11) and the AI's ability to provide general information (n=4/11). Three responses presented incorrect information, and the majority of responses (n=10/11) left out key information, such as deep vein thrombosis symptoms and comorbid conditions impacting limb salvage.
Conclusions: The researchers observed that ChatGPT-3.5 provided misinformation, omitted crucial details, and responded at nearly 4 grade levels higher than the American average. However, ChatGPT-3.5 was sufficient in its ability to provide general information, which may enable patients with DFUs to make more informed decisions and better engage in their care. Physicians must proactively address the potential benefits and limitations of AI.
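The abstract does not specify which 10 readability indexes were used to produce the mean grade level of 11.9. As a minimal sketch of how one common index works, the snippet below computes the Flesch-Kincaid grade level, defined as 0.39·(words/sentences) + 11.8·(syllables/words) − 15.59; the vowel-group syllable counter is a crude heuristic assumed here for illustration, not part of the study's methodology.

```python
import re


def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid grade level:
    0.39*(words/sentences) + 11.8*(syllables/words) - 15.59
    """
    # Count sentence-ending punctuation runs; assume at least one sentence.
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = re.findall(r"[A-Za-z']+", text)
    n_words = max(1, len(words))

    def syllables(word: str) -> int:
        # Crude heuristic: count contiguous vowel groups, minimum 1 per word.
        return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

    n_syllables = sum(syllables(w) for w in words)
    return 0.39 * (n_words / sentences) + 11.8 * (n_syllables / n_words) - 15.59
```

Short, common words yield a low (even negative) grade, while long polysyllabic clinical vocabulary, as in typical DFU counseling text, drives the score well above the roughly 8th-grade reading level of the average US adult.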
Journal introduction:
A peer-reviewed, multidisciplinary journal, Advances in Skin & Wound Care is highly regarded for its unique balance of cutting-edge original research and practical clinical management articles on wounds and other problems of skin integrity. Each issue features CME/CE for physicians and nurses; it was the first journal in the field to regularly offer continuing education for both disciplines.