Evaluation of Online AI-Generated Foot and Ankle Surgery Information

IF 1.3 4区医学 Q2 Medicine

Journal of Foot & Ankle Surgery Pub Date : 2024-07-03 DOI:10.1053/j.jfas.2024.06.009

{"title":"Evaluation of Online AI-Generated Foot and Ankle Surgery Information","authors":"","doi":"10.1053/j.jfas.2024.06.009","DOIUrl":null,"url":null,"abstract":"<div><div><span>As a natural progression from educational pamphlets to the worldwide web, and now artificial intelligence (AI), OpenAI chatbots provide a simple way of obtaining pathology-specific patient information, however, little is known concerning the readability and quality of foot and ankle surgery information. This investigation compares such information using the commercially available OpenAI ChatGPT Chatbot and FootCareMD®. A list of common foot and ankle pathologies from FootCareMD® were queried and compared with similar results using ChatGPT. From both resources, the Flesch Reading Ease Score (FRES) and Flesch-Kincaid Grade Level (FKGL) scores were calculated for each condition. Qualitative analysis of each query was performed using the JAMA Benchmark Criteria Score and the DISCERN Score.The overall ChatGPT and FootCareMD® FRES scores were 31.12 ± 7.86 and 55.18 ± 7.27, respectively (</span><em>p</em> < .0001). The overall ChatGPT and FootCareMD® FKGL scores were 13.79 ± 1.22 and 9.60 ± 1.24 respectively (<em>p</em> < .0001), except for the pilon fracture FKGL scores (<em>p</em> = .09). The average JAMA Benchmark for all information obtained through ChatGPT and FootCareMD® were 0 ± 0 and 1.95 ± 0.15 (<em>p</em> < .001), respectively. The DISCERN Score for all information obtained through ChatGPT and FootCareMD® were 52.53 ± 5.39 and 66.93 ± 4.57 (<em>p</em> < .001), respectively. AI-assisted queries concerning common foot and ankle pathologies are written at a higher grade level and with less reliability and accuracy compared to similar information available on FootCareMD®. With the ease of use and increase in AI technology, consideration should be given to the nature and quality of information being shared with respect to the diagnosis and treatment of foot and ankle conditions.</div></div>","PeriodicalId":50191,"journal":{"name":"Journal of Foot & Ankle Surgery","volume":"63 6","pages":"Pages 680-683"},"PeriodicalIF":1.3000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Foot & Ankle Surgery","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1067251624001431","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

As a natural progression from educational pamphlets to the worldwide web, and now artificial intelligence (AI), OpenAI chatbots provide a simple way of obtaining pathology-specific patient information, however, little is known concerning the readability and quality of foot and ankle surgery information. This investigation compares such information using the commercially available OpenAI ChatGPT Chatbot and FootCareMD®. A list of common foot and ankle pathologies from FootCareMD® were queried and compared with similar results using ChatGPT. From both resources, the Flesch Reading Ease Score (FRES) and Flesch-Kincaid Grade Level (FKGL) scores were calculated for each condition. Qualitative analysis of each query was performed using the JAMA Benchmark Criteria Score and the DISCERN Score.The overall ChatGPT and FootCareMD® FRES scores were 31.12 ± 7.86 and 55.18 ± 7.27, respectively (p < .0001). The overall ChatGPT and FootCareMD® FKGL scores were 13.79 ± 1.22 and 9.60 ± 1.24 respectively (p < .0001), except for the pilon fracture FKGL scores (p = .09). The average JAMA Benchmark for all information obtained through ChatGPT and FootCareMD® were 0 ± 0 and 1.95 ± 0.15 (p < .001), respectively. The DISCERN Score for all information obtained through ChatGPT and FootCareMD® were 52.53 ± 5.39 and 66.93 ± 4.57 (p < .001), respectively. AI-assisted queries concerning common foot and ankle pathologies are written at a higher grade level and with less reliability and accuracy compared to similar information available on FootCareMD®. With the ease of use and increase in AI technology, consideration should be given to the nature and quality of information being shared with respect to the diagnosis and treatment of foot and ankle conditions.

查看原文本刊更多论文

评估在线人工智能生成的足踝手术信息。

从教育小册子到全球网络，再到现在的人工智能（AI），OpenAI 聊天机器人提供了一种获取病理特定患者信息的简单方法，但人们对足踝手术信息的可读性和质量知之甚少。本调查使用市售的 OpenAI ChatGPT 聊天机器人和 FootCareMD® 对此类信息进行了比较。我们查询了 FootCareMD® 中的常见足踝病症列表，并将其与 ChatGPT 中的类似结果进行了比较。通过这两种资源，计算出了每种情况下的弗莱什阅读容易程度得分（FRES）和弗莱什-金凯德等级水平（FKGL）得分。使用 JAMA 基准标准评分和 DISCERN 评分对每个查询进行了定性分析。ChatGPT 和 FootCareMD® FRES 的总分分别为（31.12±7.86）分和（55.18±7.27）分（P＜0.05）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Foot & Ankle Surgery ORTHOPEDICS-SURGERY

CiteScore

2.30

自引率

7.70%

发文量

234

审稿时长

29.8 weeks

期刊介绍： The Journal of Foot & Ankle Surgery is the leading source for original, clinically-focused articles on the surgical and medical management of the foot and ankle. Each bi-monthly, peer-reviewed issue addresses relevant topics to the profession, such as: adult reconstruction of the forefoot; adult reconstruction of the hindfoot and ankle; diabetes; medicine/rheumatology; pediatrics; research; sports medicine; trauma; and tumors.