Evaluation of Online AI-Generated Foot and Ankle Surgery Information

IF 1.3 4区 医学 Q2 Medicine
{"title":"Evaluation of Online AI-Generated Foot and Ankle Surgery Information","authors":"","doi":"10.1053/j.jfas.2024.06.009","DOIUrl":null,"url":null,"abstract":"<div><div><span>As a natural progression from educational pamphlets to the worldwide web, and now artificial intelligence (AI), OpenAI chatbots provide a simple way of obtaining pathology-specific patient information, however, little is known concerning the readability and quality of foot and ankle surgery information. This investigation compares such information using the commercially available OpenAI ChatGPT Chatbot and FootCareMD®. A list of common foot and ankle pathologies from FootCareMD® were queried and compared with similar results using ChatGPT. From both resources, the Flesch Reading Ease Score (FRES) and Flesch-Kincaid Grade Level (FKGL) scores were calculated for each condition. Qualitative analysis of each query was performed using the JAMA Benchmark Criteria Score and the DISCERN Score.The overall ChatGPT and FootCareMD® FRES scores were 31.12 ± 7.86 and 55.18 ± 7.27, respectively (</span><em>p</em> &lt; .0001). The overall ChatGPT and FootCareMD® FKGL scores were 13.79 ± 1.22 and 9.60 ± 1.24 respectively (<em>p</em> &lt; .0001), except for the pilon fracture FKGL scores (<em>p</em> = .09). The average JAMA Benchmark for all information obtained through ChatGPT and FootCareMD® were 0 ± 0 and 1.95 ± 0.15 (<em>p</em> &lt; .001), respectively. The DISCERN Score for all information obtained through ChatGPT and FootCareMD® were 52.53 ± 5.39 and 66.93 ± 4.57 (<em>p</em> &lt; .001), respectively. AI-assisted queries concerning common foot and ankle pathologies are written at a higher grade level and with less reliability and accuracy compared to similar information available on FootCareMD®. With the ease of use and increase in AI technology, consideration should be given to the nature and quality of information being shared with respect to the diagnosis and treatment of foot and ankle conditions.</div></div>","PeriodicalId":50191,"journal":{"name":"Journal of Foot & Ankle Surgery","volume":"63 6","pages":"Pages 680-683"},"PeriodicalIF":1.3000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Foot & Ankle Surgery","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1067251624001431","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

As a natural progression from educational pamphlets to the worldwide web, and now artificial intelligence (AI), OpenAI chatbots provide a simple way of obtaining pathology-specific patient information, however, little is known concerning the readability and quality of foot and ankle surgery information. This investigation compares such information using the commercially available OpenAI ChatGPT Chatbot and FootCareMD®. A list of common foot and ankle pathologies from FootCareMD® were queried and compared with similar results using ChatGPT. From both resources, the Flesch Reading Ease Score (FRES) and Flesch-Kincaid Grade Level (FKGL) scores were calculated for each condition. Qualitative analysis of each query was performed using the JAMA Benchmark Criteria Score and the DISCERN Score.The overall ChatGPT and FootCareMD® FRES scores were 31.12 ± 7.86 and 55.18 ± 7.27, respectively (p < .0001). The overall ChatGPT and FootCareMD® FKGL scores were 13.79 ± 1.22 and 9.60 ± 1.24 respectively (p < .0001), except for the pilon fracture FKGL scores (p = .09). The average JAMA Benchmark for all information obtained through ChatGPT and FootCareMD® were 0 ± 0 and 1.95 ± 0.15 (p < .001), respectively. The DISCERN Score for all information obtained through ChatGPT and FootCareMD® were 52.53 ± 5.39 and 66.93 ± 4.57 (p < .001), respectively. AI-assisted queries concerning common foot and ankle pathologies are written at a higher grade level and with less reliability and accuracy compared to similar information available on FootCareMD®. With the ease of use and increase in AI technology, consideration should be given to the nature and quality of information being shared with respect to the diagnosis and treatment of foot and ankle conditions.
评估在线人工智能生成的足踝手术信息。
从教育小册子到全球网络,再到现在的人工智能(AI),OpenAI 聊天机器人提供了一种获取病理特定患者信息的简单方法,但人们对足踝手术信息的可读性和质量知之甚少。本调查使用市售的 OpenAI ChatGPT 聊天机器人和 FootCareMD® 对此类信息进行了比较。我们查询了 FootCareMD® 中的常见足踝病症列表,并将其与 ChatGPT 中的类似结果进行了比较。通过这两种资源,计算出了每种情况下的弗莱什阅读容易程度得分(FRES)和弗莱什-金凯德等级水平(FKGL)得分。使用 JAMA 基准标准评分和 DISCERN 评分对每个查询进行了定性分析。ChatGPT 和 FootCareMD® FRES 的总分分别为(31.12±7.86)分和(55.18±7.27)分(P<0.05)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Foot & Ankle Surgery
Journal of Foot & Ankle Surgery ORTHOPEDICS-SURGERY
CiteScore
2.30
自引率
7.70%
发文量
234
审稿时长
29.8 weeks
期刊介绍: The Journal of Foot & Ankle Surgery is the leading source for original, clinically-focused articles on the surgical and medical management of the foot and ankle. Each bi-monthly, peer-reviewed issue addresses relevant topics to the profession, such as: adult reconstruction of the forefoot; adult reconstruction of the hindfoot and ankle; diabetes; medicine/rheumatology; pediatrics; research; sports medicine; trauma; and tumors.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信