ChatGPT-4o's performance on pediatric Vesicoureteral reflux.

IF 2 3区医学 Q2 PEDIATRICS

Journal of Pediatric Urology Pub Date : 2024-12-07 DOI:10.1016/j.jpurol.2024.12.002

Esra Nagehan Akyol Onder, Esra Ensari, Pelin Ertan

{"title":"ChatGPT-4o's performance on pediatric Vesicoureteral reflux.","authors":"Esra Nagehan Akyol Onder, Esra Ensari, Pelin Ertan","doi":"10.1016/j.jpurol.2024.12.002","DOIUrl":null,"url":null,"abstract":"Introduction: Vesicoureteral reflux (VUR) is a common congenital or acquired urinary disorder in children. Chat Generative Pre-trained Transformer (ChatGPT) is an artificial intelligence-driven platform offering medical information. This research aims to assess the reliability and readability of ChatGPT-4o's answers regarding pediatric VUR for general, non-medical audience.Materials and methods: Twenty of the most frequently asked English-language questions about VUR in children were used to evaluate ChatGPT-4o's responses. Two independent reviewers rated the reliability and quality using the Global Quality Scale (GQS) and a modified version of the DISCERN tool. The readability of ChatGPT responses was assessed through the Flesch Reading Ease (FRE) Score, Flesch-Kincaid Grade Level (FKGL), Gunning Fog Index (GFI), Coleman-Liau Index (CLI), and Simple Measure of Gobbledygook (SMOG).Results: Median mDISCERN and GQS scores were 4 (4-5) and 5 (3-5), respectively. Most of the responses of ChatGPT have moderate (55 %) and good (45 %) reliability according to the mDISCERN score and high quality (95 %) according to GQS. The mean ± standard deviation scores for FRE, FKGL, SMOG, GFI, and CLI of the text were 26 ± 12, 15 ± 2.5, 16.3 ± 2, 18.8 ± 2.9, and 15.3 ± 2.2, respectively, indicating a high level of reading difficulty.Discussion: While ChatGPT-4o offers accurate and high-quality information about pediatric VUR, its readability poses challenges, as the content is difficult to understand for a general audience.Conclusion: ChatGPT provides high-quality, accessible information about VUR. However, improving readability should be a priority to make this information more user-friendly for a broader audience.","PeriodicalId":16747,"journal":{"name":"Journal of Pediatric Urology","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pediatric Urology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jpurol.2024.12.002","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PEDIATRICS","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: Vesicoureteral reflux (VUR) is a common congenital or acquired urinary disorder in children. Chat Generative Pre-trained Transformer (ChatGPT) is an artificial intelligence-driven platform offering medical information. This research aims to assess the reliability and readability of ChatGPT-4o's answers regarding pediatric VUR for general, non-medical audience.

Materials and methods: Twenty of the most frequently asked English-language questions about VUR in children were used to evaluate ChatGPT-4o's responses. Two independent reviewers rated the reliability and quality using the Global Quality Scale (GQS) and a modified version of the DISCERN tool. The readability of ChatGPT responses was assessed through the Flesch Reading Ease (FRE) Score, Flesch-Kincaid Grade Level (FKGL), Gunning Fog Index (GFI), Coleman-Liau Index (CLI), and Simple Measure of Gobbledygook (SMOG).

Results: Median mDISCERN and GQS scores were 4 (4-5) and 5 (3-5), respectively. Most of the responses of ChatGPT have moderate (55 %) and good (45 %) reliability according to the mDISCERN score and high quality (95 %) according to GQS. The mean ± standard deviation scores for FRE, FKGL, SMOG, GFI, and CLI of the text were 26 ± 12, 15 ± 2.5, 16.3 ± 2, 18.8 ± 2.9, and 15.3 ± 2.2, respectively, indicating a high level of reading difficulty.

Discussion: While ChatGPT-4o offers accurate and high-quality information about pediatric VUR, its readability poses challenges, as the content is difficult to understand for a general audience.

Conclusion: ChatGPT provides high-quality, accessible information about VUR. However, improving readability should be a priority to make this information more user-friendly for a broader audience.

查看原文本刊更多论文

chatgpt - 40治疗小儿膀胱输尿管反流的疗效。

膀胱输尿管反流（VUR）是儿童常见的先天性或后天性泌尿系统疾病。聊天生成预训练转换器（ChatGPT）是一个人工智能驱动的医疗信息平台。本研究旨在评估chatgpt - 40关于儿科VUR的答案对一般非医疗受众的可靠性和可读性。材料和方法：使用20个最常见的关于儿童VUR的英语问题来评估chatgpt - 40的回答。两名独立评审员使用全球质量量表（GQS）和一个修改版的DISCERN工具对可靠性和质量进行了评估。通过Flesch Reading Ease (FRE) Score、Flesch- kincaid Grade Level （FKGL）、Gunning Fog Index （GFI）、Coleman-Liau Index （CLI）和Simple Measure of Gobbledygook （SMOG）来评估ChatGPT回答的可读性。结果：mDISCERN和GQS评分中位数分别为4（4-5）和5（3-5）。根据mDISCERN评分，ChatGPT的大多数回答具有中等（55%）和良好（45%）的可靠性，根据GQS， ChatGPT的大多数回答具有高质量（95%）。文本的FRE、FKGL、SMOG、GFI和CLI的平均±标准差分别为26±12、15±2.5、16.3±2、18.8±2.9和15.3±2.2，表明阅读困难程度较高。讨论：虽然chatgpt - 40提供了关于儿科VUR的准确和高质量的信息，但其可读性存在挑战，因为内容难以被普通受众理解。结论：ChatGPT提供了高质量、可访问的VUR信息。然而，提高可读性应该是一个优先事项，以使这些信息对更广泛的受众更友好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Pediatric Urology PEDIATRICS-UROLOGY & NEPHROLOGY

CiteScore

3.70

自引率

15.00%

发文量

330

审稿时长

4-8 weeks

期刊介绍： The Journal of Pediatric Urology publishes submitted research and clinical articles relating to Pediatric Urology which have been accepted after adequate peer review. It publishes regular articles that have been submitted after invitation, that cover the curriculum of Pediatric Urology, and enable trainee surgeons to attain theoretical competence of the sub-specialty. It publishes regular reviews of pediatric urological articles appearing in other journals. It publishes invited review articles by recognised experts on modern or controversial aspects of the sub-specialty. It enables any affiliated society to advertise society events or information in the journal without charge and will publish abstracts of papers to be read at society meetings.