Artificial intelligence insights into osteoporosis: assessing ChatGPT's information quality and readability.

IF 3.1 3区医学 Q2 ENDOCRINOLOGY & METABOLISM

Archives of Osteoporosis Pub Date : 2024-03-19 DOI:10.1007/s11657-024-01376-5

Yakup Erden, Mustafa Hüseyin Temel, Fatih Bağcıer

{"title":"Artificial intelligence insights into osteoporosis: assessing ChatGPT's information quality and readability.","authors":"Yakup Erden, Mustafa Hüseyin Temel, Fatih Bağcıer","doi":"10.1007/s11657-024-01376-5","DOIUrl":null,"url":null,"abstract":"Accessible, accurate information, and readability play crucial role in empowering individuals managing osteoporosis. This study showed that the responses generated by ChatGPT regarding osteoporosis had serious problems with quality and were at a level of complexity that that necessitates an educational background of approximately 17 years.Purpose: The use of artificial intelligence (AI) applications as a source of information in the field of health is increasing. Readable and accurate information plays a critical role in empowering patients to make decisions about their disease. The aim was to examine the quality and readability of responses provided by ChatGPT, an AI chatbot, to commonly asked questions regarding osteoporosis, representing a major public health problem.Methods: \"Osteoporosis,\" \"female osteoporosis,\" and \"male osteoporosis\" were identified by using Google trends for the 25 most frequently searched keywords on Google. A selected set of 38 keywords was sequentially inputted into the chat interface of the ChatGPT. The responses were evaluated with tools of the Ensuring Quality Information for Patients (EQIP), the Flesch-Kincaid Grade Level (FKGL), and the Flesch-Kincaid Reading Ease (FKRE).Results: The EQIP score of the texts ranged from a minimum of 36.36 to a maximum of 61.76 with a mean value of 48.71 as having \"serious problems with quality.\" The FKRE scores spanned from 13.71 to 56.06 with a mean value of 28.71 and the FKGL varied between 8.48 and 17.63, with a mean value of 13.25. There were no statistically significant correlations between the EQIP score and the FKGL or FKRE scores.Conclusions: Although ChatGPT is easily accessible for patients to obtain information about osteoporosis, its current quality and readability fall short of meeting comprehensive healthcare standards.","PeriodicalId":8283,"journal":{"name":"Archives of Osteoporosis","volume":"19 1","pages":"17"},"PeriodicalIF":3.1000,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Osteoporosis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11657-024-01376-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}

引用次数: 0

Abstract

Accessible, accurate information, and readability play crucial role in empowering individuals managing osteoporosis. This study showed that the responses generated by ChatGPT regarding osteoporosis had serious problems with quality and were at a level of complexity that that necessitates an educational background of approximately 17 years.

Purpose: The use of artificial intelligence (AI) applications as a source of information in the field of health is increasing. Readable and accurate information plays a critical role in empowering patients to make decisions about their disease. The aim was to examine the quality and readability of responses provided by ChatGPT, an AI chatbot, to commonly asked questions regarding osteoporosis, representing a major public health problem.

Methods: "Osteoporosis," "female osteoporosis," and "male osteoporosis" were identified by using Google trends for the 25 most frequently searched keywords on Google. A selected set of 38 keywords was sequentially inputted into the chat interface of the ChatGPT. The responses were evaluated with tools of the Ensuring Quality Information for Patients (EQIP), the Flesch-Kincaid Grade Level (FKGL), and the Flesch-Kincaid Reading Ease (FKRE).

Results: The EQIP score of the texts ranged from a minimum of 36.36 to a maximum of 61.76 with a mean value of 48.71 as having "serious problems with quality." The FKRE scores spanned from 13.71 to 56.06 with a mean value of 28.71 and the FKGL varied between 8.48 and 17.63, with a mean value of 13.25. There were no statistically significant correlations between the EQIP score and the FKGL or FKRE scores.

Conclusions: Although ChatGPT is easily accessible for patients to obtain information about osteoporosis, its current quality and readability fall short of meeting comprehensive healthcare standards.

查看原文本刊更多论文

人工智能洞察骨质疏松症：评估 ChatGPT 的信息质量和可读性。

信息的可访问性、准确性和可读性在增强骨质疏松症患者的能力方面起着至关重要的作用。这项研究表明，由 ChatGPT 生成的有关骨质疏松症的回复存在严重的质量问题，其复杂程度需要约 17 年的教育背景。可读且准确的信息在增强患者对自身疾病做出决定的能力方面发挥着至关重要的作用。我们的目的是研究人工智能聊天机器人 ChatGPT 在回答有关骨质疏松症这一重大公共卫生问题的常见问题时所提供答复的质量和可读性："骨质疏松症"、"女性骨质疏松症 "和 "男性骨质疏松症 "是通过谷歌趋势对谷歌上搜索频率最高的 25 个关键词进行识别的。将选定的 38 个关键词依次输入 ChatGPT 的聊天界面。使用确保患者信息质量（EQIP）、弗莱什-金凯德分级（FKGL）和弗莱什-金凯德阅读轻松度（FKRE）工具对回复进行评估：文本的 EQIP 分数最低为 36.36 分，最高为 61.76 分，平均值为 48.71 分，即 "质量存在严重问题"。FKRE 分数从 13.71 到 56.06 不等，平均值为 28.71；FKGL 分数从 8.48 到 17.63 不等，平均值为 13.25。EQIP 分数与 FKGL 或 FKRE 分数之间没有统计学意义上的相关性：虽然 ChatGPT 方便患者获取有关骨质疏松症的信息，但其目前的质量和可读性还不能满足全面的医疗保健标准。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Archives of Osteoporosis ENDOCRINOLOGY & METABOLISMORTHOPEDICS -ORTHOPEDICS

CiteScore

5.50

自引率

10.00%

发文量

133

期刊介绍： Archives of Osteoporosis is an international multidisciplinary journal which is a joint initiative of the International Osteoporosis Foundation and the National Osteoporosis Foundation of the USA. The journal will highlight the specificities of different regions around the world concerning epidemiology, reference values for bone density and bone metabolism, as well as clinical aspects of osteoporosis and other bone diseases.