Evaluating reliability, quality, and readability of ChatGPT's nutritional recommendations for women with polycystic ovary syndrome

IF 3.1 3区医学 Q2 NUTRITION & DIETETICS

Nutrition Research Pub Date : 2025-01-01 DOI:10.1016/j.nutres.2024.11.005

Elif Ulug , Irmak Gunesli , Aylin Acıkgoz Pinar , Bulent Okan Yildiz

{"title":"Evaluating reliability, quality, and readability of ChatGPT's nutritional recommendations for women with polycystic ovary syndrome","authors":"Elif Ulug , Irmak Gunesli , Aylin Acıkgoz Pinar , Bulent Okan Yildiz","doi":"10.1016/j.nutres.2024.11.005","DOIUrl":null,"url":null,"abstract":"<div><div>Patients with polycystic ovary syndrome (PCOS) often have many questions about nutrition and turn to chatbots such as Chat Generative Pretrained Transformer (ChatGPT) for advice. This study aims to evaluate the reliability, quality, and readability of ChatGPT's responses to nutrition-related questions asked by women with PCOS. Frequently asked nutrition-related questions from women with PCOS were reviewed in both Turkish and English. The reliability and quality of the answers were independently evaluated by 2 authors and a panel of 10 expert dietitians, using modified DISCERN and global quality score. Additionally, the readability of the answers was calculated using frequently used readability formulas. The mean modified DISCERN scores for English and Turkish versions were 27.6±0.87 and 27.2±0.87, respectively, indicating a fair level of reliability in the responses (16–31 points or 40%–79%). According to the global quality score, 100% of the responses in English and 90.9% of the responses in Turkish were rated as high quality. The readability of responses was classified as “difficult to read” with the readership levels assessed at college level and above for both English and Turkish. The correlation and regression analyses indicated no relationship between reliability, quality, and readability in English. However, a significant relationship was observed between quality and readability indexes in Turkish (<em>P</em> < .05). Our results suggest that ChatGPT's responses to nutrition-related questions about PCOS are generally of high quality, but improvements in both reliability and readability are still necessary. Although ChatGPT can offer general information and guidance on nutrition for PCOS, it should not be considered a substitute for personalized medical advice from health care professionals for effective management of the syndrome.</div></div>","PeriodicalId":19245,"journal":{"name":"Nutrition Research","volume":"133 ","pages":"Pages 46-53"},"PeriodicalIF":3.1000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nutrition Research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0271531724001507","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NUTRITION & DIETETICS","Score":null,"Total":0}

引用次数: 0

Abstract

Patients with polycystic ovary syndrome (PCOS) often have many questions about nutrition and turn to chatbots such as Chat Generative Pretrained Transformer (ChatGPT) for advice. This study aims to evaluate the reliability, quality, and readability of ChatGPT's responses to nutrition-related questions asked by women with PCOS. Frequently asked nutrition-related questions from women with PCOS were reviewed in both Turkish and English. The reliability and quality of the answers were independently evaluated by 2 authors and a panel of 10 expert dietitians, using modified DISCERN and global quality score. Additionally, the readability of the answers was calculated using frequently used readability formulas. The mean modified DISCERN scores for English and Turkish versions were 27.6±0.87 and 27.2±0.87, respectively, indicating a fair level of reliability in the responses (16–31 points or 40%–79%). According to the global quality score, 100% of the responses in English and 90.9% of the responses in Turkish were rated as high quality. The readability of responses was classified as “difficult to read” with the readership levels assessed at college level and above for both English and Turkish. The correlation and regression analyses indicated no relationship between reliability, quality, and readability in English. However, a significant relationship was observed between quality and readability indexes in Turkish (P < .05). Our results suggest that ChatGPT's responses to nutrition-related questions about PCOS are generally of high quality, but improvements in both reliability and readability are still necessary. Although ChatGPT can offer general information and guidance on nutrition for PCOS, it should not be considered a substitute for personalized medical advice from health care professionals for effective management of the syndrome.

查看原文本刊更多论文

评估 ChatGPT 为多囊卵巢综合征妇女提供的营养建议的可靠性、质量和可读性。

多囊卵巢综合征（PCOS）患者经常有很多关于营养的问题，并转向聊天机器人，如聊天生成预训练变压器（ChatGPT）寻求建议。本研究旨在评估ChatGPT对多囊卵巢综合征女性营养相关问题的回答的可靠性、质量和可读性。用土耳其语和英语对多囊卵巢综合征妇女常见的营养相关问题进行了审查。答案的可靠性和质量由2位作者和10位专家营养师组成的小组独立评估，使用改良的DISCERN和全球质量评分。此外，使用常用的可读性公式计算答案的可读性。英语和土耳其语版本的平均修正辨别分数分别为27.6±0.87和27.2±0.87，表明回答的可靠性水平相当（16-31分或40%-79%）。根据全球质量评分，100%的英语回答和90.9%的土耳其语回答被评为高质量。回答的可读性被归类为“难以阅读”，英语和土耳其语的读者水平被评估为大学及以上水平。相关分析和回归分析显示信度、质量和英文可读性之间没有关系。然而，土耳其语的质量和可读性指标之间存在显著相关性（P < 0.05）。我们的研究结果表明，ChatGPT对PCOS营养相关问题的回答总体上是高质量的，但可靠性和可读性仍然需要改进。虽然ChatGPT可以为多囊卵巢综合征患者提供营养方面的一般信息和指导，但它不应被视为医疗保健专业人员为有效治疗多囊卵巢综合征而提供的个性化医疗建议的替代品。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Nutrition Research 医学-营养学

CiteScore

7.60

自引率

2.20%

发文量

107

审稿时长

58 days

期刊介绍： Nutrition Research publishes original research articles, communications, and reviews on basic and applied nutrition. The mission of Nutrition Research is to serve as the journal for global communication of nutrition and life sciences research on diet and health. The field of nutrition sciences includes, but is not limited to, the study of nutrients during growth, reproduction, aging, health, and disease. Articles covering basic and applied research on all aspects of nutrition sciences are encouraged, including: nutritional biochemistry and metabolism; metabolomics, nutrient gene interactions; nutrient requirements for health; nutrition and disease; digestion and absorption; nutritional anthropology; epidemiology; the influence of socioeconomic and cultural factors on nutrition of the individual and the community; the impact of nutrient intake on disease response and behavior; the consequences of nutritional deficiency on growth and development, endocrine and nervous systems, and immunity; nutrition and gut microbiota; food intolerance and allergy; nutrient drug interactions; nutrition and aging; nutrition and cancer; obesity; diabetes; and intervention programs.