Examining the Accuracy and Reproducibility of Responses to Nutrition Questions Related to Inflammatory Bowel Disease by Generative Pre-trained Transformer-4.

IF 1.8 Q3 GASTROENTEROLOGY & HEPATOLOGY
Crohn's & Colitis 360 Pub Date : 2025-02-19 eCollection Date: 2025-01-01 DOI:10.1093/crocol/otae077
Jamil S Samaan, Kelly Issokson, Erin Feldman, Christina Fasulo, Nithya Rajeev, Wee Han Ng, Barbara Hollander, Yee Hui Yeo, Eric Vasiliauskas
{"title":"Examining the Accuracy and Reproducibility of Responses to Nutrition Questions Related to Inflammatory Bowel Disease by Generative Pre-trained Transformer-4.","authors":"Jamil S Samaan, Kelly Issokson, Erin Feldman, Christina Fasulo, Nithya Rajeev, Wee Han Ng, Barbara Hollander, Yee Hui Yeo, Eric Vasiliauskas","doi":"10.1093/crocol/otae077","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Generative pre-trained transformer-4 (GPT-4) is a large language model (LLM) trained on a vast corpus of data, including the medical literature. Nutrition plays an important role in managing inflammatory bowel disease (IBD), with an unmet need for nutrition-related patient education resources. This study examines the accuracy, comprehensiveness, and reproducibility of responses by GPT-4 to patient nutrition questions related to IBD.</p><p><strong>Methods: </strong>Questions were obtained from adult IBD clinic visits, Facebook, and Reddit. Two IBD-focused registered dieticians independently graded the accuracy and reproducibility of GPT-4's responses while a third senior IBD-focused registered dietitian arbitrated. Each question was inputted twice into the model.</p><p><strong>Results: </strong>88 questions were selected. The model correctly responded to 73/88 questions (83.0%), with 61 (69.0%) graded as comprehensive. 15/88 (17%) responses were graded as mixed with correct and incorrect/outdated data. The model comprehensively responded to 10 (62.5%) questions related to \"Nutrition and diet needs for surgery,\" 12 (92.3%) \"Tube feeding and parenteral nutrition,\" 11 (64.7%) \"General diet questions,\" 10 (50%) \"Diet for reducing symptoms/inflammation,\" and 18 (81.8%) \"Micronutrients/supplementation needs.\" The model provided reproducible responses to 81/88 (92.0%) questions.</p><p><strong>Conclusions: </strong>GPT-4 comprehensively answered most questions, demonstrating the promising potential of LLMs as supplementary tools for IBD patients seeking nutrition-related information. However, 17% of responses contained incorrect information, highlighting the need for continuous refinement prior to incorporation into clinical practice. Future studies should emphasize leveraging LLMs to enhance patient outcomes and promoting patient and healthcare professional proficiency in using LLMs to maximize their efficacy.</p>","PeriodicalId":10847,"journal":{"name":"Crohn's & Colitis 360","volume":"7 1","pages":"otae077"},"PeriodicalIF":1.8000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11897593/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crohn's & Colitis 360","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/crocol/otae077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Generative pre-trained transformer-4 (GPT-4) is a large language model (LLM) trained on a vast corpus of data, including the medical literature. Nutrition plays an important role in managing inflammatory bowel disease (IBD), with an unmet need for nutrition-related patient education resources. This study examines the accuracy, comprehensiveness, and reproducibility of responses by GPT-4 to patient nutrition questions related to IBD.

Methods: Questions were obtained from adult IBD clinic visits, Facebook, and Reddit. Two IBD-focused registered dieticians independently graded the accuracy and reproducibility of GPT-4's responses while a third senior IBD-focused registered dietitian arbitrated. Each question was inputted twice into the model.

Results: 88 questions were selected. The model correctly responded to 73/88 questions (83.0%), with 61 (69.0%) graded as comprehensive. 15/88 (17%) responses were graded as mixed with correct and incorrect/outdated data. The model comprehensively responded to 10 (62.5%) questions related to "Nutrition and diet needs for surgery," 12 (92.3%) "Tube feeding and parenteral nutrition," 11 (64.7%) "General diet questions," 10 (50%) "Diet for reducing symptoms/inflammation," and 18 (81.8%) "Micronutrients/supplementation needs." The model provided reproducible responses to 81/88 (92.0%) questions.

Conclusions: GPT-4 comprehensively answered most questions, demonstrating the promising potential of LLMs as supplementary tools for IBD patients seeking nutrition-related information. However, 17% of responses contained incorrect information, highlighting the need for continuous refinement prior to incorporation into clinical practice. Future studies should emphasize leveraging LLMs to enhance patient outcomes and promoting patient and healthcare professional proficiency in using LLMs to maximize their efficacy.

通过生成预训练的Transformer-4检测炎症性肠病相关营养问题反应的准确性和可重复性。
背景:生成预训练转换器-4 (GPT-4)是一种大型语言模型(LLM),它是在包括医学文献在内的大量数据语料库上训练的。营养在治疗炎症性肠病(IBD)中起着重要作用,对营养相关患者教育资源的需求尚未得到满足。本研究考察了GPT-4对与IBD相关的患者营养问题反应的准确性、全面性和可重复性。方法:从成人IBD门诊就诊、Facebook和Reddit上获得问题。两名专注于ibd的注册营养师独立对GPT-4反应的准确性和可重复性进行评分,而第三名专注于ibd的高级注册营养师进行仲裁。每个问题在模型中输入两次。结果:共选取问题88个。该模型正确回答了73/88个问题(83.0%),其中61个问题(69.0%)被评为全面。15/88(17%)的回答分为混合正确和不正确/过时的数据。该模型综合回答了10个(62.5%)与“手术的营养和饮食需求”相关的问题,12个(92.3%)管饲和肠外营养11例(64.7%)“一般饮食问题”,10 (50%)“减轻症状/炎症的饮食”和18 (81.8%)“微量元素/补充需求。”该模型对81/88(92.0%)个问题提供了可重复的回答。结论:GPT-4全面回答了大多数问题,显示了LLMs作为IBD患者寻求营养相关信息的补充工具的潜力。然而,17%的答复包含不正确的信息,突出了在纳入临床实践之前需要不断改进。未来的研究应强调利用法学硕士来提高患者的治疗效果,并促进患者和医疗保健专业人员熟练使用法学硕士来最大化其疗效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Crohn's & Colitis 360
Crohn's & Colitis 360 Medicine-Gastroenterology
CiteScore
2.50
自引率
0.00%
发文量
41
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信