Evaluation of the Reliability of ChatGPT to Provide Guidance on Recombinant Zoster Vaccination for Patients With Rheumatic and Musculoskeletal Diseases.

IF 1.8 4区医学 Q2 RHEUMATOLOGY

JCR: Journal of Clinical Rheumatology Pub Date : 2025-06-01 Epub Date: 2025-01-15 DOI:10.1097/RHU.0000000000002198

Akhil Sood, Amanda Moyer, Pegah Jahangiri, Diane Mar, Prachaya Nitichaikulvatana, Nitya Ramreddy, Liya Stolyar, Janice Lin

{"title":"Evaluation of the Reliability of ChatGPT to Provide Guidance on Recombinant Zoster Vaccination for Patients With Rheumatic and Musculoskeletal Diseases.","authors":"Akhil Sood, Amanda Moyer, Pegah Jahangiri, Diane Mar, Prachaya Nitichaikulvatana, Nitya Ramreddy, Liya Stolyar, Janice Lin","doi":"10.1097/RHU.0000000000002198","DOIUrl":null,"url":null,"abstract":"Introduction: Large language models (LLMs) such as ChatGPT can potentially transform the delivery of health information. This study aims to evaluate the accuracy and completeness of ChatGPT in responding to questions on recombinant zoster vaccination (RZV) in patients with rheumatic and musculoskeletal diseases.Methods: A cross-sectional study was conducted using 20 prompts based on information from the Centers for Disease Control and Prevention (CDC), the Advisory Committee on Immunization Practices (ACIP), and the American College of Rheumatology (ACR). These prompts were inputted into ChatGPT 3.5. Five rheumatologists independently scored the ChatGPT responses for accuracy (Likert 1 to 5) and completeness (Likert 1 to 3) compared with validated information sources (CDC, ACIP, and ACR).Results: The overall mean accuracy of ChatGPT responses on a 5-point scale was 4.04, with 80% of responses scoring ≥4. The mean completeness score of ChatGPT response on a 3-point scale was 2.3, with 95% of responses scoring ≥2. Among the 5 raters, ChatGPT unanimously scored with high accuracy and completeness to various patient and physician questions surrounding RZV. There was one instance where it scored with low accuracy and completeness. Although not significantly different, ChatGPT demonstrated the highest accuracy and completeness in answering questions related to ACIP guidelines compared with other information sources.Conclusions: ChatGPT exhibits promising ability to address specific queries regarding RZV for rheumatic and musculoskeletal disease patients. However, it is essential to approach ChatGPT with caution due to risk of misinformation. This study emphasizes the importance of rigorously validating LLMs as a health information source.","PeriodicalId":14745,"journal":{"name":"JCR: Journal of Clinical Rheumatology","volume":" ","pages":"156-161"},"PeriodicalIF":1.8000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12251431/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCR: Journal of Clinical Rheumatology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/RHU.0000000000002198","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/15 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: Large language models (LLMs) such as ChatGPT can potentially transform the delivery of health information. This study aims to evaluate the accuracy and completeness of ChatGPT in responding to questions on recombinant zoster vaccination (RZV) in patients with rheumatic and musculoskeletal diseases.

Methods: A cross-sectional study was conducted using 20 prompts based on information from the Centers for Disease Control and Prevention (CDC), the Advisory Committee on Immunization Practices (ACIP), and the American College of Rheumatology (ACR). These prompts were inputted into ChatGPT 3.5. Five rheumatologists independently scored the ChatGPT responses for accuracy (Likert 1 to 5) and completeness (Likert 1 to 3) compared with validated information sources (CDC, ACIP, and ACR).

Results: The overall mean accuracy of ChatGPT responses on a 5-point scale was 4.04, with 80% of responses scoring ≥4. The mean completeness score of ChatGPT response on a 3-point scale was 2.3, with 95% of responses scoring ≥2. Among the 5 raters, ChatGPT unanimously scored with high accuracy and completeness to various patient and physician questions surrounding RZV. There was one instance where it scored with low accuracy and completeness. Although not significantly different, ChatGPT demonstrated the highest accuracy and completeness in answering questions related to ACIP guidelines compared with other information sources.

Conclusions: ChatGPT exhibits promising ability to address specific queries regarding RZV for rheumatic and musculoskeletal disease patients. However, it is essential to approach ChatGPT with caution due to risk of misinformation. This study emphasizes the importance of rigorously validating LLMs as a health information source.

查看原文本刊更多论文

ChatGPT为风湿病和肌肉骨骼疾病患者重组带状疱疹疫苗接种提供指导的可靠性评价

简介：像ChatGPT这样的大型语言模型（llm）可以潜在地改变健康信息的传递。本研究旨在评价ChatGPT在回答风湿病和肌肉骨骼疾病患者重组带状疱疹疫苗接种（RZV）问题时的准确性和完整性。方法：采用基于疾病控制和预防中心（CDC）、免疫实践咨询委员会（ACIP）和美国风湿病学会（ACR）信息的20个提示进行横断面研究。这些提示输入到ChatGPT 3.5中。五位风湿病学家独立地对ChatGPT的准确性（李克特1至5）和完整性（李克特1至3）进行评分，并与有效的信息源（CDC、ACIP和ACR）进行比较。结果：ChatGPT在5分制上的总体平均准确性为4.04,80%的回答得分≥4。ChatGPT反应在3分制上的平均完整性得分为2.3分，95%的反应得分≥2分。在5个评分者中，ChatGPT对RZV相关的各种患者和医生问题一致给出了较高的准确性和完整性。有一个例子，它得分的准确性和完整性都很低。与其他信息源相比，ChatGPT在回答与ACIP指南相关的问题时显示出最高的准确性和完整性，尽管没有显著差异。结论：ChatGPT具有解决风湿病和肌肉骨骼疾病患者RZV特异性问题的良好能力。但是，由于存在错误信息的风险，必须谨慎处理ChatGPT。本研究强调严格验证法学硕士作为健康信息源的重要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JCR: Journal of Clinical Rheumatology 医学-风湿病学

CiteScore

3.50

自引率

2.90%

发文量

228

审稿时长

4-8 weeks

期刊介绍： JCR: Journal of Clinical Rheumatology the peer-reviewed, bimonthly journal that rheumatologists asked for. Each issue contains practical information on patient care in a clinically oriented, easy-to-read format. Our commitment is to timely, relevant coverage of the topics and issues shaping current practice. We pack each issue with original articles, case reports, reviews, brief reports, expert commentary, letters to the editor, and more. This is where you''ll find the answers to tough patient management issues as well as the latest information about technological advances affecting your practice.