Evaluation of the Reliability of ChatGPT to Provide Guidance on Recombinant Zoster Vaccination for Patients With Rheumatic and Musculoskeletal Diseases.
Akhil Sood, Amanda Moyer, Pegah Jahangiri, Diane Mar, Prachaya Nitichaikulvatana, Nitya Ramreddy, Liya Stolyar, Janice Lin
{"title":"Evaluation of the Reliability of ChatGPT to Provide Guidance on Recombinant Zoster Vaccination for Patients With Rheumatic and Musculoskeletal Diseases.","authors":"Akhil Sood, Amanda Moyer, Pegah Jahangiri, Diane Mar, Prachaya Nitichaikulvatana, Nitya Ramreddy, Liya Stolyar, Janice Lin","doi":"10.1097/RHU.0000000000002198","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Large language models (LLMs) such as ChatGPT can potentially transform the delivery of health information. This study aims to evaluate the accuracy and completeness of ChatGPT in responding to questions on recombinant zoster vaccination (RZV) in patients with rheumatic and musculoskeletal diseases.</p><p><strong>Methods: </strong>A cross-sectional study was conducted using 20 prompts based on information from the Centers for Disease Control and Prevention (CDC), the Advisory Committee on Immunization Practices (ACIP), and the American College of Rheumatology (ACR). These prompts were inputted into ChatGPT 3.5. Five rheumatologists independently scored the ChatGPT responses for accuracy (Likert 1 to 5) and completeness (Likert 1 to 3) compared with validated information sources (CDC, ACIP, and ACR).</p><p><strong>Results: </strong>The overall mean accuracy of ChatGPT responses on a 5-point scale was 4.04, with 80% of responses scoring ≥4. The mean completeness score of ChatGPT response on a 3-point scale was 2.3, with 95% of responses scoring ≥2. Among the 5 raters, ChatGPT unanimously scored with high accuracy and completeness to various patient and physician questions surrounding RZV. There was one instance where it scored with low accuracy and completeness. Although not significantly different, ChatGPT demonstrated the highest accuracy and completeness in answering questions related to ACIP guidelines compared with other information sources.</p><p><strong>Conclusions: </strong>ChatGPT exhibits promising ability to address specific queries regarding RZV for rheumatic and musculoskeletal disease patients. However, it is essential to approach ChatGPT with caution due to risk of misinformation. This study emphasizes the importance of rigorously validating LLMs as a health information source.</p>","PeriodicalId":14745,"journal":{"name":"JCR: Journal of Clinical Rheumatology","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCR: Journal of Clinical Rheumatology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/RHU.0000000000002198","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Large language models (LLMs) such as ChatGPT can potentially transform the delivery of health information. This study aims to evaluate the accuracy and completeness of ChatGPT in responding to questions on recombinant zoster vaccination (RZV) in patients with rheumatic and musculoskeletal diseases.
Methods: A cross-sectional study was conducted using 20 prompts based on information from the Centers for Disease Control and Prevention (CDC), the Advisory Committee on Immunization Practices (ACIP), and the American College of Rheumatology (ACR). These prompts were inputted into ChatGPT 3.5. Five rheumatologists independently scored the ChatGPT responses for accuracy (Likert 1 to 5) and completeness (Likert 1 to 3) compared with validated information sources (CDC, ACIP, and ACR).
Results: The overall mean accuracy of ChatGPT responses on a 5-point scale was 4.04, with 80% of responses scoring ≥4. The mean completeness score of ChatGPT response on a 3-point scale was 2.3, with 95% of responses scoring ≥2. Among the 5 raters, ChatGPT unanimously scored with high accuracy and completeness to various patient and physician questions surrounding RZV. There was one instance where it scored with low accuracy and completeness. Although not significantly different, ChatGPT demonstrated the highest accuracy and completeness in answering questions related to ACIP guidelines compared with other information sources.
Conclusions: ChatGPT exhibits promising ability to address specific queries regarding RZV for rheumatic and musculoskeletal disease patients. However, it is essential to approach ChatGPT with caution due to risk of misinformation. This study emphasizes the importance of rigorously validating LLMs as a health information source.
期刊介绍:
JCR: Journal of Clinical Rheumatology the peer-reviewed, bimonthly journal that rheumatologists asked for. Each issue contains practical information on patient care in a clinically oriented, easy-to-read format. Our commitment is to timely, relevant coverage of the topics and issues shaping current practice. We pack each issue with original articles, case reports, reviews, brief reports, expert commentary, letters to the editor, and more. This is where you''ll find the answers to tough patient management issues as well as the latest information about technological advances affecting your practice.