Kaitlin Shinn, Cory S Henderson, Aldo L Schenone, Sascha N Goonewardena, Supriya Shore, Venkatesh L Murthy, Chaitanya Madamanchi
{"title":"Can ChatGPT answer patients' questions about nuclear stress tests and <sup>18</sup>F-Flurodeoxyglucose PET for myocardial inflammation?","authors":"Kaitlin Shinn, Cory S Henderson, Aldo L Schenone, Sascha N Goonewardena, Supriya Shore, Venkatesh L Murthy, Chaitanya Madamanchi","doi":"10.1016/j.nuclcard.2025.102174","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Several modalities are used for stress testing and require specific patient preparation. <sup>18</sup>F-Flurodeoxyglucose positron emission tomography (FDG PET) is an important tool in the diagnosis and risk stratification of patients suspected of cardiac sarcoidosis and endocarditis[1-3]. There is a need for improved patient access to questions regarding cardiac testing to ensure proper adherence to instructions. We sought to evaluate the effectiveness of ChatGPT in answering questions about stress testing and cardiac FDG PET inflammation scans.</p><p><strong>Methods and results: </strong>We generated fifty-eight questions about stress testing and cardiac FDG PET inflammation scans. OpenAI ChatGPT 3.5 and 4o were used to answer the questions. The answers were graded by three nuclear cardiologists as: 1 = correct and complete, 2= somewhat correct/somewhat complete, 3 = incorrect: no benefit, or 4 = incorrect: harmful/misleading information (Table I). Of the 174 grades assigned to responses to the questions from ChatGPT 3.5, 62/174 (36%) were correct and complete, 93/174 (53%) were somewhat correct/somewhat complete, 12/174 (7%) were incorrect: no benefit, 7/174 (4%) were incorrect: harmful/misleading information. Of the grades assigned to responses to questions from ChatGPT-4o, 107/174 (61%) were correct and complete, 62/174 (36%) were somewhat correct/somewhat complete, 3/174 (2%) were incorrect: no benefit, 2/174 (1%) were incorrect: harmful/misleading information.</p><p><strong>Conclusions: </strong>ChatGPT can provide some accurate responses to patient questions regarding stress tests and cardiac FDG PET inflammation studies, and its accuracy has improved over time, however it is not suitable as a primary resource for clinical care at this stage of development.</p>","PeriodicalId":16476,"journal":{"name":"Journal of Nuclear Cardiology","volume":" ","pages":"102174"},"PeriodicalIF":3.0000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Nuclear Cardiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.nuclcard.2025.102174","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Several modalities are used for stress testing and require specific patient preparation. 18F-Flurodeoxyglucose positron emission tomography (FDG PET) is an important tool in the diagnosis and risk stratification of patients suspected of cardiac sarcoidosis and endocarditis[1-3]. There is a need for improved patient access to questions regarding cardiac testing to ensure proper adherence to instructions. We sought to evaluate the effectiveness of ChatGPT in answering questions about stress testing and cardiac FDG PET inflammation scans.
Methods and results: We generated fifty-eight questions about stress testing and cardiac FDG PET inflammation scans. OpenAI ChatGPT 3.5 and 4o were used to answer the questions. The answers were graded by three nuclear cardiologists as: 1 = correct and complete, 2= somewhat correct/somewhat complete, 3 = incorrect: no benefit, or 4 = incorrect: harmful/misleading information (Table I). Of the 174 grades assigned to responses to the questions from ChatGPT 3.5, 62/174 (36%) were correct and complete, 93/174 (53%) were somewhat correct/somewhat complete, 12/174 (7%) were incorrect: no benefit, 7/174 (4%) were incorrect: harmful/misleading information. Of the grades assigned to responses to questions from ChatGPT-4o, 107/174 (61%) were correct and complete, 62/174 (36%) were somewhat correct/somewhat complete, 3/174 (2%) were incorrect: no benefit, 2/174 (1%) were incorrect: harmful/misleading information.
Conclusions: ChatGPT can provide some accurate responses to patient questions regarding stress tests and cardiac FDG PET inflammation studies, and its accuracy has improved over time, however it is not suitable as a primary resource for clinical care at this stage of development.
期刊介绍:
Journal of Nuclear Cardiology is the only journal in the world devoted to this dynamic and growing subspecialty. Physicians and technologists value the Journal not only for its peer-reviewed articles, but also for its timely discussions about the current and future role of nuclear cardiology. Original articles address all aspects of nuclear cardiology, including interpretation, diagnosis, imaging equipment, and use of radiopharmaceuticals. As the official publication of the American Society of Nuclear Cardiology, the Journal also brings readers the latest information emerging from the Society''s task forces and publishes guidelines and position papers as they are adopted.