Large language models to develop evidence-based strategies for primary and secondary cardiovascular prevention.

IF 4.4 Q1 CARDIAC & CARDIOVASCULAR SYSTEMS
European heart journal. Digital health Pub Date : 2025-08-14 eCollection Date: 2025-09-01 DOI:10.1093/ehjdh/ztaf085
Giulia Lorenzoni, Camilla Zanotto, Anna Sordo, Alberto Cipriani, Martina Perazzolo Marra, Francesco Tona, Daniele Gasparini, Dario Gregori
{"title":"Large language models to develop evidence-based strategies for primary and secondary cardiovascular prevention.","authors":"Giulia Lorenzoni, Camilla Zanotto, Anna Sordo, Alberto Cipriani, Martina Perazzolo Marra, Francesco Tona, Daniele Gasparini, Dario Gregori","doi":"10.1093/ehjdh/ztaf085","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Cardiovascular diseases are the leading global cause of mortality, with ischaemic heart disease contributing significantly to the burden. Primary and secondary prevention strategies are essential to reducing the incidence and recurrence of acute myocardial infarction. Healthcare professionals are no longer the sole source of health education; the Internet, including tools powered by artificial intelligence, is also widely utilized. This study evaluates the accuracy and the readability of large language model (LLM)-generated information on cardiovascular primary and secondary prevention.</p><p><strong>Methods and results: </strong>An observational study assessed LLM's responses to two tailored questions about acute myocardial infarction risk prevention. The LLM used was ChatGPT (4o version). Expert cardiologists evaluated the accuracy of each response using a Likert scale, while readability was assessed with the Flesch Reading Ease Score (FRES). ChatGPT-4o provided comprehensive and accurate responses for 15 out of 20 (75%) of the items. Readability scores were low, with median FRES indicating that both primary and secondary prevention content were difficult to understand. Specialized clinical topics exhibited lower accuracy and readability compared to the other topics.</p><p><strong>Conclusion: </strong>The current study demonstrated that ChatGPT-4o provided accurate information on primary and secondary prevention, although its readability was assessed as difficult. However, clinical oversight still remains critical to bridge gaps in accuracy and readability and ensure optimal patient outcomes.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"6 5","pages":"1069-1075"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12450516/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European heart journal. Digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/ehjdh/ztaf085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: Cardiovascular diseases are the leading global cause of mortality, with ischaemic heart disease contributing significantly to the burden. Primary and secondary prevention strategies are essential to reducing the incidence and recurrence of acute myocardial infarction. Healthcare professionals are no longer the sole source of health education; the Internet, including tools powered by artificial intelligence, is also widely utilized. This study evaluates the accuracy and the readability of large language model (LLM)-generated information on cardiovascular primary and secondary prevention.

Methods and results: An observational study assessed LLM's responses to two tailored questions about acute myocardial infarction risk prevention. The LLM used was ChatGPT (4o version). Expert cardiologists evaluated the accuracy of each response using a Likert scale, while readability was assessed with the Flesch Reading Ease Score (FRES). ChatGPT-4o provided comprehensive and accurate responses for 15 out of 20 (75%) of the items. Readability scores were low, with median FRES indicating that both primary and secondary prevention content were difficult to understand. Specialized clinical topics exhibited lower accuracy and readability compared to the other topics.

Conclusion: The current study demonstrated that ChatGPT-4o provided accurate information on primary and secondary prevention, although its readability was assessed as difficult. However, clinical oversight still remains critical to bridge gaps in accuracy and readability and ensure optimal patient outcomes.

Abstract Image

Abstract Image

Abstract Image

开发基于证据的心血管一级和二级预防策略的大型语言模型。
目的:心血管疾病是全球主要的死亡原因,缺血性心脏病是造成这一负担的主要原因。一级和二级预防策略对于降低急性心肌梗死的发生率和复发率至关重要。医疗保健专业人员不再是健康教育的唯一来源;互联网,包括人工智能驱动的工具,也被广泛使用。本研究评估了大语言模型(LLM)生成的心血管一级和二级预防信息的准确性和可读性。方法和结果:一项观察性研究评估了LLM对关于急性心肌梗死风险预防的两个定制问题的反应。使用的LLM是ChatGPT(40版本)。心脏病专家使用李克特量表评估每个反应的准确性,而可读性则使用Flesch Reading Ease Score (FRES)进行评估。chatgpt - 40对20个项目中的15个(75%)提供了全面而准确的回答。可读性评分较低,FRES中位数表明初级和二级预防内容都难以理解。与其他主题相比,专业临床主题的准确性和可读性较低。结论:目前的研究表明,chatgpt - 40提供了一级和二级预防的准确信息,尽管其可读性被评估为困难。然而,临床监督仍然是弥合准确性和可读性差距并确保最佳患者结果的关键。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.00
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信