Yuval Avidan , Ibrahim Naoum , Razi Khoury , Sameha Zahra , Nissan Ben Dov , Jorge E Schliamser , Asaf Danon , Amir Aker
{"title":"Can ChatGPT accurately detect atrial fibrillation using smartwatch ECG?","authors":"Yuval Avidan , Ibrahim Naoum , Razi Khoury , Sameha Zahra , Nissan Ben Dov , Jorge E Schliamser , Asaf Danon , Amir Aker","doi":"10.1016/j.hrtlng.2025.04.032","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Current guidelines require physician confirmation for smartwatch-diagnosed atrial fibrillation (AF), increasing telemedicine workloads. The newest ChatGPT-4o (GPT-4o) incorporates advanced image input capabilities.</div></div><div><h3>Objective</h3><div>To assess GPT-4o’s performance in identifying AF from smartwatch recordings.</div></div><div><h3>Methods</h3><div>Consecutive 120 patients with AF and 60 controls with sinus rhythm (SR), confirmed by conventional 12-lead ECG, recorded single-lead ECGs using an Apple Watch (AW) Series 6®. Two blinded cardiologists independently classified the smartwatch recordings as AF, SR, or undetermined. GPT-4o was subsequently prompted to analyze all smartwatch ECGs.</div></div><div><h3>Results</h3><div>Six AF cases were excluded due to undetermined AW-ECG recordings, leaving 114 AF patients (mean age: 73.4 ± 10.4 years) and 60 controls. The AW algorithm achieved 97.3 % and 100 % accuracy for AF and SR, respectively, while GPT-4o correctly analyzed 47.3 % of AF and 71.6 % of SR tracings. None of the AF characteristics—chronicity, heart rate, QRS width, fibrillatory wave amplitude, or R-wave amplitude and polarity—were predictive of GPT-4o's diagnostic accuracy.</div></div><div><h3>Conclusion</h3><div>The current capabilities of GPT-4o are insufficient to make a reliable diagnosis of AF from smartwatch ECGs. Despite the theoretical appeal of leveraging this innovative technology for such purpose, the findings highlight that human expertise remains indispensable. Consumers must remain aware of the current limitations of this technology.</div></div>","PeriodicalId":55064,"journal":{"name":"Heart & Lung","volume":"73 ","pages":"Pages 90-94"},"PeriodicalIF":2.6000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heart & Lung","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0147956325001062","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Current guidelines require physician confirmation for smartwatch-diagnosed atrial fibrillation (AF), increasing telemedicine workloads. The newest ChatGPT-4o (GPT-4o) incorporates advanced image input capabilities.
Objective
To assess GPT-4o’s performance in identifying AF from smartwatch recordings.
Methods
Consecutive 120 patients with AF and 60 controls with sinus rhythm (SR), confirmed by conventional 12-lead ECG, recorded single-lead ECGs using an Apple Watch (AW) Series 6®. Two blinded cardiologists independently classified the smartwatch recordings as AF, SR, or undetermined. GPT-4o was subsequently prompted to analyze all smartwatch ECGs.
Results
Six AF cases were excluded due to undetermined AW-ECG recordings, leaving 114 AF patients (mean age: 73.4 ± 10.4 years) and 60 controls. The AW algorithm achieved 97.3 % and 100 % accuracy for AF and SR, respectively, while GPT-4o correctly analyzed 47.3 % of AF and 71.6 % of SR tracings. None of the AF characteristics—chronicity, heart rate, QRS width, fibrillatory wave amplitude, or R-wave amplitude and polarity—were predictive of GPT-4o's diagnostic accuracy.
Conclusion
The current capabilities of GPT-4o are insufficient to make a reliable diagnosis of AF from smartwatch ECGs. Despite the theoretical appeal of leveraging this innovative technology for such purpose, the findings highlight that human expertise remains indispensable. Consumers must remain aware of the current limitations of this technology.
期刊介绍:
Heart & Lung: The Journal of Cardiopulmonary and Acute Care, the official publication of The American Association of Heart Failure Nurses, presents original, peer-reviewed articles on techniques, advances, investigations, and observations related to the care of patients with acute and critical illness and patients with chronic cardiac or pulmonary disorders.
The Journal''s acute care articles focus on the care of hospitalized patients, including those in the critical and acute care settings. Because most patients who are hospitalized in acute and critical care settings have chronic conditions, we are also interested in the chronically critically ill, the care of patients with chronic cardiopulmonary disorders, their rehabilitation, and disease prevention. The Journal''s heart failure articles focus on all aspects of the care of patients with this condition. Manuscripts that are relevant to populations across the human lifespan are welcome.