Elizabeth Knight, Evangelos K Oikonomou, Arya Aminorroaya, Aline F Pedroso, Rohan Khera
{"title":"Wearable-Echo-FM: An ECG-echo foundation model for single lead electrocardiography.","authors":"Elizabeth Knight, Evangelos K Oikonomou, Arya Aminorroaya, Aline F Pedroso, Rohan Khera","doi":"10.1101/2025.06.10.25329163","DOIUrl":null,"url":null,"abstract":"<p><p>Artificial intelligence (AI) models can now detect patterns of structural heart diseases (SHDs) from electrocardiograms (ECGs), though scaling them requires the broader use of single-lead ECGs that are now ubiquitous in wearable and portable devices. However, model development for these devices is limited by a lack of diagnostic labels for SHDs for wearable ECGs. Here, we present Wearable-Echo-FM, a foundation model that encodes single-lead ECGs with information from echocardiographic text reports. Using 274,057 single-lead ECG-echo pairs from 77,378 adults (2015-2019), we contrastively pre-trained convolutional neural network (CNN) and RoBERTa encoders. The ECG encoder was fine-tuned on a distinct progressively larger ECG set (250 to 250,260 ECGs) to detect different cardiac disorders (i) left-ventricular systolic dysfunction (LVSD), (ii) diastolic dysfunction, and (iii) a composite SHD. This was compared with a randomly initialized CNN, with both approaches evaluated in an independent held-out test set. With the full training set, Wearable-Echo-FM matched the baseline CNN (AUROC 0.894 vs 0.884 for LVSD; 0.849 vs 0.843 diastolic dysfunction; 0.887 vs 0.869 composite). With only 0.5% (~1000 ECGs) of data, it markedly outperformed baseline (0.855 vs 0.548; 0.819 vs 0.582; 0.863 vs 0.496, respectively). Contrastive pre-training of single-lead ECGs on echocardiographic text reduces label requirements for SHD screening on wearable and portable devices.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204444/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.06.10.25329163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Artificial intelligence (AI) models can now detect patterns of structural heart diseases (SHDs) from electrocardiograms (ECGs), though scaling them requires the broader use of single-lead ECGs that are now ubiquitous in wearable and portable devices. However, model development for these devices is limited by a lack of diagnostic labels for SHDs for wearable ECGs. Here, we present Wearable-Echo-FM, a foundation model that encodes single-lead ECGs with information from echocardiographic text reports. Using 274,057 single-lead ECG-echo pairs from 77,378 adults (2015-2019), we contrastively pre-trained convolutional neural network (CNN) and RoBERTa encoders. The ECG encoder was fine-tuned on a distinct progressively larger ECG set (250 to 250,260 ECGs) to detect different cardiac disorders (i) left-ventricular systolic dysfunction (LVSD), (ii) diastolic dysfunction, and (iii) a composite SHD. This was compared with a randomly initialized CNN, with both approaches evaluated in an independent held-out test set. With the full training set, Wearable-Echo-FM matched the baseline CNN (AUROC 0.894 vs 0.884 for LVSD; 0.849 vs 0.843 diastolic dysfunction; 0.887 vs 0.869 composite). With only 0.5% (~1000 ECGs) of data, it markedly outperformed baseline (0.855 vs 0.548; 0.819 vs 0.582; 0.863 vs 0.496, respectively). Contrastive pre-training of single-lead ECGs on echocardiographic text reduces label requirements for SHD screening on wearable and portable devices.
人工智能(AI)模型现在可以从心电图(ecg)中检测出结构性心脏病(SHDs)的模式,尽管扩展它们需要更广泛地使用单导联心电图,而单导联心电图现在在可穿戴和便携式设备中无处不在。然而,这些设备的模型开发受到缺乏用于可穿戴心电图的shd诊断标签的限制。在这里,我们提出了可穿戴回声调频,这是一种基础模型,可以用超声心动图文本报告的信息编码单导联心电图。使用77,378名成年人(2015-2019)的274,057对单导联心电图回波,我们对比预训练卷积神经网络(CNN)和RoBERTa编码器。心电图编码器在一个明显的逐渐增大的心电图集(250到250,260个心电图)上进行微调,以检测不同的心脏疾病(i)左心室收缩功能障碍(LVSD), (ii)舒张功能障碍,和(iii)复合SHD。这与随机初始化的CNN进行了比较,两种方法都在一个独立的测试集中进行了评估。在完整的训练集下,可穿戴回声fm与基线CNN匹配(AUROC为0.894 vs LVSD为0.884;0.849 vs 0.843舒张功能不全;0.887 vs 0.869综合指数)。只有0.5%(~1000个心电图)的数据,它明显优于基线(0.855 vs 0.548;0.819 vs 0.582;0.863 vs 0.496)。超声心动图文本上单导联心电图的对比预训练减少了可穿戴和便携式设备上SHD筛查的标签要求。