Wenhan Liu , Shurong Pan , Sheng Chang , Qijun Huang , Nan Jiang
{"title":"Self-supervised learning for Electrocardiogram classification using Lead Correlation and Decorrelation","authors":"Wenhan Liu , Shurong Pan , Sheng Chang , Qijun Huang , Nan Jiang","doi":"10.1016/j.asoc.2025.112871","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, the development of deep learning has shown potential in the automatic analysis of electrocardiogram (ECG), aiding cardiologists in detecting cardiovascular diseases (CVDs). Generally, deep learning models depend on numerous labeled ECGs to train, but manual labeling of ECGs is costly as it requires considerable time and expertise. Self-supervised learning (SSL) can solve this problem by pretraining deep learning models with unlabeled ECGs, mitigating their reliance on labeled ECGs. This work proposes lead correlation and decorrelation (LCD) for effective and efficient SSL of ECGs. Concretely, LCD combines intra-lead correlation, inter-lead correlation, intra-lead and inter-lead decorrelation in pretraining. These mechanisms utilize multilead ECG characteristics: intra-lead invariance, inter-lead invariance, inter-lead variance, and intra-lead redundancy. After pretraining, LCD can provide a generic encoder for feature extraction of any ECG lead in a classification task. Benefitting from the effective pretraining mechanism, models with the encoders pretrained by LCD outperform most of the baselines. Compared with the best baseline, they achieve better/comparable classification performances in the same tasks with less pretraining time. Furthermore, LCD helps the models focus on critical features when training with insufficient labeled ECGs, reducing the reliance on labeled ECGs by 4<span><math><mo>∼</mo></math></span>6<span><math><mo>×</mo></math></span>. All the results demonstrate that LCD is an effective and efficient method, boosting a broader application of deep learning to automatic ECG analysis. The code is available at <span><span>https://github.com/Aiwiscal/ECG_SSL_LCD</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"172 ","pages":"Article 112871"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625001826","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, the development of deep learning has shown potential in the automatic analysis of electrocardiogram (ECG), aiding cardiologists in detecting cardiovascular diseases (CVDs). Generally, deep learning models depend on numerous labeled ECGs to train, but manual labeling of ECGs is costly as it requires considerable time and expertise. Self-supervised learning (SSL) can solve this problem by pretraining deep learning models with unlabeled ECGs, mitigating their reliance on labeled ECGs. This work proposes lead correlation and decorrelation (LCD) for effective and efficient SSL of ECGs. Concretely, LCD combines intra-lead correlation, inter-lead correlation, intra-lead and inter-lead decorrelation in pretraining. These mechanisms utilize multilead ECG characteristics: intra-lead invariance, inter-lead invariance, inter-lead variance, and intra-lead redundancy. After pretraining, LCD can provide a generic encoder for feature extraction of any ECG lead in a classification task. Benefitting from the effective pretraining mechanism, models with the encoders pretrained by LCD outperform most of the baselines. Compared with the best baseline, they achieve better/comparable classification performances in the same tasks with less pretraining time. Furthermore, LCD helps the models focus on critical features when training with insufficient labeled ECGs, reducing the reliance on labeled ECGs by 46. All the results demonstrate that LCD is an effective and efficient method, boosting a broader application of deep learning to automatic ECG analysis. The code is available at https://github.com/Aiwiscal/ECG_SSL_LCD.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.