{"title":"Augmentation-Free Contrastive Learning for EKG Classification.","authors":"Junheng Wang, Milos Hauskrecht","doi":"10.1007/978-3-031-95838-0_46","DOIUrl":null,"url":null,"abstract":"<p><p>Electrocardiogram (ECG/EKG) analysis is a vital diagnostic tool for assessing heart conditions, extensively used in clinical applications such as patient monitoring, surgical support, and heart disease research. With the rising demand for automated EKG interpretation, particularly for disease diagnosis and waveform labeling, machine learning models have become essential. However, the scarcity of large, well-labeled EKG datasets poses a significant challenge for training EKG classification models in the supervised form. This has shifted the attention towards unsupervised model pre-training, which often outperforms pure supervised methods when only a limited number of labeled data is available. This study explores the adaptation of the contrastive representation learning framework for EKG classification. Traditional contrastive learning methods rely on data augmentations to create diverse views of the same sample, but these augmentations are domain-specific, difficult to design, and can unpredictably impact model performance across different tasks. In this work, we address these limitations by proposing a novel, augmentation-free approach that integrates seamlessly with existing contrastive frameworks by eliminating their dependence on augmentations and hence their potential drawbacks. We evaluate our approach on the PTB-XL [1] dataset, and demonstrate its benefits in the unsupervised model pre-training step. Our solution offers a promising pathway for enhancing cardiac disease diagnostics in data-constrained environments.</p>","PeriodicalId":72303,"journal":{"name":"Artificial intelligence in medicine. Conference on Artificial Intelligence in Medicine (2005- )","volume":"15734 ","pages":"468-479"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12276877/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence in medicine. Conference on Artificial Intelligence in Medicine (2005- )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-031-95838-0_46","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/23 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Electrocardiogram (ECG/EKG) analysis is a vital diagnostic tool for assessing heart conditions, extensively used in clinical applications such as patient monitoring, surgical support, and heart disease research. With the rising demand for automated EKG interpretation, particularly for disease diagnosis and waveform labeling, machine learning models have become essential. However, the scarcity of large, well-labeled EKG datasets poses a significant challenge for training EKG classification models in the supervised form. This has shifted the attention towards unsupervised model pre-training, which often outperforms pure supervised methods when only a limited number of labeled data is available. This study explores the adaptation of the contrastive representation learning framework for EKG classification. Traditional contrastive learning methods rely on data augmentations to create diverse views of the same sample, but these augmentations are domain-specific, difficult to design, and can unpredictably impact model performance across different tasks. In this work, we address these limitations by proposing a novel, augmentation-free approach that integrates seamlessly with existing contrastive frameworks by eliminating their dependence on augmentations and hence their potential drawbacks. We evaluate our approach on the PTB-XL [1] dataset, and demonstrate its benefits in the unsupervised model pre-training step. Our solution offers a promising pathway for enhancing cardiac disease diagnostics in data-constrained environments.