{"title":"一种新的多模态自监督心电图心律失常分类框架。","authors":"Jianqiang Hu, Cheng Li, Jinde Cao, Bo Kou","doi":"10.1016/j.compbiomed.2025.111137","DOIUrl":null,"url":null,"abstract":"<p><p>The electrocardiogram (ECG) has emerged as a primary tool in clinical practice for identifying cardiovascular diseases, owing to its low cost, simplicity, and non-invasiveness. Given the high cost associated with acquiring a substantial amount of ECG signals that require annotation by medical professionals, advanced self-supervised learning (SSL) techniques can effectively leverage abundant unlabeled data for learning, mitigating the performance impact of insufficient ECG classification labels. Contrastive learning has been successful as a self-supervised pre-training approach in image and time series domains. Inspired by this success, a novel pre-training technique, i.e., a simple multimodal self-supervised framework for ECG arrhythmia classification, is proposed in this paper by utilizing multi-modal data from ECG signals to enhance model initialization. Compared to other modalities, the expectation is that representations based on time and frequency for the same example should be brought as close together as possible. The pre-training is achieved through self-supervision by constructing time-domain contrastive learning loss and time-frequency loss, effectively learning features of ECG signals. The proposed method evaluates datasets containing both multi-lead and single-lead ECG data. Experimental results demonstrate that, by applying the pre-training method followed by fine-tuning for downstream tasks, the proposed algorithm outperforms standard contrastive learning paradigms on ACC and AUC, respectively, and even outperforms supervised learning.</p>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"198 Pt A","pages":"111137"},"PeriodicalIF":6.3000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel multimodal self-supervised framework for ECG arrhythmia classification.\",\"authors\":\"Jianqiang Hu, Cheng Li, Jinde Cao, Bo Kou\",\"doi\":\"10.1016/j.compbiomed.2025.111137\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The electrocardiogram (ECG) has emerged as a primary tool in clinical practice for identifying cardiovascular diseases, owing to its low cost, simplicity, and non-invasiveness. Given the high cost associated with acquiring a substantial amount of ECG signals that require annotation by medical professionals, advanced self-supervised learning (SSL) techniques can effectively leverage abundant unlabeled data for learning, mitigating the performance impact of insufficient ECG classification labels. Contrastive learning has been successful as a self-supervised pre-training approach in image and time series domains. Inspired by this success, a novel pre-training technique, i.e., a simple multimodal self-supervised framework for ECG arrhythmia classification, is proposed in this paper by utilizing multi-modal data from ECG signals to enhance model initialization. Compared to other modalities, the expectation is that representations based on time and frequency for the same example should be brought as close together as possible. The pre-training is achieved through self-supervision by constructing time-domain contrastive learning loss and time-frequency loss, effectively learning features of ECG signals. The proposed method evaluates datasets containing both multi-lead and single-lead ECG data. Experimental results demonstrate that, by applying the pre-training method followed by fine-tuning for downstream tasks, the proposed algorithm outperforms standard contrastive learning paradigms on ACC and AUC, respectively, and even outperforms supervised learning.</p>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"198 Pt A\",\"pages\":\"111137\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.compbiomed.2025.111137\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.compbiomed.2025.111137","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
A novel multimodal self-supervised framework for ECG arrhythmia classification.
The electrocardiogram (ECG) has emerged as a primary tool in clinical practice for identifying cardiovascular diseases, owing to its low cost, simplicity, and non-invasiveness. Given the high cost associated with acquiring a substantial amount of ECG signals that require annotation by medical professionals, advanced self-supervised learning (SSL) techniques can effectively leverage abundant unlabeled data for learning, mitigating the performance impact of insufficient ECG classification labels. Contrastive learning has been successful as a self-supervised pre-training approach in image and time series domains. Inspired by this success, a novel pre-training technique, i.e., a simple multimodal self-supervised framework for ECG arrhythmia classification, is proposed in this paper by utilizing multi-modal data from ECG signals to enhance model initialization. Compared to other modalities, the expectation is that representations based on time and frequency for the same example should be brought as close together as possible. The pre-training is achieved through self-supervision by constructing time-domain contrastive learning loss and time-frequency loss, effectively learning features of ECG signals. The proposed method evaluates datasets containing both multi-lead and single-lead ECG data. Experimental results demonstrate that, by applying the pre-training method followed by fine-tuning for downstream tasks, the proposed algorithm outperforms standard contrastive learning paradigms on ACC and AUC, respectively, and even outperforms supervised learning.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.