Maximilian Kapsecker , Matthias C. Möller , Stephan M. Jonas
{"title":"Disentangled representational learning for anomaly detection in single-lead electrocardiogram signals using variational autoencoder","authors":"Maximilian Kapsecker , Matthias C. Möller , Stephan M. Jonas","doi":"10.1016/j.compbiomed.2024.109422","DOIUrl":null,"url":null,"abstract":"<div><div>Wearable technology enables the unsupervised recording of electrocardiogram (ECG) signals. Analyzing these high-dimensional ECG data poses challenges regarding statistical approaches and explainability. This work investigates the feasibility of medically explainable anomaly detection through disentangled representational learning of ECGs and personalization to mitigate inter-subject variations. Five open-source ECG datasets were converted into a set of denoised one-second traces of lead I signal, each covering individual features such as wave morphologies and pathologies. A beta total correlation variational autoencoder was optimized on four of these datasets for 68 systematic parameterization variants. The best-performing model revealed disentanglement in the 12-dimensional embedding space, specifically between atrial- and ventricular features. Within the embedding space, a k-nearest neighbor classifier was evaluated on a left-out test set tailored for anomaly detection. The result is a F1 score of 0.94 for the binary prediction of sinus rhythm and the pathological classes: Left bundle branch block, right bundle branch block, myocardial infarction, and AV block (1st degree). The 90.94% accuracy in anomaly detection falls within the range of established detectors (89.38%–99.77%) but offers the advantage of being explainable and largely unsupervised. Model fine-tuning for each of 100 randomly sampled individuals of the Icentia11k dataset mitigated inter-subject variations. The associated F1 score for predicting normal, premature atrial contraction, and premature ventricular contraction from the embedding space was 0.93. The distribution plots of pathologies along the explainable axis were reasonably consistent with medical expertise. The results suggest the presented disentangled variational autoencoder as a robust method for explainable ECG representation.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"184 ","pages":"Article 109422"},"PeriodicalIF":7.0000,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482524015075","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Wearable technology enables the unsupervised recording of electrocardiogram (ECG) signals. Analyzing these high-dimensional ECG data poses challenges regarding statistical approaches and explainability. This work investigates the feasibility of medically explainable anomaly detection through disentangled representational learning of ECGs and personalization to mitigate inter-subject variations. Five open-source ECG datasets were converted into a set of denoised one-second traces of lead I signal, each covering individual features such as wave morphologies and pathologies. A beta total correlation variational autoencoder was optimized on four of these datasets for 68 systematic parameterization variants. The best-performing model revealed disentanglement in the 12-dimensional embedding space, specifically between atrial- and ventricular features. Within the embedding space, a k-nearest neighbor classifier was evaluated on a left-out test set tailored for anomaly detection. The result is a F1 score of 0.94 for the binary prediction of sinus rhythm and the pathological classes: Left bundle branch block, right bundle branch block, myocardial infarction, and AV block (1st degree). The 90.94% accuracy in anomaly detection falls within the range of established detectors (89.38%–99.77%) but offers the advantage of being explainable and largely unsupervised. Model fine-tuning for each of 100 randomly sampled individuals of the Icentia11k dataset mitigated inter-subject variations. The associated F1 score for predicting normal, premature atrial contraction, and premature ventricular contraction from the embedding space was 0.93. The distribution plots of pathologies along the explainable axis were reasonably consistent with medical expertise. The results suggest the presented disentangled variational autoencoder as a robust method for explainable ECG representation.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.