Sanketa Hegde, Merten Prüser, Nikola Cenic, Anatol Bollinger, Marie Arens, Jan Köhlen, Eimo Martens, Christoph Dieterich
{"title":"ECG Synthesis and Utility Analysis - A Diffusion Model Based Approach.","authors":"Sanketa Hegde, Merten Prüser, Nikola Cenic, Anatol Bollinger, Marie Arens, Jan Köhlen, Eimo Martens, Christoph Dieterich","doi":"10.3233/SHTI251414","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>With the growing demand for privacy-preserving healthcare solutions, the generation of synthetic electrocardiograms (ECGs) offers a valuable alternative to using real patient data.</p><p><strong>Methods: </strong>In this study, we present the adaptation of the SSSD-ECG diffusion model to generate high-quality synthetic 12-lead ECGs for Sinus Rhythm/Normal and Atrial Fibrillation (AF) conditions using 10-second recordings from the 12-lead MIMIC-IV ECG dataset.</p><p><strong>Results: </strong>We validate the utility of the generated ECGs through downstream classification tasks, with models trained on synthetic ECG features achieving an F1-score of 0.80 when tested on real data, and 0.91 when trained on real data and tested on synthetic data. Additionally, blind tests conducted by physicians at two university hospital sites demonstrated that the synthetic signals effectively mimic real ECGs in both morphology and key features.</p><p><strong>Conclusion: </strong>This work establishes diffusion-based models as an effective tool for generating realistic synthetic ECGs, providing valuable resources for model development, supporting testing of clinical decision- making solutions, and enabling research in contexts where real data is scarce or not shareable.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"331 ","pages":"346-356"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: With the growing demand for privacy-preserving healthcare solutions, the generation of synthetic electrocardiograms (ECGs) offers a valuable alternative to using real patient data.
Methods: In this study, we present the adaptation of the SSSD-ECG diffusion model to generate high-quality synthetic 12-lead ECGs for Sinus Rhythm/Normal and Atrial Fibrillation (AF) conditions using 10-second recordings from the 12-lead MIMIC-IV ECG dataset.
Results: We validate the utility of the generated ECGs through downstream classification tasks, with models trained on synthetic ECG features achieving an F1-score of 0.80 when tested on real data, and 0.91 when trained on real data and tested on synthetic data. Additionally, blind tests conducted by physicians at two university hospital sites demonstrated that the synthetic signals effectively mimic real ECGs in both morphology and key features.
Conclusion: This work establishes diffusion-based models as an effective tool for generating realistic synthetic ECGs, providing valuable resources for model development, supporting testing of clinical decision- making solutions, and enabling research in contexts where real data is scarce or not shareable.