M. R. Bogdanov, G. R. Shakhmametova, I. S. Shaibakov, N. Oskin
{"title":"Opportunities to Reduce the Risk of Cardiovascular Death by Improving Machine Learning Methods","authors":"M. R. Bogdanov, G. R. Shakhmametova, I. S. Shaibakov, N. Oskin","doi":"10.17587/it.30.159-167","DOIUrl":null,"url":null,"abstract":"Improving algorithms for automatic recognition of electrocardiograms requires increasing of training dataset, which is not always possible due to the rarity of certain cardiac diseases or ethical issues. It is possible to improve the algorithms for generating synthetic electrocardiograms using mechanistic models and generative-descriptive neural networks (GANs). At the same time, when evaluating the effectiveness of the proposed solutions, various authors offer different quality assessment metrics from subjective expert assessment to the squared mean error. We compare two approaches to generating synthetic electrocardiograms: pseudo-ECG generation using a one-dimensional cardiomyocyte model and GAN based on long-term short-term memory in terms of machine learning metrics: accuracy, recall, f1-score. We solve the problem of binary classification with bagging method, class 0 — normal sinus rhythm, class 1 — atrial fibrillation. We found that classifier trained on synthetic ECGs generated using GAN is slightly more effective compared to pseudo-ECGs generated using mechanistic models ones. At the same time, it turned out that GANs are not stable enough. The generation of synthetic electrocardiograms using both mechanistic models and GAN can be used to enrich the training set in case of recognition of rare cardiac diseases.","PeriodicalId":504905,"journal":{"name":"Informacionnye Tehnologii","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informacionnye Tehnologii","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17587/it.30.159-167","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Improving algorithms for automatic recognition of electrocardiograms requires increasing of training dataset, which is not always possible due to the rarity of certain cardiac diseases or ethical issues. It is possible to improve the algorithms for generating synthetic electrocardiograms using mechanistic models and generative-descriptive neural networks (GANs). At the same time, when evaluating the effectiveness of the proposed solutions, various authors offer different quality assessment metrics from subjective expert assessment to the squared mean error. We compare two approaches to generating synthetic electrocardiograms: pseudo-ECG generation using a one-dimensional cardiomyocyte model and GAN based on long-term short-term memory in terms of machine learning metrics: accuracy, recall, f1-score. We solve the problem of binary classification with bagging method, class 0 — normal sinus rhythm, class 1 — atrial fibrillation. We found that classifier trained on synthetic ECGs generated using GAN is slightly more effective compared to pseudo-ECGs generated using mechanistic models ones. At the same time, it turned out that GANs are not stable enough. The generation of synthetic electrocardiograms using both mechanistic models and GAN can be used to enrich the training set in case of recognition of rare cardiac diseases.
改进心电图自动识别算法需要增加训练数据集,但由于某些心脏疾病的罕见性或伦理问题,这并不总是可能的。利用机理模型和生成-描述神经网络(GAN)改进生成合成心电图的算法是可行的。同时,在评估所提出解决方案的有效性时,不同的作者提出了不同的质量评估指标,从专家主观评估到均方误差。我们比较了生成合成心电图的两种方法:使用一维心肌细胞模型生成伪心电图和基于长期短期记忆的 GAN,并从准确率、召回率和 f1 分数等机器学习指标方面进行了比较。我们用袋装法解决了二元分类问题,0 类--正常窦性心律,1 类--心房颤动。我们发现,与使用机理模型生成的伪心电图相比,在使用 GAN 生成的合成心电图上训练的分类器略微更有效。与此同时,我们发现 GAN 还不够稳定。在识别罕见心脏病时,使用机理模型和 GAN 生成合成心电图可用于丰富训练集。