Preprocessing Selection for Deep Learning Classification of Arrhythmia Using ECG Time-Frequency Representations

Technologies Pub Date : 2023-05-11 DOI:10.3390/technologies11030068

Rafael Holanda, R. Monteiro, C. Bastos-Filho

{"title":"Preprocessing Selection for Deep Learning Classification of Arrhythmia Using ECG Time-Frequency Representations","authors":"Rafael Holanda, R. Monteiro, C. Bastos-Filho","doi":"10.3390/technologies11030068","DOIUrl":null,"url":null,"abstract":"The trend of using deep learning techniques to classify arbitrary tasks has grown significantly in the last decade. Such techniques in the background provide a stack of non-linear functions to solve tasks that cannot be solved in a linear manner. Naturally, deep learning models can always solve almost any problem with the right amount of functional parameters. However, with the right set of preprocessing techniques, these models might become much more accessible by negating the need for a large set of model parameters and the concomitant computational costs that accompany the need for many parameters. This paper studies the effects of such preprocessing techniques, and is focused, more specifically, on the resulting learning representations, so as to classify the arrhythmia task provided by the ECG MIT-BIH signal dataset. The types of noise we filter out from such signals are the Baseline Wander (BW) and the Powerline Interference (PLI). The learning representations we use as input to a Convolutional Neural Network (CNN) model are the spectrograms extracted by the Short-time Fourier Transform (STFT) and the scalograms extracted by the Continuous Wavelet Transform (CWT). These features are extracted using different parameter values, such as the window size of the Fourier Transform and the number of scales from the mother wavelet. We highlight that the noise with the most significant influence on a CNN’s classification performance is the BW noise. The most accurate classification performance was achieved using the 64 wavelet scales scalogram with the Mexican Hat and with only the BW noise suppressed. The deployed CNN has less than 90k parameters and achieved an average F1-Score of 90.11%.","PeriodicalId":22341,"journal":{"name":"Technologies","volume":"9 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/technologies11030068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

The trend of using deep learning techniques to classify arbitrary tasks has grown significantly in the last decade. Such techniques in the background provide a stack of non-linear functions to solve tasks that cannot be solved in a linear manner. Naturally, deep learning models can always solve almost any problem with the right amount of functional parameters. However, with the right set of preprocessing techniques, these models might become much more accessible by negating the need for a large set of model parameters and the concomitant computational costs that accompany the need for many parameters. This paper studies the effects of such preprocessing techniques, and is focused, more specifically, on the resulting learning representations, so as to classify the arrhythmia task provided by the ECG MIT-BIH signal dataset. The types of noise we filter out from such signals are the Baseline Wander (BW) and the Powerline Interference (PLI). The learning representations we use as input to a Convolutional Neural Network (CNN) model are the spectrograms extracted by the Short-time Fourier Transform (STFT) and the scalograms extracted by the Continuous Wavelet Transform (CWT). These features are extracted using different parameter values, such as the window size of the Fourier Transform and the number of scales from the mother wavelet. We highlight that the noise with the most significant influence on a CNN’s classification performance is the BW noise. The most accurate classification performance was achieved using the 64 wavelet scales scalogram with the Mexican Hat and with only the BW noise suppressed. The deployed CNN has less than 90k parameters and achieved an average F1-Score of 90.11%.

查看原文本刊更多论文

基于心电时频表征的心律失常深度学习分类预处理选择

在过去十年中，使用深度学习技术对任意任务进行分类的趋势显著增长。这种技术在后台提供了一堆非线性函数来解决不能用线性方式解决的任务。当然，深度学习模型总是可以用适当数量的功能参数来解决几乎任何问题。然而，使用正确的预处理技术集，这些模型可能会变得更容易访问，因为不需要大量的模型参数集，也不需要大量参数所伴随的计算成本。本文对这些预处理技术的效果进行了研究，更具体地说，是对得到的学习表征进行研究，从而对ECG MIT-BIH信号数据集提供的心律失常任务进行分类。我们从这些信号中过滤出的噪声类型是基线漂移(BW)和电力线干扰(PLI)。作为卷积神经网络(CNN)模型的输入，我们使用的学习表征是短时傅里叶变换(STFT)提取的频谱图和连续小波变换(CWT)提取的尺度图。使用不同的参数值提取这些特征，例如傅里叶变换的窗口大小和母小波的尺度数。我们强调，对CNN分类性能影响最大的噪声是BW噪声。使用带有墨西哥帽的64个小波尺度尺度图，仅抑制BW噪声，获得了最准确的分类性能。部署的CNN参数少于90k，平均F1-Score达到90.11%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Technologies

自引率

0.00%

发文量