An active learning enhanced data programming (ActDP) framework for ECG time series

Priyanka Gupta, Manik Gupta, Vijay Kumar
{"title":"An active learning enhanced data programming (ActDP) framework for ECG time series","authors":"Priyanka Gupta, Manik Gupta, Vijay Kumar","doi":"10.1088/2632-2153/ad5fda","DOIUrl":null,"url":null,"abstract":"\n Supervised machine learning requires the estimation of multiple parameters by using large amounts of labelled data. Getting labelled data generally requires a substantial allocation of resources in terms of both cost and time. In such scenarios, weak supervised learning techniques like data programming (DP) and active learning (AL) can be advantageous for time-series classification tasks. These paradigms can be used to assign data labels in an automated manner, and time-series classification can subsequently be carried out on the labelled data. This work proposes a novel framework titled active learning enhanced data programming (ActDP). It uses DP and AL for ECG classification using single-lead data. ECG classification is pivotal in cardiology and healthcare for diagnosing a broad spectrum of heart conditions and arrhythmias. To establish the usefulness of this proposed ActDP framework, the experiments have been conducted using the MIT-BIH dataset with 94,224 ECG beats. DP assigns a probabilistic label to each ECG beat using nine novel polar labelling functions and a generative model in this work. Further, AL improves the result of DP by replacing the labels for sampled ECG beats of a generative model with ground truth. Subsequently, a discriminative model is trained on these labels for each iteration. The experimental results show that by incorporating AL to DP in the ActDP framework, the accuracy of ECG classification strictly increases from 85.7 % to 97.34 % in 58 iterations. Comparatively, the proposed framework (ActDP) has demonstrated a higher classification accuracy of 97.34 % In contrast, DP with data augmentation (DA) achieves an accuracy of 92.2 %, while DP without DA results in an accuracy of 85.7 %, majority vote yields an accuracy of 50.2 %, and the generative model achieves an accuracy of only 66.5 %.","PeriodicalId":503691,"journal":{"name":"Machine Learning: Science and Technology","volume":"132 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Learning: Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2632-2153/ad5fda","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Supervised machine learning requires the estimation of multiple parameters by using large amounts of labelled data. Getting labelled data generally requires a substantial allocation of resources in terms of both cost and time. In such scenarios, weak supervised learning techniques like data programming (DP) and active learning (AL) can be advantageous for time-series classification tasks. These paradigms can be used to assign data labels in an automated manner, and time-series classification can subsequently be carried out on the labelled data. This work proposes a novel framework titled active learning enhanced data programming (ActDP). It uses DP and AL for ECG classification using single-lead data. ECG classification is pivotal in cardiology and healthcare for diagnosing a broad spectrum of heart conditions and arrhythmias. To establish the usefulness of this proposed ActDP framework, the experiments have been conducted using the MIT-BIH dataset with 94,224 ECG beats. DP assigns a probabilistic label to each ECG beat using nine novel polar labelling functions and a generative model in this work. Further, AL improves the result of DP by replacing the labels for sampled ECG beats of a generative model with ground truth. Subsequently, a discriminative model is trained on these labels for each iteration. The experimental results show that by incorporating AL to DP in the ActDP framework, the accuracy of ECG classification strictly increases from 85.7 % to 97.34 % in 58 iterations. Comparatively, the proposed framework (ActDP) has demonstrated a higher classification accuracy of 97.34 % In contrast, DP with data augmentation (DA) achieves an accuracy of 92.2 %, while DP without DA results in an accuracy of 85.7 %, majority vote yields an accuracy of 50.2 %, and the generative model achieves an accuracy of only 66.5 %.
针对心电图时间序列的主动学习增强型数据编程(ActDP)框架
有监督的机器学习需要使用大量标记数据来估计多个参数。获取标记数据通常需要在成本和时间上分配大量资源。在这种情况下,数据编程(DP)和主动学习(AL)等弱监督学习技术在时间序列分类任务中具有优势。这些范式可用于自动分配数据标签,随后对标签数据进行时间序列分类。这项工作提出了一个名为主动学习增强数据编程(ActDP)的新框架。它使用 DP 和 AL 对单导联数据进行心电图分类。心电图分类在心脏病学和医疗保健中至关重要,可用于诊断各种心脏疾病和心律失常。为了证明所提出的 ActDP 框架的实用性,我们使用包含 94,224 个心电图节拍的麻省理工学院-BIH 数据集进行了实验。在这项工作中,DP 使用九个新颖的极性标签函数和一个生成模型为每个心电图搏动分配一个概率标签。此外,AL 通过用地面实况替换生成模型的心电图搏动采样标签,改进了 DP 的结果。随后,每次迭代都会根据这些标签训练一个判别模型。实验结果表明,在 ActDP 框架中将 AL 加入 DP 后,心电图分类的准确率在 58 次迭代中从 85.7% 严格提高到 97.34%。相比之下,所提出的框架(ActDP)的分类准确率更高,达到 97.34%。相比之下,带有数据增强(DA)的 DP 的准确率为 92.2%,而不带数据增强的 DP 的准确率为 85.7%,多数投票的准确率为 50.2%,生成模型的准确率仅为 66.5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信