Classification of metabolic syndrome subjects and marathon runners with the k-means algorithm using heart rate variability features

Gilberto Perpiñan, E. Severeyn, M. Altuve, Sara Wong
{"title":"Classification of metabolic syndrome subjects and marathon runners with the k-means algorithm using heart rate variability features","authors":"Gilberto Perpiñan, E. Severeyn, M. Altuve, Sara Wong","doi":"10.1109/STSIVA.2016.7743352","DOIUrl":null,"url":null,"abstract":"In this paper, we have applied the k-means clustering algorithm to classify three study groups (people with metabolic syndrome, marathon runners, and sedentary subjects) that underwent a 5-sample 2-hour oral glucose tolerance test (OGTT). For this purpose, time-domain, frequency-domain and non-linear parameters of the heart rate variability (HRV), extracted from ECG recordings acquired at five different instants of the OGTT, were used as unidimensional observations to the k-means algorithm. Specifically, standard deviation of RR intervals (SDNN), root-mean-square differences of successive RR intervals (RMSSD), frequency power in the low frequency (LF) and high-frequency (HF) bands, LF/HF ratio, Poincaré descriptors SD1 and SD2, fractal scaling exponents α1 and α2, and approximate entropy were used as observations. Experiments were carried out with k = 2 and k = 3 clusters and using the squared Euclidean and Cityblock distances. Results showed that the Cityblock distance outperformed the squared Euclidean distance for this kind of observations. In addition, the parameter SDNN at the end of the OGTT gave the best classification performance (69.2%). Parameters SDNN, RMSSD, SD1 and SD2 at fast and at 30 min of the test differentiated subjects with metabolic syndrome with classification a performance greater than 60%.","PeriodicalId":373420,"journal":{"name":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STSIVA.2016.7743352","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

In this paper, we have applied the k-means clustering algorithm to classify three study groups (people with metabolic syndrome, marathon runners, and sedentary subjects) that underwent a 5-sample 2-hour oral glucose tolerance test (OGTT). For this purpose, time-domain, frequency-domain and non-linear parameters of the heart rate variability (HRV), extracted from ECG recordings acquired at five different instants of the OGTT, were used as unidimensional observations to the k-means algorithm. Specifically, standard deviation of RR intervals (SDNN), root-mean-square differences of successive RR intervals (RMSSD), frequency power in the low frequency (LF) and high-frequency (HF) bands, LF/HF ratio, Poincaré descriptors SD1 and SD2, fractal scaling exponents α1 and α2, and approximate entropy were used as observations. Experiments were carried out with k = 2 and k = 3 clusters and using the squared Euclidean and Cityblock distances. Results showed that the Cityblock distance outperformed the squared Euclidean distance for this kind of observations. In addition, the parameter SDNN at the end of the OGTT gave the best classification performance (69.2%). Parameters SDNN, RMSSD, SD1 and SD2 at fast and at 30 min of the test differentiated subjects with metabolic syndrome with classification a performance greater than 60%.
基于心率变异性特征的k-means算法对代谢综合征受试者和马拉松运动员进行分类
在本文中,我们应用k-means聚类算法对三个研究组(代谢综合征患者、马拉松运动员和久坐受试者)进行了5个样本的2小时口服葡萄糖耐量试验(OGTT)。为此,从OGTT的五个不同时刻采集的心电记录中提取心率变异性(HRV)的时域、频域和非线性参数,作为k-means算法的一维观测值。其中,以RR区间的标准差(SDNN)、连续RR区间的均方根差(RMSSD)、低频(LF)和高频(HF)频段的频率功率、LF/HF比、poincar描述子SD1和SD2、分形标度指数α1和α2以及近似熵作为观测值。实验采用k = 2和k = 3的聚类,并使用欧几里得和Cityblock距离的平方进行。结果表明,Cityblock距离优于欧式距离的平方。此外,OGTT结束时参数SDNN的分类性能最好(69.2%)。SDNN、RMSSD、SD1和SD2参数在测试快速和30 min时对代谢综合征受试者进行了分类,分类成功率大于60%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信