从连续活动和心率监测中产生的预测模型的数据质量下降:使用模拟的探索性分析。

Q2 Medicine

JMIR Cardio Pub Date : 2023-05-03 DOI:10.2196/40524

Jason Hearn, Jef Van den Eynde, Bhargava Chinni, Ari Cedars, Danielle Gottlieb Sen, Shelby Kutty, Cedric Manlhiot

{"title":"从连续活动和心率监测中产生的预测模型的数据质量下降:使用模拟的探索性分析。","authors":"Jason Hearn, Jef Van den Eynde, Bhargava Chinni, Ari Cedars, Danielle Gottlieb Sen, Shelby Kutty, Cedric Manlhiot","doi":"10.2196/40524","DOIUrl":null,"url":null,"abstract":"Background: Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously investigated.Objective: The aim of this study is to simulate the effect of data degradation on the reliability of prediction models generated from those data and thus determine the extent to which lower device accuracy might or might not limit their use in clinical settings.Methods: Using the Multilevel Monitoring of Activity and Sleep in Healthy People data set, which includes continuous free-living step count and heart rate data from 21 healthy volunteers, we trained a random forest model to predict cardiac competence. Model performance in 75 perturbed data sets with increasing missingness, noisiness, bias, and a combination of all 3 perturbations was compared to model performance for the unperturbed data set.Results: The unperturbed data set achieved a mean root mean square error (RMSE) of 0.079 (SD 0.001) in predicting cardiac competence index. For all types of perturbations, RMSE remained stable up to 20%-30% perturbation. Above this level, RMSE started increasing and reached the point at which the model was no longer predictive at 80% for noise, 50% for missingness, and 35% for the combination of all perturbations. Introducing systematic bias in the underlying data had no effect on RMSE.Conclusions: In this proof-of-concept study, the performance of predictive models for cardiac competence generated from continuously acquired physiological data was relatively stable with declining quality of the source data. As such, lower accuracy of consumer-oriented wearable devices might not be an absolute contraindication for their use in clinical prediction models.","PeriodicalId":14706,"journal":{"name":"JMIR Cardio","volume":"7 ","pages":"e40524"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193221/pdf/","citationCount":"0","resultStr":"{\"title\":\"Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation.\",\"authors\":\"Jason Hearn, Jef Van den Eynde, Bhargava Chinni, Ari Cedars, Danielle Gottlieb Sen, Shelby Kutty, Cedric Manlhiot\",\"doi\":\"10.2196/40524\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously investigated.Objective: The aim of this study is to simulate the effect of data degradation on the reliability of prediction models generated from those data and thus determine the extent to which lower device accuracy might or might not limit their use in clinical settings.Methods: Using the Multilevel Monitoring of Activity and Sleep in Healthy People data set, which includes continuous free-living step count and heart rate data from 21 healthy volunteers, we trained a random forest model to predict cardiac competence. Model performance in 75 perturbed data sets with increasing missingness, noisiness, bias, and a combination of all 3 perturbations was compared to model performance for the unperturbed data set.Results: The unperturbed data set achieved a mean root mean square error (RMSE) of 0.079 (SD 0.001) in predicting cardiac competence index. For all types of perturbations, RMSE remained stable up to 20%-30% perturbation. Above this level, RMSE started increasing and reached the point at which the model was no longer predictive at 80% for noise, 50% for missingness, and 35% for the combination of all perturbations. Introducing systematic bias in the underlying data had no effect on RMSE.Conclusions: In this proof-of-concept study, the performance of predictive models for cardiac competence generated from continuously acquired physiological data was relatively stable with declining quality of the source data. As such, lower accuracy of consumer-oriented wearable devices might not be an absolute contraindication for their use in clinical prediction models.\",\"PeriodicalId\":14706,\"journal\":{\"name\":\"JMIR Cardio\",\"volume\":\"7 \",\"pages\":\"e40524\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193221/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Cardio\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/40524\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Cardio","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/40524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

摘要

背景:有限的数据准确性经常被认为是在护理管理路径中整合从面向消费者的可穿戴设备获得的生理数据时需要谨慎的原因。从这些数据生成的预测模型的准确性降低的影响以前没有被研究过。目的:本研究的目的是模拟数据退化对由这些数据生成的预测模型可靠性的影响，从而确定较低的设备准确性可能或可能不限制其在临床环境中的使用的程度。方法:利用健康人多层次活动和睡眠监测数据集，包括21名健康志愿者的连续自由生活步数和心率数据，我们训练了一个随机森林模型来预测心脏能力。将缺失度、噪声、偏差和所有3种扰动的组合增加的75个扰动数据集中的模型性能与未扰动数据集的模型性能进行比较。结果:未扰动数据集预测心脏功能指数的均方根误差(RMSE)为0.079 (SD 0.001)。对于所有类型的扰动，RMSE在20%-30%的扰动下保持稳定。超过这个水平，RMSE开始增加，并达到模型不再预测的点，噪声为80%，缺失为50%，所有扰动的组合为35%。在基础数据中引入系统偏差对均方根误差没有影响。结论:在这项概念验证研究中，随着源数据质量的下降，由连续获取的生理数据生成的心脏功能预测模型的性能相对稳定。因此，面向消费者的可穿戴设备的准确性较低，可能并不是它们在临床预测模型中使用的绝对禁忌症。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation.

查看原文本刊更多论文

Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation.

Background: Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously investigated.

Objective: The aim of this study is to simulate the effect of data degradation on the reliability of prediction models generated from those data and thus determine the extent to which lower device accuracy might or might not limit their use in clinical settings.

Methods: Using the Multilevel Monitoring of Activity and Sleep in Healthy People data set, which includes continuous free-living step count and heart rate data from 21 healthy volunteers, we trained a random forest model to predict cardiac competence. Model performance in 75 perturbed data sets with increasing missingness, noisiness, bias, and a combination of all 3 perturbations was compared to model performance for the unperturbed data set.

Results: The unperturbed data set achieved a mean root mean square error (RMSE) of 0.079 (SD 0.001) in predicting cardiac competence index. For all types of perturbations, RMSE remained stable up to 20%-30% perturbation. Above this level, RMSE started increasing and reached the point at which the model was no longer predictive at 80% for noise, 50% for missingness, and 35% for the combination of all perturbations. Introducing systematic bias in the underlying data had no effect on RMSE.

Conclusions: In this proof-of-concept study, the performance of predictive models for cardiac competence generated from continuously acquired physiological data was relatively stable with declining quality of the source data. As such, lower accuracy of consumer-oriented wearable devices might not be an absolute contraindication for their use in clinical prediction models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

JMIR Cardio Computer Science-Computer Science Applications

CiteScore

3.50

自引率

0.00%

发文量

审稿时长

12 weeks