John Kundrick, Aditi Naniwadekar, Virginia Singla, Krishna Kancharla, Aditya Bhonsale, Andrew Voigt, Alaa Shalaby, N.A. Mark Estes, Sandeep K Jain, Samir Saba
{"title":"Machine learning applied to wearable fitness tracker data and the risk of hospitalizations and cardiovascular events","authors":"John Kundrick, Aditi Naniwadekar, Virginia Singla, Krishna Kancharla, Aditya Bhonsale, Andrew Voigt, Alaa Shalaby, N.A. Mark Estes, Sandeep K Jain, Samir Saba","doi":"10.1016/j.ajpc.2025.101006","DOIUrl":null,"url":null,"abstract":"<div><div>Background: Wearable fitness trackers generate extensive physiological and activity data, offering potential to monitor health and predict outcomes. Machine learning (ML) techniques applied to these data may enable early identification of adverse health conditions, such as hospitalizations and development of cardiovascular diseases (CVD). This study aimed to evaluate ML models' ability to forecast the incidence of (1) hospitalizations from any cause and (2) of new diagnosis of CVD, including a composite of heart failure (HF), coronary artery disease or myocardial infarction (CAD-MI), cardiomyopathy (CMP), and atrial fibrillation (AF). Method and Results: Data from 14,157 participants in the All of Us study that included both Fitbit and electronic health record (EHR) information were censored on the date preceding events and analyzed using various ML classifiers for extracted feature data. Performance metrics included accuracy, area under the receiver operating characteristic (AUROC) curve, and F1 scores. Our overall study population was young (median age 54 years), with good representation of women (67%). For hospitalizations, a Random Forest classifier achieved the best performance (AUROC=0.95, accuracy=0.99, F1 score=0.92). For the CVD events, the best prediction model was gradient boosting (AUROC=0.80, accuracy=0.71, F1 score=0.15).</div><div>Conclusion: ML models applied to Fitbit data demonstrate promise in predicting clinical outcomes with strong performance for predicting all-cause hospitalizations and modest performance for predicting incident CVD. Wearable technology could play a role in risk assessment and patient management.</div></div>","PeriodicalId":72173,"journal":{"name":"American journal of preventive cardiology","volume":"22 ","pages":"Article 101006"},"PeriodicalIF":4.3000,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of preventive cardiology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666667725000819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Wearable fitness trackers generate extensive physiological and activity data, offering potential to monitor health and predict outcomes. Machine learning (ML) techniques applied to these data may enable early identification of adverse health conditions, such as hospitalizations and development of cardiovascular diseases (CVD). This study aimed to evaluate ML models' ability to forecast the incidence of (1) hospitalizations from any cause and (2) of new diagnosis of CVD, including a composite of heart failure (HF), coronary artery disease or myocardial infarction (CAD-MI), cardiomyopathy (CMP), and atrial fibrillation (AF). Method and Results: Data from 14,157 participants in the All of Us study that included both Fitbit and electronic health record (EHR) information were censored on the date preceding events and analyzed using various ML classifiers for extracted feature data. Performance metrics included accuracy, area under the receiver operating characteristic (AUROC) curve, and F1 scores. Our overall study population was young (median age 54 years), with good representation of women (67%). For hospitalizations, a Random Forest classifier achieved the best performance (AUROC=0.95, accuracy=0.99, F1 score=0.92). For the CVD events, the best prediction model was gradient boosting (AUROC=0.80, accuracy=0.71, F1 score=0.15).
Conclusion: ML models applied to Fitbit data demonstrate promise in predicting clinical outcomes with strong performance for predicting all-cause hospitalizations and modest performance for predicting incident CVD. Wearable technology could play a role in risk assessment and patient management.
背景:可穿戴健身追踪器产生广泛的生理和活动数据,提供了监测健康和预测结果的潜力。将机器学习(ML)技术应用于这些数据,可以早期识别不良健康状况,例如住院治疗和心血管疾病(CVD)的发展。本研究旨在评估ML模型预测(1)任何原因住院和(2)CVD新诊断发生率的能力,包括心力衰竭(HF)、冠状动脉疾病或心肌梗死(CAD-MI)、心肌病(CMP)和心房颤动(AF)。方法和结果:来自14,157名All of Us研究参与者的数据,包括Fitbit和电子健康记录(EHR)信息,在事件发生之前的日期进行审查,并使用各种ML分类器对提取的特征数据进行分析。性能指标包括准确性、接受者工作特征(AUROC)曲线下面积和F1分数。我们的总体研究人群是年轻的(中位年龄54岁),女性的代表性很好(67%)。对于住院患者,随机森林分类器的表现最好(AUROC=0.95,准确率=0.99,F1评分=0.92)。对于CVD事件,最佳预测模型为梯度增强模型(AUROC=0.80,准确率=0.71,F1评分=0.15)。结论:应用于Fitbit数据的ML模型在预测临床结果方面表现良好,在预测全因住院方面表现良好,在预测心血管疾病发生率方面表现一般。可穿戴技术可以在风险评估和患者管理中发挥作用。