Observation Time vs. Performance in Digital Phenotyping

Thomas R Quisel, Wei-Nchih Lee, L. Foschini
{"title":"Observation Time vs. Performance in Digital Phenotyping","authors":"Thomas R Quisel, Wei-Nchih Lee, L. Foschini","doi":"10.1145/3089341.3089347","DOIUrl":null,"url":null,"abstract":"Mobile health (mHealth) technologies enable frequent sampling of physiological and psychological signals over time. In our recent work we used a convolutional neural network (CNN) model to predict self-reported phenotypes of chronic conditions from step and sleep data recorded from passive trackers in free living conditions. We investigated the impact of the time-granularity of the collected data and showed that training the models on higher- resolution (minute-level) data improved classification performance on conditions related to mental health and nervous system disorders, as compared to using only day-level totals. In the present work we shift the focus from the time resolution of the observation window to its duration. We study how the performance of the best-performing model on the highest-resolution data changes as the length of the data collection window is varied from 3 to 147 days for each user. We found that for mental health and nervous system disorders, a model trained on 30 days of mHealth data attains the same performance as using the full 147-day window of data, in terms of AUC increase over a baseline model that uses only demographics, height, and weight. Additionally, for the same cluster of conditions, only 7 days of data are sufficient to realize 62% of the maximum increase in AUC over baseline attainable using the full window. The results suggest that for some conditions health-related digital phenotyping in free-living conditions can potentially be performed in a relatively short amount of time, imposing minimal disruptions on user habits.","PeriodicalId":92197,"journal":{"name":"DigitalBiomarkers'17 : proceedings of the 1st Workshop on Digital Biomarkers : June 23, 2017, Niagara Falls, NY, USA. Workshop on Digital Biomarkers (1st : 2017 : Niagara Falls, N.Y.)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DigitalBiomarkers'17 : proceedings of the 1st Workshop on Digital Biomarkers : June 23, 2017, Niagara Falls, NY, USA. Workshop on Digital Biomarkers (1st : 2017 : Niagara Falls, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3089341.3089347","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Mobile health (mHealth) technologies enable frequent sampling of physiological and psychological signals over time. In our recent work we used a convolutional neural network (CNN) model to predict self-reported phenotypes of chronic conditions from step and sleep data recorded from passive trackers in free living conditions. We investigated the impact of the time-granularity of the collected data and showed that training the models on higher- resolution (minute-level) data improved classification performance on conditions related to mental health and nervous system disorders, as compared to using only day-level totals. In the present work we shift the focus from the time resolution of the observation window to its duration. We study how the performance of the best-performing model on the highest-resolution data changes as the length of the data collection window is varied from 3 to 147 days for each user. We found that for mental health and nervous system disorders, a model trained on 30 days of mHealth data attains the same performance as using the full 147-day window of data, in terms of AUC increase over a baseline model that uses only demographics, height, and weight. Additionally, for the same cluster of conditions, only 7 days of data are sufficient to realize 62% of the maximum increase in AUC over baseline attainable using the full window. The results suggest that for some conditions health-related digital phenotyping in free-living conditions can potentially be performed in a relatively short amount of time, imposing minimal disruptions on user habits.
观察时间与数字表型的表现
移动医疗(mHealth)技术能够在一段时间内频繁采样生理和心理信号。在我们最近的工作中,我们使用卷积神经网络(CNN)模型,根据被动跟踪器在自由生活条件下记录的步骤和睡眠数据,预测慢性病的自我报告表型。我们调查了所收集数据的时间粒度的影响,并表明与仅使用日级别的总数相比,在更高分辨率(分钟级别)数据上训练模型可以提高对与精神健康和神经系统疾病相关的条件的分类性能。在本工作中,我们将重点从观测窗口的时间分辨率转移到其持续时间。我们研究了当每个用户的数据收集窗口长度从3天到147天不等时,在最高分辨率数据上表现最佳的模型的性能是如何变化的。我们发现,对于心理健康和神经系统疾病,在使用仅使用人口统计、身高和体重的基线模型的AUC增加方面,使用30天移动健康数据训练的模型与使用完整147天数据窗口的模型具有相同的性能。此外,对于同一组条件,仅7天的数据就足以实现使用全窗口可实现的AUC比基线最大增幅的62%。研究结果表明,在一些条件下,在自由生活条件下与健康相关的数字表型可以在相对较短的时间内进行,对用户习惯的干扰最小。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信