Predicting Sleep and Sleep Stage in Children Using Actigraphy and Heartrate via a Long Short-Term Memory Deep Learning Algorithm: A Performance Evaluation.
R Glenn Weaver, James W White, Olivia Finnegan, Hongpeng Yang, Zifei Zhong, Keagan Kiely, Catherine Jones, Yan Tong, Srihari Nelakuditi, Rahul Ghosal, David E Brown, Russ Pate, Gregory J Welk, Massimiliano de Zambotti, Yuan Wang, Sarah Burkart, Elizabeth L Adams, Bridget Armstrong, Michael W Beets
{"title":"Predicting Sleep and Sleep Stage in Children Using Actigraphy and Heartrate via a Long Short-Term Memory Deep Learning Algorithm: A Performance Evaluation.","authors":"R Glenn Weaver, James W White, Olivia Finnegan, Hongpeng Yang, Zifei Zhong, Keagan Kiely, Catherine Jones, Yan Tong, Srihari Nelakuditi, Rahul Ghosal, David E Brown, Russ Pate, Gregory J Welk, Massimiliano de Zambotti, Yuan Wang, Sarah Burkart, Elizabeth L Adams, Bridget Armstrong, Michael W Beets","doi":"10.1111/jsr.70149","DOIUrl":null,"url":null,"abstract":"<p><p>Children's ambulatory sleep is commonly measured via actigraphy. However, traditional actigraphy measured sleep (e.g., Sadeh algorithm) struggles to predict wake (i.e., specificity, values typically < 70) and cannot predict sleep stages. Long short-term memory (LSTM) is a machine learning algorithm that may address these deficiencies. This study evaluated the agreement of LSTM sleep estimates from actigraphy and heartrate (HR) data with polysomnography (PSG). Children (N = 238, 5-12 years, 52.8% male, 50% Black 31.9% White) participated in an overnight laboratory polysomnography. Participants were referred because of suspected sleep disruptions. Children wore an ActiGraph GT9X accelerometer and two of three consumer wearables (i.e., Apple Watch Series 7, Fitbit Sense, Garmin Vivoactive 4) on their non-dominant wrist during the polysomnogram. LSTM estimated sleep versus wake and sleep stage (wake, not-REM, REM) using raw actigraphy and HR data for each 30-s epoch. Logistic regression and random forest were also estimated as a benchmark for performance with which to compare the LSTM results. A 10-fold cross-validation technique was employed, and confusion matrices were constructed. Sensitivity and specificity were calculated to assess the agreement between research-grade and consumer wearables with the criterion polysomnography. For sleep versus wake classification, LSTM outperformed logistic regression and random forest with accuracy ranging from 94.1 to 95.1, sensitivity ranging from 94.9 to 95.9 across different devices, and specificity ranging from 84.5 to 89.6. The addition of HR improved the prediction of sleep stages but not binary sleep versus wake. LSTM is promising for predicting sleep and sleep staging from actigraphy data, and HR may improve sleep stage prediction.</p>","PeriodicalId":17057,"journal":{"name":"Journal of Sleep Research","volume":" ","pages":"e70149"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Sleep Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/jsr.70149","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Children's ambulatory sleep is commonly measured via actigraphy. However, traditional actigraphy measured sleep (e.g., Sadeh algorithm) struggles to predict wake (i.e., specificity, values typically < 70) and cannot predict sleep stages. Long short-term memory (LSTM) is a machine learning algorithm that may address these deficiencies. This study evaluated the agreement of LSTM sleep estimates from actigraphy and heartrate (HR) data with polysomnography (PSG). Children (N = 238, 5-12 years, 52.8% male, 50% Black 31.9% White) participated in an overnight laboratory polysomnography. Participants were referred because of suspected sleep disruptions. Children wore an ActiGraph GT9X accelerometer and two of three consumer wearables (i.e., Apple Watch Series 7, Fitbit Sense, Garmin Vivoactive 4) on their non-dominant wrist during the polysomnogram. LSTM estimated sleep versus wake and sleep stage (wake, not-REM, REM) using raw actigraphy and HR data for each 30-s epoch. Logistic regression and random forest were also estimated as a benchmark for performance with which to compare the LSTM results. A 10-fold cross-validation technique was employed, and confusion matrices were constructed. Sensitivity and specificity were calculated to assess the agreement between research-grade and consumer wearables with the criterion polysomnography. For sleep versus wake classification, LSTM outperformed logistic regression and random forest with accuracy ranging from 94.1 to 95.1, sensitivity ranging from 94.9 to 95.9 across different devices, and specificity ranging from 84.5 to 89.6. The addition of HR improved the prediction of sleep stages but not binary sleep versus wake. LSTM is promising for predicting sleep and sleep staging from actigraphy data, and HR may improve sleep stage prediction.
期刊介绍:
The Journal of Sleep Research is dedicated to basic and clinical sleep research. The Journal publishes original research papers and invited reviews in all areas of sleep research (including biological rhythms). The Journal aims to promote the exchange of ideas between basic and clinical sleep researchers coming from a wide range of backgrounds and disciplines. The Journal will achieve this by publishing papers which use multidisciplinary and novel approaches to answer important questions about sleep, as well as its disorders and the treatment thereof.