Thomas P Kutcher, Isha Chakraborty, Kristin Kostick-Quenet, Akane Sano, Nidal Moukaddam, Jeffrey A Herron, Wayne K Goodman, Sameer A Sheth, Ashutosh Sabharwal, Nicole R Provenza
{"title":"Digital Biomarkers for Passive Remote Monitoring of Bipolar Disorder: Systematic Review.","authors":"Thomas P Kutcher, Isha Chakraborty, Kristin Kostick-Quenet, Akane Sano, Nidal Moukaddam, Jeffrey A Herron, Wayne K Goodman, Sameer A Sheth, Ashutosh Sabharwal, Nicole R Provenza","doi":"10.1101/2025.09.23.25336470","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Bipolar disorder (BD) features episodic shifts among (hypo)mania, depression, mixed states, and euthymia. Timely detection of mood transitions is difficult due to infrequent clinical touchpoints. Digital health technologies, including wearables and smartphones, offer a unique opportunity to passively and continuously monitor behavior and physiology that could reflect underlying mood dynamics in real-world settings.</p><p><strong>Objective: </strong>We aim to systematically review passively collected digital biomarkers for BD mood states, characterize devices/modalities and analytic approaches, appraise risk of bias, and identify design gaps and priorities for clinical translation.</p><p><strong>Methods: </strong>Following PRISMA guidelines (PROSPERO CRD42024607765), we searched MEDLINE, PsycINFO, Scopus, IEEE Xplore, and ACM Digital Library (February 7, 2025). We included peer-reviewed studies of adults with BD I/II that measured passively collected digital biomarkers and related them to depressive, (hypo)manic, mixed, or euthymic states. Active-only measures (e.g. lab tests, ecological-momentary assessment) and studies entangling BD with other diagnoses were excluded. Two independent reviewers screened studies and extracted study characteristics and results. We grouped digital biomarkers into categories and conducted narrative synthesis. Risk of bias was assessed with PROBAST (predictive models) and the Newcastle-Ottawa Scale (observational studies).</p><p><strong>Results: </strong>Of 8,355 records, 45 studies met criteria. Most enrolled ≤50 participants (64%) and monitored ≤100 days (49%); 29% collected data only in-clinic. Nine biomarker domains emerged: physical activity, heart rate (HR), electrodermal activity (EDA), geolocation, keyboard use, light exposure, sleep, socialization, and speech. Consistent patterns linked depression to reduced mobility and social interaction, later/variable sleep, and lower daytime light; (hypo)mania was associated with higher and more variable activity, shorter/advanced sleep, and increased communication. Circadian features derived from sleep/activity repeatedly aided prediction. EDA tended to be lower in depression; HRV findings were mixed across settings and methods. Keyboard and speech features (e.g., timing, prosody) showed associations and performed well in classifiers. Fifteen studies used ML; several reported strong performance for episode prediction/classification (AUROC ≈0.80-0.98 in larger cohorts), yet external validation was absent, samples were small, monitoring windows were often short relative to episode timescales, clinical labels were infrequent/misaligned, and missingness was rarely modeled despite likely informativeness.</p><p><strong>Conclusions: </strong>Passive digital biomarkers for BD show promise, with the most robust signals aligning with DSM-5 behavioral and circadian features (sleep-wake patterns, activity/mobility, socialization/geolocation, and speech). To move from promise to practice, future studies should adopt longer within-subject monitoring, align label cadence with sensing granularity, standardize features/reporting, pre-register analyses, externally validate models, minimize data to protect privacy, and expand physiological measurement beyond heart rate and electrodermal activity. These steps are essential to develop reliable, actionable tools for earlier detection and management of BD mood episodes.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12486037/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.09.23.25336470","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Bipolar disorder (BD) features episodic shifts among (hypo)mania, depression, mixed states, and euthymia. Timely detection of mood transitions is difficult due to infrequent clinical touchpoints. Digital health technologies, including wearables and smartphones, offer a unique opportunity to passively and continuously monitor behavior and physiology that could reflect underlying mood dynamics in real-world settings.
Objective: We aim to systematically review passively collected digital biomarkers for BD mood states, characterize devices/modalities and analytic approaches, appraise risk of bias, and identify design gaps and priorities for clinical translation.
Methods: Following PRISMA guidelines (PROSPERO CRD42024607765), we searched MEDLINE, PsycINFO, Scopus, IEEE Xplore, and ACM Digital Library (February 7, 2025). We included peer-reviewed studies of adults with BD I/II that measured passively collected digital biomarkers and related them to depressive, (hypo)manic, mixed, or euthymic states. Active-only measures (e.g. lab tests, ecological-momentary assessment) and studies entangling BD with other diagnoses were excluded. Two independent reviewers screened studies and extracted study characteristics and results. We grouped digital biomarkers into categories and conducted narrative synthesis. Risk of bias was assessed with PROBAST (predictive models) and the Newcastle-Ottawa Scale (observational studies).
Results: Of 8,355 records, 45 studies met criteria. Most enrolled ≤50 participants (64%) and monitored ≤100 days (49%); 29% collected data only in-clinic. Nine biomarker domains emerged: physical activity, heart rate (HR), electrodermal activity (EDA), geolocation, keyboard use, light exposure, sleep, socialization, and speech. Consistent patterns linked depression to reduced mobility and social interaction, later/variable sleep, and lower daytime light; (hypo)mania was associated with higher and more variable activity, shorter/advanced sleep, and increased communication. Circadian features derived from sleep/activity repeatedly aided prediction. EDA tended to be lower in depression; HRV findings were mixed across settings and methods. Keyboard and speech features (e.g., timing, prosody) showed associations and performed well in classifiers. Fifteen studies used ML; several reported strong performance for episode prediction/classification (AUROC ≈0.80-0.98 in larger cohorts), yet external validation was absent, samples were small, monitoring windows were often short relative to episode timescales, clinical labels were infrequent/misaligned, and missingness was rarely modeled despite likely informativeness.
Conclusions: Passive digital biomarkers for BD show promise, with the most robust signals aligning with DSM-5 behavioral and circadian features (sleep-wake patterns, activity/mobility, socialization/geolocation, and speech). To move from promise to practice, future studies should adopt longer within-subject monitoring, align label cadence with sensing granularity, standardize features/reporting, pre-register analyses, externally validate models, minimize data to protect privacy, and expand physiological measurement beyond heart rate and electrodermal activity. These steps are essential to develop reliable, actionable tools for earlier detection and management of BD mood episodes.