Forecasting mental states in schizophrenia using digital phenotyping data
Thierry Jean, Rose Guay Hottin, Pierre Orban
PLOS Digital Health, 4(2), e0000734, published 2025-02-07
DOI: 10.1371/journal.pdig.0000734
Citations: 0
Abstract
The promise of machine learning successfully exploiting digital phenotyping data to forecast mental states in psychiatric populations could greatly improve clinical practice. Previous research has focused on binary classification and continuous regression, disregarding the often ordinal nature of prediction targets derived from clinical rating scales. In addition, mental health ratings typically show substantial class imbalance or skewness that needs to be accounted for when evaluating predictive performance. Moreover, it remains unclear which machine learning algorithm is best suited for forecasting tasks, the eXtreme Gradient Boosting (XGBoost) and long short-term memory (LSTM) algorithms being 2 popular choices in digital phenotyping studies. The CrossCheck dataset includes 6,364 mental state surveys using 4-point ordinal rating scales and 23,551 days of smartphone sensor data contributed by patients with schizophrenia. We trained 120 machine learning models to forecast 10 mental states (e.g., Calm, Depressed, Seeing things) from passive sensor data on 2 predictive tasks (ordinal regression, binary classification) with 2 learning algorithms (XGBoost, LSTM) over 3 forecast horizons (same day, next day, next week). A majority of ordinal regression and binary classification models performed significantly above baseline, with macro-averaged mean absolute error values between 1.19 and 0.77 and balanced accuracy between 58% and 73%, which correspond to similar levels of performance once these metrics are scaled.
Results also showed that metrics that do not account for imbalance (mean absolute error, accuracy) systematically overestimated performance, that XGBoost models performed on par with or better than LSTM models, and that a significant yet very small decrease in performance occurred as the forecast horizon expanded.
In conclusion, when using performance metrics that properly account for class imbalance, ordinal forecast models demonstrated performance comparable to the prevalent binary classification approach without losing valuable clinical information from self-reports, thus providing richer and easier-to-interpret predictions.
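To make the imbalance point concrete, here is a minimal, hypothetical sketch (not taken from the paper's code; the toy ratings are invented for illustration) of how plain MAE and accuracy flatter a degenerate majority-class predictor on a skewed 4-point ordinal target, while the imbalance-aware counterparts the abstract relies on (macro-averaged MAE, balanced accuracy) expose it:

```python
from statistics import mean

# Hypothetical, heavily imbalanced 4-point ordinal ratings (0-3), as in
# EMA self-reports; a degenerate model always predicts the majority class.
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 2, 3]
y_pred = [0] * len(y_true)

# Plain MAE is dominated by the majority class, so it looks deceptively good.
plain_mae = mean(abs(t - p) for t, p in zip(y_true, y_pred))  # 0.7

# Macro-averaged MAE: compute MAE per rating level, then average, so every
# level contributes equally regardless of its frequency.
classes = sorted(set(y_true))
macro_mae = mean(
    mean(abs(t - p) for t, p in zip(y_true, y_pred) if t == c)
    for c in classes
)  # 1.5 -- the imbalance-aware metric reveals the failure

# Balanced accuracy on a binarized target (rating >= 1 vs rating 0) is the
# mean of per-class recall, so always predicting 0 scores only chance level.
yt_bin = [int(t >= 1) for t in y_true]
yp_bin = [int(p >= 1) for p in y_pred]

def recall(c):
    return mean(t == p for t, p in zip(yt_bin, yp_bin) if t == c)

bal_acc = mean(recall(c) for c in (0, 1))  # 0.5, vs 0.6 plain accuracy
```

Under this toy distribution the plain metrics (MAE 0.7, accuracy 60%) overstate performance relative to their macro-averaged versions (macro MAE 1.5, balanced accuracy 50%), mirroring the abstract's observation that unadjusted metrics systematically overestimate forecast quality.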