Speech Features as Predictors of Momentary Depression Severity in Patients With Depressive Disorder Undergoing Sleep Deprivation Therapy: Ambulatory Assessment Pilot Study.

IF 4.8 2区 医学 Q1 PSYCHIATRY
Jmir Mental Health Pub Date : 2024-01-18 DOI:10.2196/49222
Lisa-Marie Wadle, Ulrich W Ebner-Priemer, Jerome C Foo, Yoshiharu Yamamoto, Fabian Streit, Stephanie H Witt, Josef Frank, Lea Zillich, Matthias F Limberger, Ayimnisagul Ablimit, Tanja Schultz, Maria Gilles, Marcella Rietschel, Lea Sirignano
{"title":"Speech Features as Predictors of Momentary Depression Severity in Patients With Depressive Disorder Undergoing Sleep Deprivation Therapy: Ambulatory Assessment Pilot Study.","authors":"Lisa-Marie Wadle, Ulrich W Ebner-Priemer, Jerome C Foo, Yoshiharu Yamamoto, Fabian Streit, Stephanie H Witt, Josef Frank, Lea Zillich, Matthias F Limberger, Ayimnisagul Ablimit, Tanja Schultz, Maria Gilles, Marcella Rietschel, Lea Sirignano","doi":"10.2196/49222","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The use of mobile devices to continuously monitor objectively extracted parameters of depressive symptomatology is seen as an important step in the understanding and prevention of upcoming depressive episodes. Speech features such as pitch variability, speech pauses, and speech rate are promising indicators, but empirical evidence is limited, given the variability of study designs.</p><p><strong>Objective: </strong>Previous research studies have found different speech patterns when comparing single speech recordings between patients and healthy controls, but only a few studies have used repeated assessments to compare depressive and nondepressive episodes within the same patient. To our knowledge, no study has used a series of measurements within patients with depression (eg, intensive longitudinal data) to model the dynamic ebb and flow of subjectively reported depression and concomitant speech samples. However, such data are indispensable for detecting and ultimately preventing upcoming episodes.</p><p><strong>Methods: </strong>In this study, we captured voice samples and momentary affect ratings over the course of 3 weeks in a sample of patients (N=30) with an acute depressive episode receiving stationary care. Patients underwent sleep deprivation therapy, a chronotherapeutic intervention that can rapidly improve depression symptomatology. We hypothesized that within-person variability in depressive and affective momentary states would be reflected in the following 3 speech features: pitch variability, speech pauses, and speech rate. We parametrized them using the extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) from open-source Speech and Music Interpretation by Large-Space Extraction (openSMILE; audEERING GmbH) and extracted them from a transcript. We analyzed the speech features along with self-reported momentary affect ratings, using multilevel linear regression analysis. We analyzed an average of 32 (SD 19.83) assessments per patient.</p><p><strong>Results: </strong>Analyses revealed that pitch variability, speech pauses, and speech rate were associated with depression severity, positive affect, valence, and energetic arousal; furthermore, speech pauses and speech rate were associated with negative affect, and speech pauses were additionally associated with calmness. Specifically, pitch variability was negatively associated with improved momentary states (ie, lower pitch variability was linked to lower depression severity as well as higher positive affect, valence, and energetic arousal). Speech pauses were negatively associated with improved momentary states, whereas speech rate was positively associated with improved momentary states.</p><p><strong>Conclusions: </strong>Pitch variability, speech pauses, and speech rate are promising features for the development of clinical prediction technologies to improve patient care as well as timely diagnosis and monitoring of treatment response. Our research is a step forward on the path to developing an automated depression monitoring system, facilitating individually tailored treatments and increased patient empowerment.</p>","PeriodicalId":48616,"journal":{"name":"Jmir Mental Health","volume":null,"pages":null},"PeriodicalIF":4.8000,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10835582/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jmir Mental Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/49222","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The use of mobile devices to continuously monitor objectively extracted parameters of depressive symptomatology is seen as an important step in the understanding and prevention of upcoming depressive episodes. Speech features such as pitch variability, speech pauses, and speech rate are promising indicators, but empirical evidence is limited, given the variability of study designs.

Objective: Previous research studies have found different speech patterns when comparing single speech recordings between patients and healthy controls, but only a few studies have used repeated assessments to compare depressive and nondepressive episodes within the same patient. To our knowledge, no study has used a series of measurements within patients with depression (eg, intensive longitudinal data) to model the dynamic ebb and flow of subjectively reported depression and concomitant speech samples. However, such data are indispensable for detecting and ultimately preventing upcoming episodes.

Methods: In this study, we captured voice samples and momentary affect ratings over the course of 3 weeks in a sample of patients (N=30) with an acute depressive episode receiving stationary care. Patients underwent sleep deprivation therapy, a chronotherapeutic intervention that can rapidly improve depression symptomatology. We hypothesized that within-person variability in depressive and affective momentary states would be reflected in the following 3 speech features: pitch variability, speech pauses, and speech rate. We parametrized them using the extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) from open-source Speech and Music Interpretation by Large-Space Extraction (openSMILE; audEERING GmbH) and extracted them from a transcript. We analyzed the speech features along with self-reported momentary affect ratings, using multilevel linear regression analysis. We analyzed an average of 32 (SD 19.83) assessments per patient.

Results: Analyses revealed that pitch variability, speech pauses, and speech rate were associated with depression severity, positive affect, valence, and energetic arousal; furthermore, speech pauses and speech rate were associated with negative affect, and speech pauses were additionally associated with calmness. Specifically, pitch variability was negatively associated with improved momentary states (ie, lower pitch variability was linked to lower depression severity as well as higher positive affect, valence, and energetic arousal). Speech pauses were negatively associated with improved momentary states, whereas speech rate was positively associated with improved momentary states.

Conclusions: Pitch variability, speech pauses, and speech rate are promising features for the development of clinical prediction technologies to improve patient care as well as timely diagnosis and monitoring of treatment response. Our research is a step forward on the path to developing an automated depression monitoring system, facilitating individually tailored treatments and increased patient empowerment.

预测接受睡眠剥夺疗法的抑郁症患者瞬间抑郁严重程度的言语特征:门诊评估试点研究
背景:使用移动设备持续监测客观提取的抑郁症状参数被视为了解和预防即将到来的抑郁发作的重要一步。音调变化、语音停顿和语速等语音特征是很有前景的指标,但由于研究设计的差异性,实证证据很有限:以前的研究发现,在比较患者和健康对照组的单次语音记录时,会发现不同的语音模式,但只有少数研究使用重复评估来比较同一患者的抑郁发作和非抑郁发作。据我们所知,还没有研究使用抑郁症患者的一系列测量数据(如密集的纵向数据)来模拟主观报告的抑郁和伴随的言语样本的动态起伏。然而,这些数据对于检测并最终预防即将到来的抑郁症发作是不可或缺的:在这项研究中,我们采集了接受固定治疗的急性抑郁发作患者(30 人)3 周内的语音样本和瞬间情绪评分。患者接受了睡眠剥夺疗法,这是一种可迅速改善抑郁症状的慢性治疗干预措施。我们假设,抑郁和情感瞬间状态的人际变异将反映在以下三个语音特征中:音调变异、语音停顿和语速。我们使用开源的大空间提取语音和音乐解释(openSMILE; audEERING GmbH)中的扩展日内瓦极简声学参数集(eGeMAPS)对这些特征进行了参数化,并从文字记录中提取了这些特征。我们使用多层次线性回归分析法对语音特征和自我报告的瞬间情感评级进行了分析。我们分析了每位患者平均 32 次(SD 19.83)的评估结果:分析结果显示,音调变异性、语音停顿和语速与抑郁严重程度、积极情绪、情感和能量唤醒有关;此外,语音停顿和语速与消极情绪有关,语音停顿还与平静有关。具体来说,音调变异性与瞬间状态的改善呈负相关(即较低的音调变异性与较低的抑郁严重程度以及较高的积极情绪、情感和精力唤醒有关)。语音停顿与瞬间状态的改善呈负相关,而语速与瞬间状态的改善呈正相关:结论:音调变异、语音停顿和语速是开发临床预测技术的有利特征,可用于改善患者护理、及时诊断和监测治疗反应。我们的研究在开发自动抑郁监测系统的道路上又向前迈进了一步,有助于为患者提供量身定制的治疗,增强患者的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Jmir Mental Health
Jmir Mental Health Medicine-Psychiatry and Mental Health
CiteScore
10.80
自引率
3.80%
发文量
104
审稿时长
16 weeks
期刊介绍: JMIR Mental Health (JMH, ISSN 2368-7959) is a PubMed-indexed, peer-reviewed sister journal of JMIR, the leading eHealth journal (Impact Factor 2016: 5.175). JMIR Mental Health focusses on digital health and Internet interventions, technologies and electronic innovations (software and hardware) for mental health, addictions, online counselling and behaviour change. This includes formative evaluation and system descriptions, theoretical papers, review papers, viewpoint/vision papers, and rigorous evaluations.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信