Analysis of running speech for the characterization of mood state in bipolar patients

A. Guidi, E. Scilingo, C. Gentili, G. Bertschy, L. Landini, N. Vanello
{"title":"Analysis of running speech for the characterization of mood state in bipolar patients","authors":"A. Guidi, E. Scilingo, C. Gentili, G. Bertschy, L. Landini, N. Vanello","doi":"10.1109/AEIT.2015.7415275","DOIUrl":null,"url":null,"abstract":"Speech analysis has been proposed for the characterization of subjects' mood state. Specifically, prosodie features have been found to carry information about subjects' depression severity as well as subjects' status in bipolar disorders. In such applications, the subjects have to be monitored continuously, in naturalistic scenarios and not only in the clinical setting. For this reason, it is important to test the robustness of feature extraction approaches against noise as well as to assess their performances as applied to running speech. In this work, the performance of an algorithm designed to estimate speech features from running speech are evaluated on a speech database, containing an associated electroglottographic signal. The algorithm consists of an automatic segmentation step, to detect voiced segments at syllable level, and a speech feature estimation step based on a spectral matching approach. Relevant parameters pertaining voiced segments identification are optimized. The performance of the algorithm in estimating speech features is tested against different noise sources. The chosen speech features are those related to fundamental frequency and its variability, as jitter and standard deviation estimated at syllables level. The results show the good performance of the algorithm in estimating fundamental frequency related features also in noisy environments. Preliminary results on bipolar patients, recorded in different mood states, are shown. Pairwise statistical comparison between different mood states revealed significant differences in fundamental frequency and jitter. A significant effect of the speech task performed by the subjects is observed.","PeriodicalId":368119,"journal":{"name":"2015 AEIT International Annual Conference (AEIT)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 AEIT International Annual Conference (AEIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEIT.2015.7415275","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

Speech analysis has been proposed for the characterization of subjects' mood state. Specifically, prosodie features have been found to carry information about subjects' depression severity as well as subjects' status in bipolar disorders. In such applications, the subjects have to be monitored continuously, in naturalistic scenarios and not only in the clinical setting. For this reason, it is important to test the robustness of feature extraction approaches against noise as well as to assess their performances as applied to running speech. In this work, the performance of an algorithm designed to estimate speech features from running speech are evaluated on a speech database, containing an associated electroglottographic signal. The algorithm consists of an automatic segmentation step, to detect voiced segments at syllable level, and a speech feature estimation step based on a spectral matching approach. Relevant parameters pertaining voiced segments identification are optimized. The performance of the algorithm in estimating speech features is tested against different noise sources. The chosen speech features are those related to fundamental frequency and its variability, as jitter and standard deviation estimated at syllables level. The results show the good performance of the algorithm in estimating fundamental frequency related features also in noisy environments. Preliminary results on bipolar patients, recorded in different mood states, are shown. Pairwise statistical comparison between different mood states revealed significant differences in fundamental frequency and jitter. A significant effect of the speech task performed by the subjects is observed.
奔跑言语对双相情感障碍患者情绪状态表征的分析
语音分析已被提出用于表征受试者的情绪状态。具体地说,韵律特征已经被发现携带有关受试者抑郁严重程度以及受试者在双相情感障碍中的状态的信息。在这样的应用中,受试者必须在自然场景中连续监测,而不仅仅是在临床环境中。因此,测试特征提取方法对噪声的鲁棒性以及评估其应用于运行语音的性能是很重要的。在这项工作中,在包含相关声门电信号的语音数据库上评估了一种旨在从运行语音中估计语音特征的算法的性能。该算法包括一个自动分割步骤,即在音节水平上检测浊音片段,以及一个基于谱匹配方法的语音特征估计步骤。优化了浊音段识别的相关参数。对该算法在不同噪声源下的语音特征估计性能进行了测试。所选择的语音特征是那些与基频及其可变性有关的特征,如在音节水平上估计的抖动和标准偏差。结果表明,在噪声环境下,该算法在估计基频相关特征方面也具有良好的性能。显示了双相情感障碍患者在不同情绪状态下的初步结果。不同情绪状态的两两统计比较显示基频和抖动有显著差异。观察到受试者执行言语任务的显著效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信