An acoustic model of speech dysprosody in patients with Parkinson's disease.

IF 2.4 3区 医学 Q3 NEUROSCIENCES
Frontiers in Human Neuroscience Pub Date : 2025-04-28 eCollection Date: 2025-01-01 DOI:10.3389/fnhum.2025.1566274
Fredrik Nylén
{"title":"An acoustic model of speech dysprosody in patients with Parkinson's disease.","authors":"Fredrik Nylén","doi":"10.3389/fnhum.2025.1566274","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to determine the acoustic properties most indicative of dysprosody severity in patients with Parkinson's disease using an automated acoustic assessment procedure.</p><p><strong>Method: </strong>A total of 108 read speech recordings of 68 speakers with PD (45 male, 23 female, aged 65.0 ± 9.8 yea<i>rs</i>) were made with active levodopa treatment. A total of 40 of the patients were additionally recorded without levodopa treatment to increase the range of dysprosody severity in the sample. Four human clinical experts independently assessed the patients' recordings in terms of dysprosody severity. Separately, a speech processing pipeline extracted the acoustic properties of prosodic relevance from automatically identified portions of speech used as utterance proxies. Five machine learning models were trained on 75% of speech portions and the perceptual evaluations of the speaker's dysprosody severity in a 10-fold cross-validation procedure. They were evaluated regarding their ability to predict the perceptual assessments of recordings excluded during training. The models' performances were assessed by their ability to accurately predict clinical experts' dysprosody severity assessments.</p><p><strong>Results: </strong>The acoustic predictors of importance spanned several acoustic domains of prosodic relevance, with the variability in <i>f</i> <sub><i>o</i></sub> change between intonational turning points and the average first Mel-frequency cepstral coefficient at these points being the two top predictors. While predominant in the literature, variability in utterance-wide <i>f</i> <sub><i>o</i></sub> was <i>fo</i>und to be only the fifth strongest predictor.</p><p><strong>Conclusion: </strong>Human expert raters' assessments of dysprosody can be approximated by the automated procedure, affording application in clinical settings where an experienced expert is unavailable. Variability in pitch does not adequately describe the level of dysprosody due to Parkinson's disease.</p>","PeriodicalId":12536,"journal":{"name":"Frontiers in Human Neuroscience","volume":"19 ","pages":"1566274"},"PeriodicalIF":2.4000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12066530/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Human Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fnhum.2025.1566274","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: This study aimed to determine the acoustic properties most indicative of dysprosody severity in patients with Parkinson's disease using an automated acoustic assessment procedure.

Method: A total of 108 read speech recordings of 68 speakers with PD (45 male, 23 female, aged 65.0 ± 9.8 years) were made with active levodopa treatment. A total of 40 of the patients were additionally recorded without levodopa treatment to increase the range of dysprosody severity in the sample. Four human clinical experts independently assessed the patients' recordings in terms of dysprosody severity. Separately, a speech processing pipeline extracted the acoustic properties of prosodic relevance from automatically identified portions of speech used as utterance proxies. Five machine learning models were trained on 75% of speech portions and the perceptual evaluations of the speaker's dysprosody severity in a 10-fold cross-validation procedure. They were evaluated regarding their ability to predict the perceptual assessments of recordings excluded during training. The models' performances were assessed by their ability to accurately predict clinical experts' dysprosody severity assessments.

Results: The acoustic predictors of importance spanned several acoustic domains of prosodic relevance, with the variability in f o change between intonational turning points and the average first Mel-frequency cepstral coefficient at these points being the two top predictors. While predominant in the literature, variability in utterance-wide f o was found to be only the fifth strongest predictor.

Conclusion: Human expert raters' assessments of dysprosody can be approximated by the automated procedure, affording application in clinical settings where an experienced expert is unavailable. Variability in pitch does not adequately describe the level of dysprosody due to Parkinson's disease.

帕金森病患者言语障碍的声学模型。
目的:本研究旨在使用自动声学评估程序确定帕金森病患者最能指示发音障碍严重程度的声学特性。方法:对68例PD患者(男45例,女23例,年龄65.0±9.8岁)进行主动左旋多巴治疗,录音108份。总共有40名患者在没有左旋多巴治疗的情况下进行了额外记录,以增加样本中不良情绪严重程度的范围。四名人类临床专家独立评估了患者的发音障碍严重程度记录。另外,语音处理管道从自动识别的语音部分提取韵律相关性的声学特性,用作话语代理。在10倍交叉验证过程中,对75%的语音部分和说话者发音障碍严重程度的感知评估进行了5个机器学习模型的训练。研究人员评估了他们预测在训练中被排除的录音的感知评估的能力。这些模型的性能是通过它们准确预测临床专家对言语障碍严重程度评估的能力来评估的。结果:重要的声学预测因子跨越了几个韵律相关的声学域,语调转折点之间的变化可变性和这些点上的平均第一梅尔频率倒谱系数是两个最重要的预测因子。虽然在文献中占主导地位,但在话语范围内的变异性被发现只是第五大预测因子。结论:人类专家评估师对发音障碍的评估可以通过自动化程序进行近似,在没有经验丰富的专家的临床环境中提供应用。音高的变异性并不能充分描述帕金森病引起的发音障碍的程度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Human Neuroscience
Frontiers in Human Neuroscience 医学-神经科学
CiteScore
4.70
自引率
6.90%
发文量
830
审稿时长
2-4 weeks
期刊介绍: Frontiers in Human Neuroscience is a first-tier electronic journal devoted to understanding the brain mechanisms supporting cognitive and social behavior in humans, and how these mechanisms might be altered in disease states. The last 25 years have seen an explosive growth in both the methods and the theoretical constructs available to study the human brain. Advances in electrophysiological, neuroimaging, neuropsychological, psychophysical, neuropharmacological and computational approaches have provided key insights into the mechanisms of a broad range of human behaviors in both health and disease. Work in human neuroscience ranges from the cognitive domain, including areas such as memory, attention, language and perception to the social domain, with this last subject addressing topics, such as interpersonal interactions, social discourse and emotional regulation. How these processes unfold during development, mature in adulthood and often decline in aging, and how they are altered in a host of developmental, neurological and psychiatric disorders, has become increasingly amenable to human neuroscience research approaches. Work in human neuroscience has influenced many areas of inquiry ranging from social and cognitive psychology to economics, law and public policy. Accordingly, our journal will provide a forum for human research spanning all areas of human cognitive, social, developmental and translational neuroscience using any research approach.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信