从筛选到精确:寻找声音障碍特定的声学和听觉感知度量。

IF 2.4 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
Eric J Hunter, Lady Catherine Cantor-Cutiva, Patrick R Walden
{"title":"从筛选到精确:寻找声音障碍特定的声学和听觉感知度量。","authors":"Eric J Hunter, Lady Catherine Cantor-Cutiva, Patrick R Walden","doi":"10.1016/j.jvoice.2025.09.007","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Acoustic and auditory-perceptual parameters are common tools for screening clinically significant voice disorders. However, the potential for disorder-specific acoustic signatures that support clinical differential diagnosis remains largely unrealized. Additionally, the robustness of acoustic patterns across different speech materials requires clarification to inform flexible, evidence-based clinical protocols and emerging machine learning applications.</p><p><strong>Methods: </strong>This study investigated disorder-specific metrics and speech material consistency using the Perceptual Voice Qualities Database. Generalized Linear Models examined associations between 14 acoustic parameters and common voice pathologies [Vocal Fold Paralysis (VFP), Atrophy, Lesions, and Muscle Tension Dysphonia (MTD)]. Principal component analysis (PCA) integrated acoustic and auditory-perceptual measures to identify multidimensional voice quality patterns, while Receiver Operating Characteristic (ROC) curves evaluated discriminative performance across sustained vowels and connected speech.</p><p><strong>Results: </strong>Two primary principal components emerged: PC1 (34.7% variance) integrating general voice quality and perceptual ratings, and PC2 (17.3% variance) contrasting temporal stability with harmonic structure. Distinct disorder-specific patterns were identified: VFP demonstrated strong discriminative performance on both components (AUC ≥ 0.75), while Atrophy, Lesions, and MTD showed moderate associations with PC1 (AUC = 0.52-0.66). Preliminary analysis revealed characteristic patterns for Parkinson's disease across both components. Importantly, acoustic patterns remained consistent across speech materials, supporting task-flexible clinical assessment protocols.</p><p><strong>Conclusion: </strong>Specific voice pathologies exhibit distinct acoustic-perceptual signatures that can be reliably identified through multidimensional analysis. These findings support a precision-based approach to voice assessment, moving beyond general screening toward disorder-specific diagnostic applications. The robustness of patterns across speech materials enables flexible clinical protocols, while the integration of acoustic and perceptual measures provides a foundation for enhanced diagnostic tools and machine learning applications.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12494153/pdf/","citationCount":"0","resultStr":"{\"title\":\"From Screening to Precision: Searching for Voice Disorder-Specific Acoustic and Auditory-Perceptual Metrics.\",\"authors\":\"Eric J Hunter, Lady Catherine Cantor-Cutiva, Patrick R Walden\",\"doi\":\"10.1016/j.jvoice.2025.09.007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Acoustic and auditory-perceptual parameters are common tools for screening clinically significant voice disorders. However, the potential for disorder-specific acoustic signatures that support clinical differential diagnosis remains largely unrealized. Additionally, the robustness of acoustic patterns across different speech materials requires clarification to inform flexible, evidence-based clinical protocols and emerging machine learning applications.</p><p><strong>Methods: </strong>This study investigated disorder-specific metrics and speech material consistency using the Perceptual Voice Qualities Database. Generalized Linear Models examined associations between 14 acoustic parameters and common voice pathologies [Vocal Fold Paralysis (VFP), Atrophy, Lesions, and Muscle Tension Dysphonia (MTD)]. Principal component analysis (PCA) integrated acoustic and auditory-perceptual measures to identify multidimensional voice quality patterns, while Receiver Operating Characteristic (ROC) curves evaluated discriminative performance across sustained vowels and connected speech.</p><p><strong>Results: </strong>Two primary principal components emerged: PC1 (34.7% variance) integrating general voice quality and perceptual ratings, and PC2 (17.3% variance) contrasting temporal stability with harmonic structure. Distinct disorder-specific patterns were identified: VFP demonstrated strong discriminative performance on both components (AUC ≥ 0.75), while Atrophy, Lesions, and MTD showed moderate associations with PC1 (AUC = 0.52-0.66). Preliminary analysis revealed characteristic patterns for Parkinson's disease across both components. Importantly, acoustic patterns remained consistent across speech materials, supporting task-flexible clinical assessment protocols.</p><p><strong>Conclusion: </strong>Specific voice pathologies exhibit distinct acoustic-perceptual signatures that can be reliably identified through multidimensional analysis. These findings support a precision-based approach to voice assessment, moving beyond general screening toward disorder-specific diagnostic applications. The robustness of patterns across speech materials enables flexible clinical protocols, while the integration of acoustic and perceptual measures provides a foundation for enhanced diagnostic tools and machine learning applications.</p>\",\"PeriodicalId\":49954,\"journal\":{\"name\":\"Journal of Voice\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12494153/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Voice\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jvoice.2025.09.007\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2025.09.007","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:声学和听觉感知参数是筛查临床显著声音障碍的常用工具。然而,支持临床鉴别诊断的疾病特异性声学特征的潜力在很大程度上仍未实现。此外,不同语音材料的声学模式的鲁棒性需要澄清,以便为灵活的、基于证据的临床协议和新兴的机器学习应用提供信息。方法:本研究使用感知语音质量数据库调查了障碍特定度量和语音材料一致性。广义线性模型检验了14种声学参数与常见声音病理[声带麻痹(VFP)、萎缩、病变和肌张力性发声障碍(MTD)]之间的关系。主成分分析(PCA)集成了声学和听觉感知测量来识别多维语音质量模式,而接收者工作特征(ROC)曲线评估了持续元音和连接语音的判别性能。结果:出现了两个主要的主成分:PC1(34.7%方差)整合了一般语音质量和感知评级,PC2(17.3%方差)对比了时间稳定性和谐波结构。我们发现了不同的疾病特异性模式:VFP在两个成分上都表现出很强的区别性(AUC≥0.75),而萎缩、病变和MTD与PC1表现出中度相关性(AUC = 0.52-0.66)。初步分析揭示了帕金森氏症的两种特征模式。重要的是,声学模式在语音材料中保持一致,支持任务灵活的临床评估方案。结论:特定的语音病理表现出独特的声学感知特征,可以通过多维分析可靠地识别。这些发现支持了一种基于精确的语音评估方法,从一般筛查转向特定疾病的诊断应用。跨语音材料模式的鲁棒性使临床协议变得灵活,而声学和感知测量的集成为增强的诊断工具和机器学习应用提供了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
From Screening to Precision: Searching for Voice Disorder-Specific Acoustic and Auditory-Perceptual Metrics.

Background: Acoustic and auditory-perceptual parameters are common tools for screening clinically significant voice disorders. However, the potential for disorder-specific acoustic signatures that support clinical differential diagnosis remains largely unrealized. Additionally, the robustness of acoustic patterns across different speech materials requires clarification to inform flexible, evidence-based clinical protocols and emerging machine learning applications.

Methods: This study investigated disorder-specific metrics and speech material consistency using the Perceptual Voice Qualities Database. Generalized Linear Models examined associations between 14 acoustic parameters and common voice pathologies [Vocal Fold Paralysis (VFP), Atrophy, Lesions, and Muscle Tension Dysphonia (MTD)]. Principal component analysis (PCA) integrated acoustic and auditory-perceptual measures to identify multidimensional voice quality patterns, while Receiver Operating Characteristic (ROC) curves evaluated discriminative performance across sustained vowels and connected speech.

Results: Two primary principal components emerged: PC1 (34.7% variance) integrating general voice quality and perceptual ratings, and PC2 (17.3% variance) contrasting temporal stability with harmonic structure. Distinct disorder-specific patterns were identified: VFP demonstrated strong discriminative performance on both components (AUC ≥ 0.75), while Atrophy, Lesions, and MTD showed moderate associations with PC1 (AUC = 0.52-0.66). Preliminary analysis revealed characteristic patterns for Parkinson's disease across both components. Importantly, acoustic patterns remained consistent across speech materials, supporting task-flexible clinical assessment protocols.

Conclusion: Specific voice pathologies exhibit distinct acoustic-perceptual signatures that can be reliably identified through multidimensional analysis. These findings support a precision-based approach to voice assessment, moving beyond general screening toward disorder-specific diagnostic applications. The robustness of patterns across speech materials enables flexible clinical protocols, while the integration of acoustic and perceptual measures provides a foundation for enhanced diagnostic tools and machine learning applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Voice
Journal of Voice 医学-耳鼻喉科学
CiteScore
4.00
自引率
13.60%
发文量
395
审稿时长
59 days
期刊介绍: The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信