Eric J Hunter, Lady Catherine Cantor-Cutiva, Patrick R Walden
{"title":"从筛选到精确:寻找声音障碍特定的声学和听觉感知度量。","authors":"Eric J Hunter, Lady Catherine Cantor-Cutiva, Patrick R Walden","doi":"10.1016/j.jvoice.2025.09.007","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Acoustic and auditory-perceptual parameters are common tools for screening clinically significant voice disorders. However, the potential for disorder-specific acoustic signatures that support clinical differential diagnosis remains largely unrealized. Additionally, the robustness of acoustic patterns across different speech materials requires clarification to inform flexible, evidence-based clinical protocols and emerging machine learning applications.</p><p><strong>Methods: </strong>This study investigated disorder-specific metrics and speech material consistency using the Perceptual Voice Qualities Database. Generalized Linear Models examined associations between 14 acoustic parameters and common voice pathologies [Vocal Fold Paralysis (VFP), Atrophy, Lesions, and Muscle Tension Dysphonia (MTD)]. Principal component analysis (PCA) integrated acoustic and auditory-perceptual measures to identify multidimensional voice quality patterns, while Receiver Operating Characteristic (ROC) curves evaluated discriminative performance across sustained vowels and connected speech.</p><p><strong>Results: </strong>Two primary principal components emerged: PC1 (34.7% variance) integrating general voice quality and perceptual ratings, and PC2 (17.3% variance) contrasting temporal stability with harmonic structure. Distinct disorder-specific patterns were identified: VFP demonstrated strong discriminative performance on both components (AUC ≥ 0.75), while Atrophy, Lesions, and MTD showed moderate associations with PC1 (AUC = 0.52-0.66). Preliminary analysis revealed characteristic patterns for Parkinson's disease across both components. Importantly, acoustic patterns remained consistent across speech materials, supporting task-flexible clinical assessment protocols.</p><p><strong>Conclusion: </strong>Specific voice pathologies exhibit distinct acoustic-perceptual signatures that can be reliably identified through multidimensional analysis. These findings support a precision-based approach to voice assessment, moving beyond general screening toward disorder-specific diagnostic applications. The robustness of patterns across speech materials enables flexible clinical protocols, while the integration of acoustic and perceptual measures provides a foundation for enhanced diagnostic tools and machine learning applications.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12494153/pdf/","citationCount":"0","resultStr":"{\"title\":\"From Screening to Precision: Searching for Voice Disorder-Specific Acoustic and Auditory-Perceptual Metrics.\",\"authors\":\"Eric J Hunter, Lady Catherine Cantor-Cutiva, Patrick R Walden\",\"doi\":\"10.1016/j.jvoice.2025.09.007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Acoustic and auditory-perceptual parameters are common tools for screening clinically significant voice disorders. However, the potential for disorder-specific acoustic signatures that support clinical differential diagnosis remains largely unrealized. Additionally, the robustness of acoustic patterns across different speech materials requires clarification to inform flexible, evidence-based clinical protocols and emerging machine learning applications.</p><p><strong>Methods: </strong>This study investigated disorder-specific metrics and speech material consistency using the Perceptual Voice Qualities Database. Generalized Linear Models examined associations between 14 acoustic parameters and common voice pathologies [Vocal Fold Paralysis (VFP), Atrophy, Lesions, and Muscle Tension Dysphonia (MTD)]. Principal component analysis (PCA) integrated acoustic and auditory-perceptual measures to identify multidimensional voice quality patterns, while Receiver Operating Characteristic (ROC) curves evaluated discriminative performance across sustained vowels and connected speech.</p><p><strong>Results: </strong>Two primary principal components emerged: PC1 (34.7% variance) integrating general voice quality and perceptual ratings, and PC2 (17.3% variance) contrasting temporal stability with harmonic structure. Distinct disorder-specific patterns were identified: VFP demonstrated strong discriminative performance on both components (AUC ≥ 0.75), while Atrophy, Lesions, and MTD showed moderate associations with PC1 (AUC = 0.52-0.66). Preliminary analysis revealed characteristic patterns for Parkinson's disease across both components. Importantly, acoustic patterns remained consistent across speech materials, supporting task-flexible clinical assessment protocols.</p><p><strong>Conclusion: </strong>Specific voice pathologies exhibit distinct acoustic-perceptual signatures that can be reliably identified through multidimensional analysis. These findings support a precision-based approach to voice assessment, moving beyond general screening toward disorder-specific diagnostic applications. The robustness of patterns across speech materials enables flexible clinical protocols, while the integration of acoustic and perceptual measures provides a foundation for enhanced diagnostic tools and machine learning applications.</p>\",\"PeriodicalId\":49954,\"journal\":{\"name\":\"Journal of Voice\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12494153/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Voice\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jvoice.2025.09.007\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2025.09.007","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
From Screening to Precision: Searching for Voice Disorder-Specific Acoustic and Auditory-Perceptual Metrics.
Background: Acoustic and auditory-perceptual parameters are common tools for screening clinically significant voice disorders. However, the potential for disorder-specific acoustic signatures that support clinical differential diagnosis remains largely unrealized. Additionally, the robustness of acoustic patterns across different speech materials requires clarification to inform flexible, evidence-based clinical protocols and emerging machine learning applications.
Methods: This study investigated disorder-specific metrics and speech material consistency using the Perceptual Voice Qualities Database. Generalized Linear Models examined associations between 14 acoustic parameters and common voice pathologies [Vocal Fold Paralysis (VFP), Atrophy, Lesions, and Muscle Tension Dysphonia (MTD)]. Principal component analysis (PCA) integrated acoustic and auditory-perceptual measures to identify multidimensional voice quality patterns, while Receiver Operating Characteristic (ROC) curves evaluated discriminative performance across sustained vowels and connected speech.
Results: Two primary principal components emerged: PC1 (34.7% variance) integrating general voice quality and perceptual ratings, and PC2 (17.3% variance) contrasting temporal stability with harmonic structure. Distinct disorder-specific patterns were identified: VFP demonstrated strong discriminative performance on both components (AUC ≥ 0.75), while Atrophy, Lesions, and MTD showed moderate associations with PC1 (AUC = 0.52-0.66). Preliminary analysis revealed characteristic patterns for Parkinson's disease across both components. Importantly, acoustic patterns remained consistent across speech materials, supporting task-flexible clinical assessment protocols.
Conclusion: Specific voice pathologies exhibit distinct acoustic-perceptual signatures that can be reliably identified through multidimensional analysis. These findings support a precision-based approach to voice assessment, moving beyond general screening toward disorder-specific diagnostic applications. The robustness of patterns across speech materials enables flexible clinical protocols, while the integration of acoustic and perceptual measures provides a foundation for enhanced diagnostic tools and machine learning applications.
期刊介绍:
The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.