{"title":"epstral频谱声学分析用于发音障碍严重程度分类的研究","authors":"Önal İncebay , Ayşen Köse , Fatma Esen Aydinli , Shaheen N. Awan , Merve Dilbaz Gürsoy , Taner Yilmaz","doi":"10.1016/j.jvoice.2022.12.012","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>The advantages of cepstral measurements in the evaluation of dysphonia<span> have been noted in previous studies. However, there is an unclarity regarding the results of cepstral analyzes effect in determining the severity of dysphonia. The aims of this study were to determine the cut-off values of cepstral peak prominence, cepstral peak prominence standard deviation, low frequency/ high frequency ratio, low frequency/high frequency ratio standard deviation, and cepstral spectral index of dysphonia for predicting the voice severity within a Turkish speaking population, as well as to confirm the discriminative power of these cut-off values.</span></div></div><div><h3>Materials Methods</h3><div>One hundred ninety-five individuals with voice disorders and an equal number of age and gender-matched individuals without voice disorders were included. Included subjects had visited the Hacettepe University Hospitals Speech and Language Therapy<span> Department for voice evaluation between January 2017 and September 2021. The voice recordings from all participants included the six CAPE-V/Turkish sentences and sustained vowel /a/. Three raters provided auditory perceptual ratings of the voice samples using the GRBAS scale (grade) and overall severity for the CAPE-V/Turkish. Participants were categorized into normal and mild, moderate, and severely dysphonic groups based on the auditory perceptual evaluation. Analysis of Dysphonia in Speech and Voice (ADSV) software was used for cepstral spectral acoustic analysis.</span></div></div><div><h3>Results</h3><div>In the sustained vowel context, the area under the curve (ROC) for the CSID value was >0.8, except for mild vs. moderate dysphonia groups. In connected speech contexts, the ROC of the CPP value was also >0.8, except for normal vs. mild dysphonia groups. The cut-off values of CPP and CSID demonstrated high sensitivity and specificity for predicting voice severities.</div></div><div><h3>Conclusion</h3><div>The cut-off values for the parameters that predicted voice severities showed a significant degree of discriminative power for categorizing voice severities among Turkish-speaking people.</div></div>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":"39 3","pages":"Pages 844.e19-844.e30"},"PeriodicalIF":2.5000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Investigation of the Cepstral Spectral Acoustic Analysis for Classifying the Severity of Dysphonia\",\"authors\":\"Önal İncebay , Ayşen Köse , Fatma Esen Aydinli , Shaheen N. Awan , Merve Dilbaz Gürsoy , Taner Yilmaz\",\"doi\":\"10.1016/j.jvoice.2022.12.012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objectives</h3><div>The advantages of cepstral measurements in the evaluation of dysphonia<span> have been noted in previous studies. However, there is an unclarity regarding the results of cepstral analyzes effect in determining the severity of dysphonia. The aims of this study were to determine the cut-off values of cepstral peak prominence, cepstral peak prominence standard deviation, low frequency/ high frequency ratio, low frequency/high frequency ratio standard deviation, and cepstral spectral index of dysphonia for predicting the voice severity within a Turkish speaking population, as well as to confirm the discriminative power of these cut-off values.</span></div></div><div><h3>Materials Methods</h3><div>One hundred ninety-five individuals with voice disorders and an equal number of age and gender-matched individuals without voice disorders were included. Included subjects had visited the Hacettepe University Hospitals Speech and Language Therapy<span> Department for voice evaluation between January 2017 and September 2021. The voice recordings from all participants included the six CAPE-V/Turkish sentences and sustained vowel /a/. Three raters provided auditory perceptual ratings of the voice samples using the GRBAS scale (grade) and overall severity for the CAPE-V/Turkish. Participants were categorized into normal and mild, moderate, and severely dysphonic groups based on the auditory perceptual evaluation. Analysis of Dysphonia in Speech and Voice (ADSV) software was used for cepstral spectral acoustic analysis.</span></div></div><div><h3>Results</h3><div>In the sustained vowel context, the area under the curve (ROC) for the CSID value was >0.8, except for mild vs. moderate dysphonia groups. In connected speech contexts, the ROC of the CPP value was also >0.8, except for normal vs. mild dysphonia groups. The cut-off values of CPP and CSID demonstrated high sensitivity and specificity for predicting voice severities.</div></div><div><h3>Conclusion</h3><div>The cut-off values for the parameters that predicted voice severities showed a significant degree of discriminative power for categorizing voice severities among Turkish-speaking people.</div></div>\",\"PeriodicalId\":49954,\"journal\":{\"name\":\"Journal of Voice\",\"volume\":\"39 3\",\"pages\":\"Pages 844.e19-844.e30\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Voice\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0892199722004143\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0892199722004143","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
Investigation of the Cepstral Spectral Acoustic Analysis for Classifying the Severity of Dysphonia
Objectives
The advantages of cepstral measurements in the evaluation of dysphonia have been noted in previous studies. However, there is an unclarity regarding the results of cepstral analyzes effect in determining the severity of dysphonia. The aims of this study were to determine the cut-off values of cepstral peak prominence, cepstral peak prominence standard deviation, low frequency/ high frequency ratio, low frequency/high frequency ratio standard deviation, and cepstral spectral index of dysphonia for predicting the voice severity within a Turkish speaking population, as well as to confirm the discriminative power of these cut-off values.
Materials Methods
One hundred ninety-five individuals with voice disorders and an equal number of age and gender-matched individuals without voice disorders were included. Included subjects had visited the Hacettepe University Hospitals Speech and Language Therapy Department for voice evaluation between January 2017 and September 2021. The voice recordings from all participants included the six CAPE-V/Turkish sentences and sustained vowel /a/. Three raters provided auditory perceptual ratings of the voice samples using the GRBAS scale (grade) and overall severity for the CAPE-V/Turkish. Participants were categorized into normal and mild, moderate, and severely dysphonic groups based on the auditory perceptual evaluation. Analysis of Dysphonia in Speech and Voice (ADSV) software was used for cepstral spectral acoustic analysis.
Results
In the sustained vowel context, the area under the curve (ROC) for the CSID value was >0.8, except for mild vs. moderate dysphonia groups. In connected speech contexts, the ROC of the CPP value was also >0.8, except for normal vs. mild dysphonia groups. The cut-off values of CPP and CSID demonstrated high sensitivity and specificity for predicting voice severities.
Conclusion
The cut-off values for the parameters that predicted voice severities showed a significant degree of discriminative power for categorizing voice severities among Turkish-speaking people.
期刊介绍:
The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.