{"title":"Comparative Analysis of Objective and Subjective Voice Characteristics in Patients with Type 2 Diabetes Mellitus and Healthy Control Group.","authors":"Mahin Samadinia, Mehrnoosh Zakerkish, Ehsan Naderifar, Maryam Dastoorpoor, Mahshid Rashidi, Majid Soltani, Negin Moradi","doi":"10.1159/000545443","DOIUrl":"10.1159/000545443","url":null,"abstract":"<p><strong>Introduction: </strong>The purposes of this study were to multidimensionally assess the voice in patients with type 2 diabetes using perceptual and acoustic measurements, laryngeal imaging, and patient self-assessment and compare it with the healthy control group.</p><p><strong>Methods: </strong>A total of 60 patients with type 2 diabetes were enrolled in this cross-sectional study, and a healthy group matched according to age and gender was recruited. The subjects underwent acoustic analysis using the Dysphonia Severity Index (DSI) and perceptual evaluation using the Persian version of the Consensus Auditory Perceptual Evaluation of Voice (ATSHA). Voice-related quality of life was assessed using the Voice Handicap Index (VHI).</p><p><strong>Results: </strong>In both patient and control groups, 28 male and 32 female participants, with a mean age range of 47.35 ± 9.49 and 44.18 ± 9.39 years old, were included. The patient group had a higher mean for ATSHA parameters compared to the control group, and the difference between these two groups was significant (p < 0.05). The VHI scores for the patient group were significantly higher than those for the control group. The mean of DSI scores was lower in the patient group; however, this difference was not statistically significant (p > 0.05). There was a statistically significant correlation among the DSI, FBS, and HbA1C variables (p = 0.007, r = -0.346, and p = 0.008, r = -0.337).</p><p><strong>Conclusions: </strong>The results confirm the presence of voice disorder symptoms in the patient group and their effects on their quality of life compared to the healthy group. These symptoms may result from changes in respiratory and muscle function, as well as changes in the phonetic mechanism of diabetes patients.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-8"},"PeriodicalIF":1.1,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143709345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Silvia Piazzalunga, Benjamin W Y Hornsby, Gloria Parravicini, Michela M Dossi, Luca Negri, Antonio Schindler, Eliana Cristofari
{"title":"Listening-Related Fatigue in Adults: Cross-Cultural Adaptation and Validation of the Italian Version of the Vanderbilt Fatigue Scale.","authors":"Silvia Piazzalunga, Benjamin W Y Hornsby, Gloria Parravicini, Michela M Dossi, Luca Negri, Antonio Schindler, Eliana Cristofari","doi":"10.1159/000545301","DOIUrl":"10.1159/000545301","url":null,"abstract":"<p><strong>Introduction: </strong>Individuals with hearing loss frequently encounter augmented listening-related fatigue due to the elevated effort required for auditory processing, which can be evaluated using the Vanderbilt Fatigue Scale for Adults (VFS-A-40). The objective of this study was to cross-culturally adapt the VFS-A-40 into Italian and validate the Italian version of the scale (VFS-A-40-Ita).</p><p><strong>Methods: </strong>The study employed an observational design, with rigorous cross-cultural adaptation and back-translation procedures. The data were collected from a large sample of native Italian speakers aged 18 years or older without neuropsychological and/or cognitive disorders. The sample was recruited through convenience sampling and included 125 adults with hearing loss (AHL) and 203 adults with normal hearing (ANH). The validation of the VFS-A-40-Ita entailed the assessment of its construct validity (structural validity, known-groups validity, and convergent validity), internal consistency, test-retest reliability, and content validity. Additionally, face validity was assessed by a panel of 10 audiological experts, including individuals with hearing loss.</p><p><strong>Results: </strong>The results indicated a statistically significant difference in perceived hearing fatigue across different domains for AHL participants. Indeed, AHL tended to rate the cognitive domain as more impaired than the other domains. As for structural validity, the results of the confirmatory factor analysis provided support for a first-order one-dimensional solution comprising four correlated factors, thereby justifying the use of a total fatigue score. The known-groups validity demonstrated statistically significant differences between the AHL and ANH groups for all variables under analysis. The convergent validity was confirmed with a moderate correlation between the total scores on the VFS-A-40-Ita and scores related to the physical and mental symptoms of persistent generic fatigue (r > 0.39; p < 0.01). Internal consistency analysis revealed Cronbach's alpha values greater than 0.96 for the VFS-A-40-Ita total score, indicating high reliability. Test-retest reliability showed robust correlations between baseline and retest scores (p < 0.01). Content validity was deemed adequate, with almost all items meeting the item-level content validity index cut-off score of 0.78, as rated relevant or highly relevant by at least 8 out of 10 experts.</p><p><strong>Conclusion: </strong>the findings of this study provide substantial evidence for the validity of the VFS-A-40-Ita, confirming its relevance, validity, and reliability for the assessment of listening-related fatigue in Italian-speaking adults. The instrument is open access and suitable for use in clinical settings and future research, providing a valuable subjective outcome measure for initial assessment, follow-up, and post-intervention evaluation within biopsychosocial and evidence-based clinical pr","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-17"},"PeriodicalIF":1.1,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143647813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yufang Ruan, Peter Wermke, Nai Ding, Li Ma, Hongyu Chang, Linjun Zhang, Youyi Liu, Hua Shu
{"title":"Rhythmicity in Newborn Cry: From an Amplitude Modulation Perspective.","authors":"Yufang Ruan, Peter Wermke, Nai Ding, Li Ma, Hongyu Chang, Linjun Zhang, Youyi Liu, Hua Shu","doi":"10.1159/000545207","DOIUrl":"10.1159/000545207","url":null,"abstract":"<p><strong>Introduction: </strong>Newborn cries exhibit a richness of melody and rhythmicity, yet their rhythmic features have been less studied.</p><p><strong>Methods: </strong>In this study, we analyzed the amplitude modulation of 1,884 cries from 30 Chinese newborns.</p><p><strong>Results: </strong>The group-averaged amplitude modulation spectrum revealed an evident increase in energy between 0.5 and 8 Hz, peaking around 2 Hz. The peak frequency for an individual child averaged at around 3 Hz (SD = 1.3 Hz), with the majority lying below 5 Hz.</p><p><strong>Conclusion: </strong>This study is the first to demonstrate that newborn cries contain increased energy in slower rate amplitude modulation frequencies associated with prosodic feet and syllabic rhythms in speech, underscoring the important role of rhythmic information in early language acquisition.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-9"},"PeriodicalIF":1.1,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143639692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Olga V Sapunova, Elizaveta N Romanova, Olga I Morkovina, Anastasia A Sharapkova, Vera V Tolchennikova
{"title":"Comparing Senior Academics and Students in Phonemic Verbal Fluency: Phonetic and Word Frequency Variables.","authors":"Olga V Sapunova, Elizaveta N Romanova, Olga I Morkovina, Anastasia A Sharapkova, Vera V Tolchennikova","doi":"10.1159/000545151","DOIUrl":"10.1159/000545151","url":null,"abstract":"<p><strong>Introduction: </strong>Controlled Oral Word Association Test (COWAT) is a common neuropsychological assessment tool used to evaluate an individual's semantic and phonemic verbal fluency. The defining feature of this test is the imposed time limit (typically 60 s per task) and the initial letter constraints. Evaluating the performance on COWAT has been challenging because numerous factors may complicate the interpretation of the obtained results. Although it has been widely used in clinical and research settings, it remains unclear whether COWAT performance primarily depends on the current cognitive activity level, the participants' age or the complexity and richness of vocabulary (which might be revealed via linguistic means, e.g., a relationship between pause length and word frequency).</p><p><strong>Methods: </strong>The paper compares the performance of two age groups on the Russian version of COWAT test: students (M = 18.4, SD = 0.8) and senior academics (M = 60.6, SD = 5.2), all native speakers of Russian. Here, we develop the commonly used quantitative approach to processing data by adding the qualitative variables of word frequency and pause length.</p><p><strong>Results: </strong>Our findings show that the cohort of senior academics tends to outperform the younger cohort, which might be conditioned by constant extensive cognitive engagement potentially preventing any cognitive decline. Second, associations between word frequency and pause length, mediated by certain factors, were observed in both cohorts studied, which call for a more in-depth and larger scale investigation.</p><p><strong>Conclusion: </strong>The research opens new avenues for applying COWAT as a comprehensive diagnostic tool and a research method in both psychology and linguistics.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-16"},"PeriodicalIF":1.1,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143614032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gökhan Toptas, Emel Çadallı Tatar, Kemal Keseroğlu, Mustafa Şahin, Elife Barmak, Esma Altan, Mehmet Hakan Korkmaz
{"title":"Does Voice Therapy after Endolaryngeal Microsurgery Contribute to the Treatment of Vocal Fold Polyp?","authors":"Gökhan Toptas, Emel Çadallı Tatar, Kemal Keseroğlu, Mustafa Şahin, Elife Barmak, Esma Altan, Mehmet Hakan Korkmaz","doi":"10.1159/000545213","DOIUrl":"10.1159/000545213","url":null,"abstract":"<p><strong>Introduction: </strong>The aim of this study was to assess the contribution of voice therapy (VT) to management of patients following surgery for vocal fold polyps.</p><p><strong>Methods: </strong>In this prospective study, two groups were created by randomizing patients diagnosed with vocal fold polyps. Group 1 received only phonosurgery, and group 2 received phonosurgery followed by VT. Subjective test methods conducted before and after treatment were evaluated comparatively. Videolaryngostroboscopy, the Voice Handicap Index-10 (VHI-10), and voice analysis parameters, including fundamental frequency variation (vF0), jitter percent (Jitt%), shimmer percent (Shim%), noise-to-harmonic ratio, and maximum phonation time (MPT), were compared before and after treatment.</p><p><strong>Results: </strong>The data of 40 patients, 15 (37.5%) women and 25 (62.5%) men, were examined in this study. Significant reductions were observed in the pre- and post-treatment values in the Voice Handicap Index-10 and GRBAS scale (p < 0.001). In terms of aerodynamic analysis, a decrease was observed in the S/Z ratio in both groups after treatment. Regarding MPT, a statistically significant increase was observed following treatment in (1) both groups (p < 0.001) and (2) group 2 versus group 1 (p < 0.001). In terms of acoustic voice analysis, both groups showed an increase in fundamental frequency (F0) and a statistically significant decrease in jitter and shimmer values. When comparing these parameter changes, it was observed that group 2 had a statistically significant difference compared to group 1 (F0: p = 0.082; Jitt: p = 0.014; Shim: p = 0.035, respectively).</p><p><strong>Conclusion: </strong>In this study comparing the outcomes of vocal fold polyp treatment using only surgery versus surgery combined with VT, it was found that the addition of VT to endolaryngeal phonosurgery provided a significant positive contribution.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-11"},"PeriodicalIF":1.1,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143604413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Preparation and Application of Thickened Foods using Lotus Root Starch Powder for Post-Stroke Dysphagic Patients.","authors":"Mingyue Xiong, Manwa L Ng","doi":"10.1159/000545168","DOIUrl":"https://doi.org/10.1159/000545168","url":null,"abstract":"<p><strong>Objective: </strong>The present study explored the ways to prepare thickened liquids of different consistencies using lotus root starch (LRS) powder and evaluated their efficacy in treating post-stroke dysphagic patients.</p><p><strong>Method: </strong>Thickened liquids matching the four consistency levels advocated by the International Dysphagia Diet Standardization Initiative (IDDSI) guidelines were prepared. To assess their clinical applicability, 76 post-stroke dysphagic patients were randomly divided into three groups: a xanthan gum (XG) group (n=26), an LRS group (n=26), and a control group (n=24). All groups received an intervention including swallow-related exercises and oral feeding training. The control group used regular foods (water, juice, congee, egg custard, and banana puree), while the XG and LRS groups used their respective thickened liquids. Swallowing performance was assessed before and after the intervention using the Standardized Swallowing Assessment (SSA), Penetration-Aspiration Scale (PAS), and gastric tube use duration and removal rate, incidence of adverse reactions, and cost of thickening associated with the three groups were obtained after intervention.</p><p><strong>Results: </strong>While all groups experienced a significant improvement after the intervention, as indicated by the SSA and PAS scores, the XG and LRS groups demonstrated significantly better outcomes than the control group. In addition, the XG and LRS patients had shorter gastric tube use, greater removal rate, and fewer incidence of adverse reactions than the controls. The cost of XG was greater than that of LRS powder for the intervention.</p><p><strong>Conclusion: </strong>LRS powder can be used to prepare thickened liquids of different consistencies following the IDDSI framework. Its efficacy and safety in treating dysphagia are comparable to XG based thickeners. However, seeing its markedly lower cost, LRS powder can be considered a viable alternative to XG based commercial thickeners.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-19"},"PeriodicalIF":1.1,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143585226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Smart Preterm Breather: Variation of Cry Duration by Laryngeal Constriction Phenomena.","authors":"Lea Melber, Johannes Wirbelauer, Tabea Kottmann","doi":"10.1159/000543944","DOIUrl":"10.1159/000543944","url":null,"abstract":"<p><strong>Introduction and objective: </strong>Human neonates are born with some control over their larynx, which is essential for breathing, swallowing and phonation. Laryngeal constriction phenomena were observed during not provoked crying of full-term infants. As preterm infants show alterations in lung development and functionality, the aim of the present study was to examine whether these infants generate laryngeal constriction phenomena during the first week of life.</p><p><strong>Methods: </strong>Cry vocalizations (n = 562) of 11 healthy, preterm infants (6 females) were analyzed visually (frequency spectrograms and oscillograms) and auditorily. The analyses of the occurrence of laryngeal constriction phenomena and the duration of vocalization were conducted using the open-source software PRAAT.</p><p><strong>Results: </strong>Constriction phenomena were found in the crying of all preterm infants. No significant sex differences were observed. There was a significant correlation between the occurrence of laryngeal constriction phenomena and gestational age (r = -0.81). Vocalizations containing constriction phenomena were significantly longer than those without constrictions.</p><p><strong>Conclusion: </strong>The study showed laryngeal constrictions as a regular phenomenon in cry vocalizations of preterm infants during their first week of life. They seemed to be mainly used as a respiratory mechanism to increase end-expiratory lung volume and to improve alveolar gas exchange.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-10"},"PeriodicalIF":1.1,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143585221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yu-An Chen, Shoko Miyamoto, Kenneth O St Louis, Yao Liu, Haotian Liu, Ting Gong, Ying Yang, Sirui Liu
{"title":"An Exploratory Comparison of Attitudes toward Stuttering and Cluttering of Chinese Practicing Speech and Language Therapists and SLT Students.","authors":"Yu-An Chen, Shoko Miyamoto, Kenneth O St Louis, Yao Liu, Haotian Liu, Ting Gong, Ying Yang, Sirui Liu","doi":"10.1159/000543442","DOIUrl":"10.1159/000543442","url":null,"abstract":"<p><strong>Introduction: </strong>This study aimed to compare attitudes toward stuttering and cluttering among speech and language therapists (SLTs) and speech and language therapy students in China and how they differ from those in other countries. It also seeks to explore factors that may influence these attitude differences.</p><p><strong>Methods: </strong>We utilized online versions of the Public Opinion Survey of Human Attributes-Stuttering (POSHA-S) and Public Opinion Survey of Human Attributes-Cluttering (POSHA-Cl) instruments to conduct surveys of 54 SLTs and 99 SLT students at 6 Chinese universities offering speech and language therapy programs.</p><p><strong>Results: </strong>Chinese SLTs had more positive attitudes than did SLT students. For example, rejecting learning and emotional trauma as a cause of stuttering or cluttering (a positive response) was more likely in SLTs than in SLT students. Both groups expressed more positivity about their knowledge sources than the average ratings from international samples.</p><p><strong>Conclusion: </strong>Increased positive exposure opportunities with stuttering or cluttering will likely positively impact on attitudes toward stuttering and cluttering in China. It is crucial for speech and language professionals in China to recognize the widespread misinformation in various sources and actively combat misconceptions about stuttering and cluttering.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-16"},"PeriodicalIF":1.1,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143540781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Impact of Transcutaneous Electrical Stimulation on Oral Moisture in Older Adults with and without Xerostomia: A Pilot Study.","authors":"Ali Barikroo, Lauren Falter","doi":"10.1159/000543814","DOIUrl":"https://doi.org/10.1159/000543814","url":null,"abstract":"<p><strong>Introduction: </strong>Xerostomia, or dry mouth, is a prevalent and distressing oral health condition in older adults that is associated with reduced swallow frequency, thereby increasing the risk of dysphagia and aspiration pneumonia in this cohort. This pseudo-experimental study investigated the association between transcutaneous electrical stimulation (TES) and changes in perceived oral moisture, as well as the function of major and minor salivary glands in two groups of older adults, including those with and without xerostomia.</p><p><strong>Methods: </strong>Ten older adults with self-reported xerostomia and 7 control participants were exposed to two conditions: no TES and motor TES. TES electrodes were placed on cheeks and submental areas, targeting the parotid, submandibular, and sublingual salivary glands. Participants watched a silent nature movie while receiving continuous TES (only for active TES condition) for 15 min while swallowing sounds were recorded using a microphone attached under the cricoid cartilage. Changes in key outcome measures, including perceived oral moisture (measured by a visual analog scale) along with major and minor salivary glands function (measured by stimulated swallow frequency rate and Periotron) were assessed in response to TES.</p><p><strong>Results: </strong>The findings indicated that TES increased stimulated swallow frequency rate in both groups (F(1,13) = 9.813, p < 0.008, ηp2 = 0.430). Additionally, the perceived oral moisture measures were only improved for patients with xerostomia (F(1,13) = 5.155, p < 0.041, ηp2 = 0.284). However, no significant changes were noted in minor salivary gland flow rate measures.</p><p><strong>Conclusion: </strong>These preliminary findings suggest an association between TES and increased objective major salivary gland function in both groups and perceived oral moisture in patients with xerostomia. However, due to the cross-sectional nature of this study, further research involving larger cohorts and controlled designs, and follow up is necessary to understand the mechanisms and potential therapeutic effects of TES on xerostomia.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-9"},"PeriodicalIF":1.1,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143537049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ahmed M Yousef, Adrián Castillo-Allendes, Mark L Berardi, Juliana Codino, Adam D Rubin, Eric J Hunter
{"title":"Screening Voice Disorders: Acoustic Voice Quality Index, Cepstral Peak Prominence, and Machine Learning.","authors":"Ahmed M Yousef, Adrián Castillo-Allendes, Mark L Berardi, Juliana Codino, Adam D Rubin, Eric J Hunter","doi":"10.1159/000544852","DOIUrl":"10.1159/000544852","url":null,"abstract":"<p><strong>Introduction: </strong>The Acoustic Voice Quality Index (AVQI) and Smoothed Cepstral Peak Prominence (CPPs) have been reported to effectively support the assessment of voice quality in persons seeking voice care across many languages. This study aimed to evaluate the diagnostic accuracy of these two measures in detecting voice disorders in American English speakers, comparing their performance to machine learning (ML) models.</p><p><strong>Methods: </strong>This retrospective study included a cohort of 187 participants: 138 patients with clinically diagnosed voice disorders and 49 vocally healthy individuals. Each participant completed two voicing tasks: sustaining [a:] vowel and producing a running speech sample, which were then concatenated. These samples were analyzed using VOXplot software for AVQI-3 (version 03.01) and CPPs. Additionally, four ML models (random forest, k-nearest neighbors, support vector machine, and decision tree) were trained for comparison. The diagnostic accuracy of the two measures and models was assessed using various evaluation metrics, including receiver operating characteristic curve and Youden Index.</p><p><strong>Results: </strong>A cutoff score of 1.54 for the AVQI-3 (with 55% sensitivity and 80% specificity) and 14.35 dB for CPPs (with 65% sensitivity and 78% specificity) were identified for detecting voice disorders. Compared to an average ML sensitivity of 89% and specificity of 55%, CPPs offered a better balance between sensitivity and specificity, outperforming AVQI-3 and nearly matching the average ML performance.</p><p><strong>Conclusions: </strong>ML shows great potential for supporting voice disorder diagnostics, especially as models become more generalizable and easier to interpret. However, current tools like AVQI-3 and CPPs remain more practical and accessible for clinical use in evaluating voice quality than commonly implemented models. CPPs, in particular, offers distinct advantages for identifying voice disorders, making it a recommended and feasible choice for clinics with limited resources.</p>","PeriodicalId":12114,"journal":{"name":"Folia Phoniatrica et Logopaedica","volume":" ","pages":"1-15"},"PeriodicalIF":1.1,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143482558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}