{"title":"The Influence of Age, Gender, and Different Speech Tasks on CPP and CSID in Kannada Language: An Indian Normative Study.","authors":"Meluru Puttashetty Geetha, Thirunavukkarasu Jayakumar","doi":"10.1016/j.jvoice.2026.02.033","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.033","url":null,"abstract":"<p><strong>Introduction: </strong>The Cepstral Spectral Index of Dysphonia (CSID) is a multivariate index that represents a quantitative summary of dysphonia and is determined using a speech task. Since voice quality may depend on geographical and ethnic factors, it is necessary to establish regional normative CSID values if dysphonia needs to be accurately assessed. The present study aims to do this for Kannada-speaking normophonic individuals differing in age and gender, using a range of speech tasks.</p><p><strong>Method: </strong>A total of 360 normophonic adults, including 160 males and 160 females in the age range of 20-80 years, participated in the present study. The study included three types of speech stimuli: sustained phonation of the vowel /a/ lasting 3-5 sec. Four Kannada sentences from the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) protocol were designed to evaluate various voice qualities, and (3) the first paragraph of the Bengaluru passage, which contains all voiced phonemes. The speech samples were analyzed using ADSV software.</p><p><strong>Results: </strong>Age had no effect on the CSID<sub>CAPE-V</sub> or CSID<sub>Bengaluru</sub> tasks. Though age had an overall impact on the phonation task (CSID<sub>a</sub>) specific age groups did not show significant differences on CSID<sub>a</sub>. Significant gender effects were observed for all three CSID tasks. With respect to cepstral peak prominence (CPP), age had no impact on CPP, but different speech tasks and gender influenced the CPP.</p><p><strong>Conclusion: </strong>The study provides the normative values for CPP and CSID parameters for Kannada-speaking individuals of different ages and gender to provide a basis for clinical and research involving voice assessment in this population.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of VoicePub Date : 2026-03-16DOI: 10.1016/j.jvoice.2026.02.038
Siyan Xu, Ziying Song, Min Shu, Xia Hu, Chen Chen, Rui Fang, Jun Shao
{"title":"Smartphone-Based Mini-Program Voice Acquisition for Remote and Real-Time Acoustic Voice Analysis.","authors":"Siyan Xu, Ziying Song, Min Shu, Xia Hu, Chen Chen, Rui Fang, Jun Shao","doi":"10.1016/j.jvoice.2026.02.038","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.038","url":null,"abstract":"<p><strong>Objective: </strong>We developed a WeChat mini-program called \"Voice Acquisition\" that can be installed on smartphones for remote and real-time acoustic analysis. The program analyzes multiple parameters, including the fundamental frequency (F0), maximum phonatory time (MPT), jitter, shimmer, noise-to-harmonic ratio, cepstral peak prominence, cepstral/spectral index of dysphonia (CSID), and mel-scale frequency cepstral coefficients. This study aimed to validate these acoustic measurements against the conventional Computerized Speech Lab (CSL) software and assess their potential utility for screening and follow-up of voice disorders.</p><p><strong>Methods: </strong>During the study, smartphones were used as the carrier for Voice Acquisition. Researchers simultaneously used CSL software and the Voice Acquisition program to collect and analyze the sustained vowel phonation /a/ from 77 patients with voice abnormalities and 80 healthy volunteers in both a soundproof room and a regular office environment. The correlation coefficients from both applications were calculated to establish the reliability of the mini-program. In addition, we compared the differences between volunteers and patients to explore their potential for screening and follow-up of voice-related disorders.</p><p><strong>Results: </strong>Voice Acquisition and CSL correlated moderately to highly strongly across most acoustic parameters (r = 0.543-0.989). MPT and F0 mean proved most stable: MPT showed near-perfect positive correlation across all conditions, while F0 mean ranked second, performing optimally in the soundproof booth. Conversely, F0 min exhibited the most dramatic context sensitivity, correlating strongly in the booth (r = 0.655) but becoming non-significant outside (P > 0.05). Absolute values differed slightly between systems. Some acoustic parameters (eg, CSID) from the mini-program significantly discriminated between pathological and normal voice groups.</p><p><strong>Conclusions: </strong>Voice Acquisition provides acoustic measures that are strongly correlated with those from CSL, supporting its reliability for remote voice data collection. While preliminary data suggest potential for screening applications, further validation is needed to establish clinical utility.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of VoicePub Date : 2026-03-16DOI: 10.1016/j.jvoice.2026.02.023
David J Fei-Zhang, Kelsey Roessner, Kevin McElfresh, Lisa Kelchner, Alessandro de Alarcon, Nan Zhang, Stephanie Rc Zacharias
{"title":"Impact of Benign Bilateral Vocal Fold Lesions Across Childhood: Does Age Matter?","authors":"David J Fei-Zhang, Kelsey Roessner, Kevin McElfresh, Lisa Kelchner, Alessandro de Alarcon, Nan Zhang, Stephanie Rc Zacharias","doi":"10.1016/j.jvoice.2026.02.023","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.023","url":null,"abstract":"<p><strong>Objectives: </strong>To assess correlations between age group and functional, physical, emotional, and talkativeness domains of the pediatric Voice Handicap Index (pVHI) in pediatric cohorts with and without benign bilateral vocal fold lesions (BVFL) and to further validate the utility of the pVHI in these patients.</p><p><strong>Study design: </strong>Prospective data collection of pVHI in children with and without BVFL.</p><p><strong>Methods: </strong>Prospective pVHI data was collected from parents of children with and without benign BVFL. Data were examined for four age groups, preschool (3-4), young child (5-8), child (9-12), and adolescent (13-18).</p><p><strong>Results: </strong>A total of 182 parents filled-out the pVHI; 60 parents of children with no voice disorder and 122 with benign BVFL diagnosed via endoscopy. Among all (n=182), median pVHI subscores in the functional, emotional, and physical domains were significantly increased in BVFL patients compared to non-disordered (all P value <0.001). In patients with BVFL, age showed a direct relationship with emotional pVHI subscore (P < 0.001) but not with physical or functional pVHI subscores (P = 0.4, P = 0.8). There were no differences in talkativeness across groups.</p><p><strong>Conclusions: </strong>This study demonstrates that children with BVFL across all age groups face negative effects in the functional, physical, and emotional domains compared to those with no lesions. Of interest, the impact of a voice disorder changes as a function of age in children with BVFL. The impact of lesions in adolescents is significantly higher in the emotional domain. These results highlight that the needs of children with a voice disorder may differ at different ages. This information is important for voice professionals, indicating the need to adapt evaluation and therapy strategies accordingly.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of VoicePub Date : 2026-03-16DOI: 10.1016/j.jvoice.2026.02.046
Diego E Razura, Anna M Wisniowiecki, Zhaoyan Zhang, Marcela A Moran, Brian E Applegate, Michael M Johns Iii
{"title":"Quantifying Vocal Fold Vibration in the Coronal Plane Using Optical Coherence Tomography in Normal Human Subjects.","authors":"Diego E Razura, Anna M Wisniowiecki, Zhaoyan Zhang, Marcela A Moran, Brian E Applegate, Michael M Johns Iii","doi":"10.1016/j.jvoice.2026.02.046","DOIUrl":"10.1016/j.jvoice.2026.02.046","url":null,"abstract":"<p><strong>Objective: </strong>Assessment of human vocal fold (VF) morphology and vibration is largely limited to surface visualization via laryngeal videostroboscopy. Using optical coherence tomography (OCT), this study aims to measure VF vibratory dynamics in the coronal plane of healthy volunteers to support a quantitative assessment of VF biomechanics.</p><p><strong>Study design: </strong>Cross-sectional observational pilot study METHODS: Twelve healthy participants underwent awake, transoral imaging using a custom-built, hand-held rigid laryngoscope equipped with OCT technology. Phonation was captured during sustained vowel /i/ production at a modal pitch and loudness. A novel algorithm was utilized to reconstruct the phase-resolved VF vibratory cycle during phonation from the OCT data. Seven dynamic metrics were extracted from each video reconstruction, including closed quotient (CQ), angle at closure (AaC), divergent phase vertical thickness (DPVT), closed phase vertical span (CPVS), mucosal peak, amplitude of lateral excursion (amplitude), and vertical phase difference (VPD). Data were grouped by gender, and applicable metrics were stratified into right vocal fold and left vocal fold regions. Independent t-tests were performed to assess gender and laterality differences for each metric. Pairwise Bonferroni-adjusted Pearson correlations were performed between metrics.</p><p><strong>Results: </strong>Seven novel metrics were successfully extracted and measured from OCT data in all subjects. Significant gender differences were found for DPVT (p = 0.015), amplitude (p = 0.049), and VPD (p = 0.043). No significant laterality differences were observed except for DPVT in the female group. Correlation analyses showed a positive correlation between DPVT and CPVS (r = +0.89). CQ was negatively correlated with AaC (r = -0.74) and positively correlated with VPD (r = +0.83) CONCLUSION: OCT imaging of the VFs can be successfully performed in normal subjects during phonation to visualize sub-surface VF anatomy and vibration, enabling quantitative characterization of VF dynamics.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13000119/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Contribution of Fundamental Frequency to Perceived Vocal Roughness: Evidence From Time-Scaled Natural Voice Samples.","authors":"Kenji Aruga, Kiyohito Hosokawa, Itsuki Kitayama, Takanari Kawabe, Eri Okajima, Nao Hashida, Takeshi Tsuda, Yoshiyuki Ozono, Hidenori Tanaka, Toshihiro Kishikawa, Yukinori Takenaka, Makoto Ogawa, Hidenori Inohara","doi":"10.1016/j.jvoice.2026.02.042","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.042","url":null,"abstract":"<p><strong>Objectives: </strong>Fundamental frequency (f<sub>o</sub>) influences auditory roughness in psychoacoustic paradigms; however, it remains unclear how changes in f<sub>o</sub> alone, when applied to dysphonic voices, alter perceived vocal roughness and breathiness. We investigated how roughness and breathiness change when f<sub>o</sub> is systematically manipulated in dysphonic voices while maintaining the time-domain waveform shape.</p><p><strong>Methods: </strong>Natural sustained vowels from dysphonic or nondysphonic speakers were resynthesized in Praat by time-scaling so that f<sub>o</sub> was converted from the original value to a target f<sub>o</sub> of 50, 100, or 200 Hz without altering the overall waveform shape (Experiment 1). In Experiment 2, an extended low-f<sub>o</sub> set was created by converting f<sub>o</sub> to 30-100 Hz in 10-Hz steps. Experienced raters evaluated each token for roughness and breathiness. Linear mixed-effects models tested whether changes in f<sub>o</sub> predicted perceptual ratings after accounting for within-token and between-rater variability. Receiver operating characteristic (ROC) analysis was used to identify an f<sub>o</sub> cutoff associated with higher roughness in Experiment 2.</p><p><strong>Results: </strong>Across experiments, lowering f<sub>o</sub> systematically increased roughness ratings, whereas breathiness showed comparatively smaller and less consistent changes. In mixed-effects models, f<sub>o</sub> was an independent predictor of roughness but not of breathiness. In Experiment 2, ROC analysis identified an f<sub>o</sub> threshold of 70 Hz for differentiating higher versus lower roughness. These patterns are consistent with auditory accounts in which reduced harmonic spacing relative to cochlear bandwidths enhances within-band interactions and temporal-envelope modulation.</p><p><strong>Conclusions: </strong>By manipulating f<sub>o</sub> while preserving waveform shape, we identified an auditory contribution of f<sub>o</sub> to perceived vocal roughness that is largely distinct from breathiness. These findings support the development of acoustic roughness indices that account for fo-dependent harmonic spacing and critical-band envelope modulation, and they underscore the need to consider f<sub>o</sub> when interpreting auditory-perceptual roughness ratings.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Effectiveness of Unilateral and Bilateral Type I Thyroplasty for Nonparalytic Vocal Fold Atrophy.","authors":"Yasunori Maeda, Kazuhiro Nakamura, Hisashi Hasegawa, Marin Yoshida, Marino Otsuka, Takeshi Oshima, Shinsuke Suzuki","doi":"10.1016/j.jvoice.2026.02.025","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.025","url":null,"abstract":"<p><strong>Background: </strong>Vocal fold atrophy is a common cause of dysphonia, particularly in older adults, and is characterized by glottic insufficiency and impaired voice quality. Although type I thyroplasty is an established surgical intervention, its comprehensive effects on voice outcomes remain unclear. This study systematically evaluated the therapeutic outcomes of unilateral and bilateral type I thyroplasty using Gore-Tex for non-paralytic vocal fold atrophy using aerodynamic measurements, patient-reported assessments, auditory-perceptual evaluations, and acoustic analyses, aiming to clarify both the strengths and limitations of the procedure.</p><p><strong>Methods: </strong>Eleven patients who underwent type I thyroplasty for vocal fold atrophy between April 2018 and March 2025 were included. Surgery was generally performed bilaterally during the same session. However, intraoperative voice monitoring was conducted, and if the patient was sufficiently satisfied with the voice outcome, the procedure was concluded after unilateral medialization. Voice outcomes were assessed preoperatively and at 1 month and 1 year postoperatively using maximum phonation time (MPT); Voice Handicap Index (VHI)-30; the grade, roughness, and breathiness components of the GRBAS scale; jitter; and shimmer. The primary endpoint was the 1-year postoperative assessment. All outcome measures were analyzed using the Wilcoxon signed-rank test, with P<0.05 considered statistically significant.</p><p><strong>Results: </strong>At 1 year postoperatively, MPT significantly improved compared with preoperative values (P<0.05). VHI-30 and the grade, roughness, and breathiness components of the GRBAS scale also showed significant improvement (P<0.05). In contrast, jitter and shimmer did not show significant postoperative changes.</p><p><strong>Conclusion: </strong>Type I thyroplasty for non-paralytic vocal fold atrophy significantly improves MPT, VHI-30, and GRBAS scores, indicating enhanced phonatory function. However, improvements in acoustic parameters, such as jitter and shimmer, appear limited. These results underscore the procedure's value in clinical practice and the need to tailor treatment and set realistic expectations for each patient's voice outcomes.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of VoicePub Date : 2026-03-16DOI: 10.1016/j.jvoice.2026.02.037
Hyunsu Choi, Jooin Bang, Yaegi Song, Choung-Soo Kim
{"title":"Particulate Matter-Induced Lysosomal Rupture-Mediated Pyroptosis in Human Vocal Fold Fibroblasts.","authors":"Hyunsu Choi, Jooin Bang, Yaegi Song, Choung-Soo Kim","doi":"10.1016/j.jvoice.2026.02.037","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.037","url":null,"abstract":"<p><strong>Objective: </strong>Exposure to air pollution, specifically to particulate matter (PM), is a significant global health hazard. Recent studies have shown that PM triggers NLRP3-mediated pyroptosis. However, whether cathepsin B (CTSB), a protease released from ruptured lysosomes, activates the NLRP3 inflammasome is unclear. This study investigated the involvement of CTSB and NLRP3 inflammasome activation in PM-exposed human vocal fold fibroblasts (hVFFs).</p><p><strong>Methods: </strong>Pyroptotic cell death was evaluated based on LDH release and PI staining and NLRP3 inflammasome activation via western blotting and immunofluorescence staining. The inflammatory response during pyroptosis was analyzed in an ELISA assay and lysosomal stability by LysoTracker staining.</p><p><strong>Results: </strong>PM exposure was shown to induce cell membrane rupture and pyroptosis in hVFFs, through a mechanism involving NLRP3, cleaved caspase-1, gasdermin D (GSDMD), IL-1β, lysosomal-associated membrane protein 2a, and CTSB. These results suggested that PM causes pyroptosis in hVFFs via NLRP3 inflammasome activation and lysosomal destabilization. Treatment of the PM-exposed cells with MCC950, an NLRP3-specific inhibitor, suppressed this pathway, providing further evidence of the involvement of the NLRP3 inflammasome in PM-induced pyroptosis. A role for CTSB in NLRP3 inflammasome activation and pyroptosis was implied by the finding that treatment with CA-074-me, a CTSB-specific inhibitor, reduced CTSB expression and suppressed NLRP3, cleaved caspase-1, GSDMD, IL-1β, and CTSB in PM-exposed hVFFs.</p><p><strong>Conclusions: </strong>Our study shows that both the NLRP3 inflammasome and CTSB participate in PM-induced pyroptosis in hVFFs. Targeting these pathways could yield novel therapeutic agents able to mitigate the detrimental effects of PM exposure on respiratory health.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of VoicePub Date : 2026-03-13DOI: 10.1016/j.jvoice.2026.02.005
Buse Çelebi, Serkan Bengisu
{"title":"Investigating the Relationship Between Voice Characteristics and Presbyphagia in Geriatric Population.","authors":"Buse Çelebi, Serkan Bengisu","doi":"10.1016/j.jvoice.2026.02.005","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.005","url":null,"abstract":"<p><strong>Purpose: </strong>Aging is a significant risk factor for dysphagia. Speech and swallowing share the same anatomical and neurophysiological mechanisms and are closely related functions. Therefore, speech assessment can serve as a valuable tool for predicting the risk of dysphagia. Presbyphagia is a term that describes the impact of aging on swallowing function. This study aims to investigate the relationship between acoustic and aerodynamic parameters of voice and speech and presbyphagia in the normal geriatric population.</p><p><strong>Methods: </strong>A total of 96 individuals aged 65 and above participated in the study. Acoustic voice analysis was administered utilizing PRAAT software. In aerodynamic assessments, maximum phonation time (MPT) and the s/z ratio were assessed, whereas speech rate and vocal coordination were analyzed through diadochokinetic (DDK) measurements. Swallowing function was assessed using the eating assessment tool (EAT-10) and the timed water swallow test (TWST), while nutritional status was analyzed with the mini nutritional assessment-short form. Participants were classified as dysphagic or non-dysphagic based on their EAT-10 scores, with established cut-off values used to identify individuals at risk for dysphagia. The relevant parameters were first examined for differences between dysphagic and non-dysphagic groups, followed by a comparative assessment of factors influencing presbyphagia between the 65-79 and >80 age groups.</p><p><strong>Results: </strong>Among the participants, 66 individuals were found to be at risk for dysphagia, while the remaining 30 were not. Comparisons between the two groups revealed a statistically significant difference in swallowing capacity measured by TWST (P = 0.006). Another significant difference was also observed in the MPT parameter between dysphagic and non-dysphagic groups (P = 0.001). The comparisons based on age groups showed statistically significant differences in EAT-10 and TWST scores (P = 0.005; P = 0.001). Additionally, significant differences were found in MPT and DDK rate between age groups (P = 0.001; P = 0.001). Correlation analyses indicated that the relationship between fundamental frequency and EAT-10 and TWST was moderately significant (r = 0.423, P = 0.029; r = -0.542, P = 0.010), and that there was a strong and significant relationship between MPT and both EAT-10 and TWST (r = -0.741, P = 0.016; r = 0.729, P = 0.001). Furthermore, a strong negative correlation was found between the TWST and DDK rate (r = -0.734, P = 0.001).</p><p><strong>Conclusion: </strong>This study concludes that the assessment of voice and swallowing functions in the geriatric population is crucial for the early diagnosis of presbyphagia. Reduced respiratory support, inadequate oral-motor movements, deficiencies in vocal cord closure control, and weak airflow control have been identified as risk factors for presbyphagia. The findings emphasize that monitoring these functions pla","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147460870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of VoicePub Date : 2026-03-12DOI: 10.1016/j.jvoice.2026.02.026
Yue Wang, Wen Liu
{"title":"Vocal Register Adjustments in Sopranos: A Mixed-Effects Study of Source-Filter Interactions in Chinese National Singing and Bel Canto.","authors":"Yue Wang, Wen Liu","doi":"10.1016/j.jvoice.2026.02.026","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.026","url":null,"abstract":"<p><strong>Background: </strong>The definition of vocal registers has long been debated. Models emphasizing source-filter interaction have replaced exclusively laryngeal explanations in recent years. With the global spread of Chinese National Singing and Bel Canto, comparing formant tuning and glottal behavior during register transitions in these two singing genres is crucial for deepening the understanding of vocal registers and for describing the Chinese singing voice more scientifically.</p><p><strong>Method: </strong>Six sopranos with professional training (three Chinese National Singing, three Bel Canto) were recruited to produce self-perceived pitch exercises spanning their vocal pitch range. Acoustic and electroglottographic signals were recorded simultaneously. Fundamental frequency (F0), the first two formants (F1, F2), intensity, and contact quotient (CQ) were extracted. The linear and nonlinear interactions between F0 and these parameters were examined using linear mixed-effects models (LMMs) and generalized additive mixed models (GAMMs) to compare the source-filter interaction patterns of the two singing genres.</p><p><strong>Results: </strong>LMMs revealed that the F0-F1 relationship for the vowel /a/ exhibited a U-shaped pattern, with F1-F0 tuning emerging in the high pitch range, while /i/ and /u/ increased roughly linearly. A F2-based \"vowel space compression\" was observed in the high pitch range in both singing genres. Results of GAMMs showed multi-segment nonlinear F0-CQ functions. Specifically, CQ values were higher and more variable in Chinese National Singing, whereas Bel Canto maintained relatively stable CQ over a wider pitch range. Intensity increased with F0 in both singing genres; however, the growth rate varied among vowels in Chinese National Singing, while it remained relatively consistent in Bel Canto.</p><p><strong>Conclusions: </strong>This study reveals that sopranos in both singing genres develop source-filter interaction strategies that are internally consistent but stylistically distinct: (i) Chinese National Singing presents an upper-range-biased compensatory pattern, in which formant tuning and stronger glottal closure are concentrated in the upper pitch range. In contrast, Bel Canto embodies a lower-onset equalizing pattern characterized by smoother, anticipatory adjustments across a wider vocal range. (ii) The register adjustments in different singing genres are grounded in stylized source-filter interactions that directly shape vowel organization and intensity control. (iii) This study provides more evidence that register transitions involve complex mappings between acoustic outcomes and physiological implementation rather than being an isolated laryngeal event.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147460831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Journal of VoicePub Date : 2026-03-11DOI: 10.1016/j.jvoice.2026.02.020
Fabian Burk, Gerhard Förster, Robert Brunner, Lukas Werner, Orlando Guntinas-Lichius, Andreas H Müller
{"title":"Quantitative Office-Based Laryngoscopy: Bench Validation of a Simple Calibration Method for Submillimeter Accuracy With a Clinical-Grade Stereo-Laryngoscope.","authors":"Fabian Burk, Gerhard Förster, Robert Brunner, Lukas Werner, Orlando Guntinas-Lichius, Andreas H Müller","doi":"10.1016/j.jvoice.2026.02.020","DOIUrl":"https://doi.org/10.1016/j.jvoice.2026.02.020","url":null,"abstract":"<p><strong>Introduction and objective: </strong>Past successful approaches to quantitative laryngoscopy relied on research prototypes and laser-add-ons which were not approved as medical devices, preventing the acquisition of quantitative laryngoscopic data in routine-diagnostics and complicating large-scale studies. We present a simple one-shot reference-grid calibration for a CE-marked stereo-laryngoscope commercially available for clinical use and validate it on an optical bench.</p><p><strong>Methods: </strong>Using a 3D stereo-laryngoscope (XION GmbH, Berlin, Germany), we measured target distances (TD) of 2.5, 10, and 20 mm across working distances (WD) of 30, 45, 60, 75, and 90 mm in three runs, refocusing each time (30 measurements per target distance). A point-grid reference image with known spacing was captured. Relating each frame's stereo disparity to the reference grid yielded its px/mm scale. Precision was assessed as standard deviation (SD) of repeated measurements and accuracy was evaluated by Bland-Altman-analysis, reported as bias and 95% limits of agreement (LoA).</p><p><strong>Results: </strong>Over all, measurements showed a SD of 1.0%, a small bias of -0.3% with LoA from -2.2% to +1.6%. Precision and accuracy remained stable across TD, indicating even measurement performance across the evaluated range. No relevant edge-of-field distortion was observed.</p><p><strong>Conclusions: </strong>The proposed calibration method for a commercially available, CE-marked, off-the-shelf stereo endoscope yields precise, submillimeter distance measurements and is adequate for clinical and scientific use in office-based transoral laryngoscopy, without custom hardware.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147445784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}