Marcelo Saldías O'Hrens, Christian Castro, Víctor M Espinoza, Justin Stoney, Camilo Quezada, Anne-Maria Laukkanen
{"title":"Spectral features related to the auditory perception of twang-like voices.","authors":"Marcelo Saldías O'Hrens, Christian Castro, Víctor M Espinoza, Justin Stoney, Camilo Quezada, Anne-Maria Laukkanen","doi":"10.1080/14015439.2024.2345373","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>To the best of our knowledge, studies on the relationship between spectral energy distribution and the degree of perceived <i>twang-like</i> voices are still sparse. Through an auditory-perceptual test we aimed to explore the spectral features that may relate with the auditory-perception of <i>twang-like</i> voices.</p><p><strong>Methods: </strong>Ten judges who were blind to the test's tasks and stimuli rated the amount of twang perceived on seventy-six audio samples. The stimuli consisted of twenty voices recorded from eight CCM singers who sustained the vowel [a:] in different pitches, with and without a <i>twang-like</i> voice. Also, forty filtered and sixteen synthesized-manipulated stimuli were included.</p><p><strong>Results and conclusions: </strong>Based on the intra-rater reliability scores, four judges were identified as suitable to be included in the analyses. Results showed that the frequency of F<sub>1</sub> and F<sub>2</sub> correlated strongly with the auditory-perception of <i>twang-like</i> voices (0.90 and 0.74, respectively), whereas F<sub>3</sub> showed a moderate negative correlation (-0.52). The frequency difference between F<sub>1</sub> and F<sub>3</sub> showed a strong negative correlation (-0.82). The mean energy between 1-2 kHz and 2-3 kHz correlated moderately (0.51 and 0.42, respectively). The frequency of F<sub>4</sub> and F<sub>5</sub>, and the energy above 3 kHz showed weak correlations. Since the spectral changes under 2 kHz have been associated with the jaw, lips, and tongue adjustments (i.e. vowel articulation) and a higher vertical laryngeal position might affect the frequency of all formants (including F<sub>1</sub> and F<sub>2</sub>), our results suggest that vowel articulation and the laryngeal height may be relevant when performing <i>twang-like</i> voices.</p>","PeriodicalId":49903,"journal":{"name":"Logopedics Phoniatrics Vocology","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Logopedics Phoniatrics Vocology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/14015439.2024.2345373","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: To the best of our knowledge, studies on the relationship between spectral energy distribution and the degree of perceived twang-like voices are still sparse. Through an auditory-perceptual test we aimed to explore the spectral features that may relate with the auditory-perception of twang-like voices.
Methods: Ten judges who were blind to the test's tasks and stimuli rated the amount of twang perceived on seventy-six audio samples. The stimuli consisted of twenty voices recorded from eight CCM singers who sustained the vowel [a:] in different pitches, with and without a twang-like voice. Also, forty filtered and sixteen synthesized-manipulated stimuli were included.
Results and conclusions: Based on the intra-rater reliability scores, four judges were identified as suitable to be included in the analyses. Results showed that the frequency of F1 and F2 correlated strongly with the auditory-perception of twang-like voices (0.90 and 0.74, respectively), whereas F3 showed a moderate negative correlation (-0.52). The frequency difference between F1 and F3 showed a strong negative correlation (-0.82). The mean energy between 1-2 kHz and 2-3 kHz correlated moderately (0.51 and 0.42, respectively). The frequency of F4 and F5, and the energy above 3 kHz showed weak correlations. Since the spectral changes under 2 kHz have been associated with the jaw, lips, and tongue adjustments (i.e. vowel articulation) and a higher vertical laryngeal position might affect the frequency of all formants (including F1 and F2), our results suggest that vowel articulation and the laryngeal height may be relevant when performing twang-like voices.
期刊介绍:
Logopedics Phoniatrics Vocology is an amalgamation of the former journals Scandinavian Journal of Logopedics & Phoniatrics and VOICE.
The intention is to cover topics related to speech, language and voice pathology as well as normal voice function in its different aspects. The Journal covers a wide range of topics, including:
Phonation and laryngeal physiology
Speech and language development
Voice disorders
Clinical measurements of speech, language and voice
Professional voice including singing
Bilingualism
Cleft lip and palate
Dyslexia
Fluency disorders
Neurolinguistics and psycholinguistics
Aphasia
Motor speech disorders
Voice rehabilitation of laryngectomees
Augmentative and alternative communication
Acoustics
Dysphagia
Publications may have the form of original articles, i.e. theoretical or methodological studies or empirical reports, of reviews of books and dissertations, as well as of short reports, of minor or ongoing studies or short notes, commenting on earlier published material. Submitted papers will be evaluated by referees with relevant expertise.