Spectral features related to the auditory perception of twang-like voices.

IF 0.7 | Zone 4 (Medicine) | JCR Q4 | Audiology & Speech-Language Pathology
Marcelo Saldías O'Hrens, Christian Castro, Víctor M Espinoza, Justin Stoney, Camilo Quezada, Anne-Maria Laukkanen
Logopedics Phoniatrics Vocology, journal article, published 2024-04-24. DOI: 10.1080/14015439.2024.2345373 (https://doi.org/10.1080/14015439.2024.2345373)
Citations: 0

Abstract


Background: To the best of our knowledge, studies on the relationship between spectral energy distribution and the perceived degree of twang in voices are still sparse. Through an auditory-perceptual test, we aimed to explore the spectral features that may relate to the auditory perception of twang-like voices.

Methods: Ten judges who were blind to the test's tasks and stimuli rated the amount of twang perceived in seventy-six audio samples. The stimuli consisted of twenty voice samples recorded from eight CCM (contemporary commercial music) singers who sustained the vowel [a:] at different pitches, with and without a twang-like voice. In addition, forty filtered and sixteen synthesized-manipulated stimuli were included.

Results and conclusions: Based on the intra-rater reliability scores, four judges were identified as suitable for inclusion in the analyses. The frequencies of F1 and F2 correlated strongly with the auditory perception of twang-like voices (0.90 and 0.74, respectively), whereas F3 showed a moderate negative correlation (-0.52). The frequency difference between F1 and F3 showed a strong negative correlation (-0.82). The mean energy in the 1-2 kHz and 2-3 kHz bands correlated moderately (0.51 and 0.42, respectively). The frequencies of F4 and F5 and the energy above 3 kHz showed weak correlations. Since spectral changes below 2 kHz have been associated with jaw, lip, and tongue adjustments (i.e., vowel articulation), and a higher vertical laryngeal position might affect the frequency of all formants (including F1 and F2), our results suggest that vowel articulation and laryngeal height may be relevant when performing twang-like voices.
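To make the kind of analysis reported above concrete, the following is a minimal sketch of how a formant frequency, or the F3 minus F1 difference, could be correlated with perceptual twang ratings. The abstract does not state which correlation coefficient was used, so the Pearson coefficient here is an assumption, and the function name `pearson_r` and all numeric values are hypothetical illustrations, not data from the study.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical values for illustration only; NOT measurements from the study.
f1_hz = [650.0, 700.0, 780.0, 820.0, 900.0]        # first formant per stimulus
f3_hz = [2900.0, 2850.0, 2800.0, 2780.0, 2700.0]   # third formant per stimulus
twang_rating = [1.0, 2.0, 2.5, 4.0, 4.5]           # mean perceived twang per stimulus

# Derived feature: frequency distance between F1 and F3.
f3_minus_f1 = [f3 - f1 for f1, f3 in zip(f1_hz, f3_hz)]

r_f1 = pearson_r(f1_hz, twang_rating)
r_diff = pearson_r(f3_minus_f1, twang_rating)

print(f"F1 vs. rating:      r = {r_f1:.2f}")
print(f"F3 - F1 vs. rating: r = {r_diff:.2f}")
```

In this toy data, a rising F1 accompanies higher ratings and a shrinking F1 to F3 distance accompanies higher ratings, so the two coefficients have opposite signs, mirroring the direction (not the magnitude) of the reported results.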

Source journal
Logopedics Phoniatrics Vocology (Medicine: Otorhinolaryngology)
CiteScore: 2.50
Self-citation rate: 9.10%
Articles published: 21
Review time: >12 weeks
Journal description: Logopedics Phoniatrics Vocology is an amalgamation of the former journals Scandinavian Journal of Logopedics & Phoniatrics and VOICE. The intention is to cover topics related to speech, language and voice pathology as well as normal voice function in its different aspects. The Journal covers a wide range of topics, including: phonation and laryngeal physiology; speech and language development; voice disorders; clinical measurements of speech, language and voice; professional voice including singing; bilingualism; cleft lip and palate; dyslexia; fluency disorders; neurolinguistics and psycholinguistics; aphasia; motor speech disorders; voice rehabilitation of laryngectomees; augmentative and alternative communication; acoustics; dysphagia. Publications may take the form of original articles (theoretical or methodological studies or empirical reports), reviews of books and dissertations, and short reports of minor or ongoing studies or short notes commenting on earlier published material. Submitted papers will be evaluated by referees with relevant expertise.