Challenges and Limits in Explaining and Acoustic Modeling of Voice Characteristics.

IF 2.4 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
Jana Wiechmann, Petra Wagner
{"title":"Challenges and Limits in Explaining and Acoustic Modeling of Voice Characteristics.","authors":"Jana Wiechmann, Petra Wagner","doi":"10.1016/j.jvoice.2025.07.036","DOIUrl":null,"url":null,"abstract":"<p><p>To this day, the assessment of human voices remains a challenge due to (i) inconsistencies in subjective ratings and (ii) the lack of objective measurements for the perceptual impressions of voice characteristics. This can lead to significant consequences in applied fields such as speech therapy, where the assessment of voices is crucial for a successful treatment. In this paper, we address the explanation of voice and its characteristics from two different angles: In a first study, 22 speech therapists in training assessed a set of 20 non-pathological voices regarding 20 voice characteristics before and after receiving an expert explanation. Although the expert explanation did not lead to an improvement in overall rating performance, the analysis still yielded valuable insights into the particular challenges for novice voice practitioners in their characterization of voices. A second study aimed at a better understanding of the link between perceived voice characteristics and acoustic features. A data set of 295 voice samples of the same corpus was labeled by an expert with regard to the same 20 voice characteristics as in the first study. Afterwards, we analyzed the speech samples using a set of acoustic features, which were then used as predictors in statistical models of the annotated characteristics. This analysis yielded a unique set of significant acoustic features as main effects predicting each individual voice characteristic, although the model fits were overall modest. Furthermore, all of the voice characteristic models showed interactions with the speakers' gender. These results suggest a necessity for paying special attention to gender differences when assessing voice. Interestingly, we obtained a tendency for a higher model accuracy for those voice characteristics that have also shown to be rated more accurately and consistently by human listeners.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2025.07.036","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

To this day, the assessment of human voices remains a challenge due to (i) inconsistencies in subjective ratings and (ii) the lack of objective measurements for the perceptual impressions of voice characteristics. This can lead to significant consequences in applied fields such as speech therapy, where the assessment of voices is crucial for a successful treatment. In this paper, we address the explanation of voice and its characteristics from two different angles: In a first study, 22 speech therapists in training assessed a set of 20 non-pathological voices regarding 20 voice characteristics before and after receiving an expert explanation. Although the expert explanation did not lead to an improvement in overall rating performance, the analysis still yielded valuable insights into the particular challenges for novice voice practitioners in their characterization of voices. A second study aimed at a better understanding of the link between perceived voice characteristics and acoustic features. A data set of 295 voice samples of the same corpus was labeled by an expert with regard to the same 20 voice characteristics as in the first study. Afterwards, we analyzed the speech samples using a set of acoustic features, which were then used as predictors in statistical models of the annotated characteristics. This analysis yielded a unique set of significant acoustic features as main effects predicting each individual voice characteristic, although the model fits were overall modest. Furthermore, all of the voice characteristic models showed interactions with the speakers' gender. These results suggest a necessity for paying special attention to gender differences when assessing voice. Interestingly, we obtained a tendency for a higher model accuracy for those voice characteristics that have also shown to be rated more accurately and consistently by human listeners.

声音特征的解释和声学建模的挑战和限制。
直到今天,人类声音的评估仍然是一个挑战,因为(i)主观评分不一致,(ii)缺乏对声音特征感知印象的客观测量。这可能会对语言治疗等应用领域产生重大影响,在这些领域,声音的评估对成功的治疗至关重要。在本文中,我们从两个不同的角度讨论了声音及其特征的解释:在第一项研究中,22名接受培训的语言治疗师在接受专家解释之前和之后评估了一组20个非病理声音的20个声音特征。虽然专家的解释并没有导致整体评分表现的改善,但分析仍然为新手在声音表征方面的特殊挑战提供了有价值的见解。第二项研究旨在更好地理解感知到的声音特征和声学特征之间的联系。同一语料库的295个语音样本的数据集由专家根据与第一项研究相同的20个语音特征进行标记。然后,我们使用一组声学特征对语音样本进行分析,然后将这些声学特征用作标注特征统计模型的预测因子。这个分析产生了一组独特的重要声学特征,作为预测每个人声音特征的主要影响,尽管模型总体上是适度的。此外,所有的语音特征模型都显示出与说话者性别的相互作用。这些结果表明,在评估声音时,有必要特别注意性别差异。有趣的是,我们获得了一种更高的模型准确性的趋势,这些声音特征也被人类听众评价得更准确和一致。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Voice
Journal of Voice 医学-耳鼻喉科学
CiteScore
4.00
自引率
13.60%
发文量
395
审稿时长
59 days
期刊介绍: The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信