Relationship between phoneme-level spectral acoustics and speech intelligibility in healthy speech: a systematic review

IF 1 Q3 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
Timothy Pommée, M. Balaguer, J. Pinquier, J. Mauclair, V. Woisard, Renée Speyer
{"title":"Relationship between phoneme-level spectral acoustics and speech intelligibility in healthy speech: a systematic review","authors":"Timothy Pommée, M. Balaguer, J. Pinquier, J. Mauclair, V. Woisard, Renée Speyer","doi":"10.1080/2050571X.2021.1913300","DOIUrl":null,"url":null,"abstract":"ABSTRACT This study aims to systematically review original articles investigating the link between spectral acoustic measures in healthy talkers and perceived speech intelligibility, according to the PRISMA guidelines. Twenty-two studies were retained. Eighteen papers investigated vowel acoustics, one studied glides and eight articles investigated consonants, mostly sibilants. Various spectral measures and intelligibility estimates were used. The following measures were shown to be linked to sub-lexical perceived speech intelligibility ratings: for vowels, steady-state F1 and F2 measures, the F1 range, the [i]-[U] F2 difference, F0-F1 and F1-F2 differences in [è-A] and [q-è], the vowel space area, the mean amount of formant movement, the vector length and the spectral change measure; for consonants, the centroid energy and the spectral peak in the [s]-sound, as well as the steady-state F1 offset frequency in vowels preceding [t] and [d].To conclude, as speech is highly variable even in healthy adult speakers, a better understanding of the imprecisions in healthy spontaneous speech will provide a more realistic baseline for the investigation of disordered speech. To date, no acoustic measure is able to predict speech intelligibility to a large extent. There is still extensive research to be carried out to identify relevant acoustic combinations that could account for perceived speech variations (e.g. vowel and consonant reductions) and to gather normative data from a large number of healthy speakers. To that end, speech-related terms (e.g. intelligibility, comprehensibility, severity) need to be clearly defined and methodologies described in sufficient details to allow for replication, cross-comparisons/meta-analyses and pooling of data. .","PeriodicalId":43000,"journal":{"name":"Speech Language and Hearing","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2021-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Speech Language and Hearing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/2050571X.2021.1913300","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 4

Abstract

ABSTRACT This study aims to systematically review original articles investigating the link between spectral acoustic measures in healthy talkers and perceived speech intelligibility, according to the PRISMA guidelines. Twenty-two studies were retained. Eighteen papers investigated vowel acoustics, one studied glides and eight articles investigated consonants, mostly sibilants. Various spectral measures and intelligibility estimates were used. The following measures were shown to be linked to sub-lexical perceived speech intelligibility ratings: for vowels, steady-state F1 and F2 measures, the F1 range, the [i]-[U] F2 difference, F0-F1 and F1-F2 differences in [è-A] and [q-è], the vowel space area, the mean amount of formant movement, the vector length and the spectral change measure; for consonants, the centroid energy and the spectral peak in the [s]-sound, as well as the steady-state F1 offset frequency in vowels preceding [t] and [d].To conclude, as speech is highly variable even in healthy adult speakers, a better understanding of the imprecisions in healthy spontaneous speech will provide a more realistic baseline for the investigation of disordered speech. To date, no acoustic measure is able to predict speech intelligibility to a large extent. There is still extensive research to be carried out to identify relevant acoustic combinations that could account for perceived speech variations (e.g. vowel and consonant reductions) and to gather normative data from a large number of healthy speakers. To that end, speech-related terms (e.g. intelligibility, comprehensibility, severity) need to be clearly defined and methodologies described in sufficient details to allow for replication, cross-comparisons/meta-analyses and pooling of data. .
健康言语中音素级频谱声学与语音可理解度关系的系统综述
本研究旨在根据PRISMA指南,系统地回顾研究健康说话者的频谱声学测量与感知语音清晰度之间联系的原始文章。22项研究被保留。18篇论文研究元音声学,1篇研究滑音,8篇研究辅音,主要是音节。使用了各种光谱测量和可理解性估计。以下测量被证明与亚词汇感知语音可理解度评级有关:对于元音,稳态F1和F2测量,F1范围,[i]-[U] F2差异,[i]- F1和[q-è]中的F0-F1和F1-F2差异,元音空间面积,平均形成峰移动量,矢量长度和频谱变化测量;对于辅音,[s]-音的质心能量和谱峰,以及[t]和[d]前面元音的稳态F1偏移频率。综上所述,即使在健康的成年说话者中,言语也是高度可变的,更好地理解健康自发言语中的不精确将为言语障碍的研究提供更现实的基线。迄今为止,没有任何声学测量能够在很大程度上预测语音可理解性。仍有广泛的研究需要进行,以确定相关的声学组合,可以解释感知到的语音变化(例如元音和辅音的减少),并从大量健康的说话者那里收集规范数据。为此,需要明确定义与语音相关的术语(如可理解性、可理解性、严重性),并对方法进行足够详细的描述,以便进行复制、交叉比较/元分析和数据汇集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Speech Language and Hearing
Speech Language and Hearing AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY-
CiteScore
2.30
自引率
6.70%
发文量
11
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信