Face Mask Effects on Acoustic Features and Intelligibility of Mandarin Chinese Speech.

IF 2.2, Zone 2 (Medicine), JCR Q1, AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
Wei Hu, Libo Qiao, Lei Wu, Guoli Yan, Lihong Wang, Can Xu, Yao Chen, Chang Liu
Journal of Speech Language and Hearing Research, pages 1-16. Published 2025-05-14. Publication type: Journal Article.
DOI: 10.1044/2025_JSLHR-24-00446 (https://doi.org/10.1044/2025_JSLHR-24-00446)
Citations: 0

Abstract

Face Mask Effects on Acoustic Features and Intelligibility of Mandarin Chinese Speech.

Purpose: The goal of this study was to investigate how face masks influence the acoustic features of Mandarin Chinese running speech in both the temporal and spectral domains, and how the intelligibility of speech produced with face masks is affected in quiet and in multitalker babble. The relationship between the acoustic features and speech intelligibility was also examined.

Method: In Experiment 1, Mandarin Chinese sentences were recorded by 24 native Mandarin Chinese speakers wearing a surgical mask, a KN95 mask, or no mask. Temporal modulation (TM) depth, speaking rate, spectral tilt, and the mean and standard deviation of fundamental frequency (F0) were then examined. In Experiment 2, the intelligibility of these recorded sentences was assessed in quiet and in multitalker babble at signal-to-noise ratios of -2 and -5 dB. To further examine the possible causal relationship between the affected acoustic variables and speech intelligibility under the different mask-wearing conditions, the acoustic and speech intelligibility data were analyzed in a stepwise regression.
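
The noise conditions in Experiment 2 present speech in multitalker babble at fixed signal-to-noise ratios. The abstract does not describe how the stimuli were mixed, so the following is only a minimal sketch of one common approach: scale the babble so that the ratio of mean speech power to mean babble power matches the target SNR. The function names `mix_at_snr` and `measured_snr_db` are hypothetical, and the random signals are placeholders, not the study's recordings.

```python
import numpy as np

def mix_at_snr(speech, babble, snr_db):
    """Scale babble so the speech-to-babble power ratio equals snr_db, then add.

    Assumption: SNR is defined on long-term mean power over the whole sentence.
    """
    speech = np.asarray(speech, dtype=float)
    babble = np.asarray(babble, dtype=float)[: len(speech)]
    if len(babble) < len(speech):
        raise ValueError("babble must be at least as long as speech")
    p_speech = np.mean(speech ** 2)
    p_babble = np.mean(babble ** 2)
    # Gain that brings the babble to the power implied by the target SNR.
    gain = np.sqrt(p_speech / (p_babble * 10 ** (snr_db / 10)))
    return speech + gain * babble, gain

def measured_snr_db(speech, scaled_noise):
    """Return the achieved SNR in dB for a speech signal and its scaled noise."""
    return 10 * np.log10(np.mean(speech ** 2) / np.mean(scaled_noise ** 2))

# Placeholder signals (white noise), used only to exercise the function.
rng = np.random.default_rng(0)
speech = rng.standard_normal(16000)
babble = rng.standard_normal(16000)

for snr in (-2, -5):  # the two SNRs named in the abstract
    mixed, gain = mix_at_snr(speech, babble, snr)
    print(snr, round(measured_snr_db(speech, gain * babble), 2))
```

At negative SNRs the babble carries more power than the speech, which is why these conditions are the challenging ones.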

Results: Both the KN95 and surgical masks produced significantly smaller TM depth than the no-mask condition. In terms of speaking rate, participants spoke faster with face masks than without a mask, whereas there was no significant difference between the KN95 and surgical masks. Additionally, spectral tilt was significantly shallower for the two face masks than for the no-mask condition. Regarding F0, the mean F0 was higher with the KN95 mask than with the surgical mask or no mask, while the standard deviation of F0 was lower in the two mask conditions than in the no-mask condition, with no significant difference between the two types of masks. In addition to these acoustic differences, speech intelligibility in noise was significantly lower for the two mask conditions than for the no-mask condition, with no significant difference between the KN95 and surgical masks, whereas there was no significant effect of face masks on speech intelligibility in quiet. Finally, under noise conditions, TM depth, spectral tilt, and F0 dynamics (e.g., standard deviation) were significantly correlated with speech intelligibility, while speaking rate and mean F0 were not.
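
The two F0 measures reported here, mean F0 and the standard deviation of F0, can be illustrated with a short sketch. The abstract does not say how F0 was extracted or in what units the standard deviation was computed, so this example assumes a frame-wise F0 contour in Hz in which unvoiced frames are marked as 0 or NaN. The function name `f0_stats` and the sample contour are hypothetical.

```python
import numpy as np

def f0_stats(f0_hz):
    """Return (mean, sample standard deviation) of F0 over voiced frames.

    Assumption: unvoiced frames are coded as 0 or NaN and are excluded.
    """
    f0 = np.asarray(f0_hz, dtype=float)
    voiced = f0[np.isfinite(f0) & (f0 > 0)]
    if voiced.size < 2:
        raise ValueError("need at least two voiced frames")
    return float(voiced.mean()), float(voiced.std(ddof=1))

# Made-up contour for illustration only: three voiced frames.
contour = [0.0, 200.0, 210.0, float("nan"), 190.0, 0.0]
mean_f0, sd_f0 = f0_stats(contour)
print(mean_f0, sd_f0)
```

A smaller standard deviation indicates a flatter pitch contour, which is the sense in which the abstract refers to reduced F0 dynamics.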

Conclusions: Acoustically, face masks led to smaller TM depth, slower speaking rate, shallower spectral tilt, higher mean F0, and smaller standard deviation of F0 in Mandarin Chinese running speech. Perceptually, they resulted in lower speech intelligibility in noise but had no impact on speech intelligibility in quiet. The findings also suggest that certain acoustic characteristics (e.g., TM depth and spectral tilt) play important roles in speech intelligibility, especially in challenging listening conditions.

Source journal
Journal of Speech Language and Hearing Research (AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY-REHABILITATION)
CiteScore: 4.10
Self-citation rate: 19.20%
Articles published: 538
Review time: 4-8 weeks
About the journal: Mission: JSLHR publishes peer-reviewed research and other scholarly articles on the normal and disordered processes in speech, language, hearing, and related areas such as cognition, oral-motor function, and swallowing. The journal is an international outlet for both basic research on communication processes and clinical research pertaining to screening, diagnosis, and management of communication disorders as well as the etiologies and characteristics of these disorders. JSLHR seeks to advance evidence-based practice by disseminating the results of new studies as well as providing a forum for critical reviews and meta-analyses of previously published work. Scope: The broad field of communication sciences and disorders, including speech production and perception; anatomy and physiology of speech and voice; genetics, biomechanics, and other basic sciences pertaining to human communication; mastication and swallowing; speech disorders; voice disorders; development of speech, language, or hearing in children; normal language processes; language disorders; disorders of hearing and balance; psychoacoustics; and anatomy and physiology of hearing.