Important Modulation Frequency Components of Temporal Amplitude Envelope Contributing to Vocal Emotion Perception.

Impact factor (IF): 2.2
Taiyang Guo, Shunsuke Kidani, Takuto Isoyama, Peter Birkholz, Masato Akagi, Masashi Unoki
Journal of speech, language, and hearing research : JSLHR, pp. 4205-4219. Journal Article. Published 2025-09-10 (Epub 2025-08-01).
DOI: 10.1044/2025_JSLHR-24-00825 (https://doi.org/10.1044/2025_JSLHR-24-00825)

Abstract

Purpose: Previous studies using noise-vocoded speech (NVS) have demonstrated the significance of the temporal amplitude envelope (TAE) of speech signals, and of related processes such as modulation perception, in vocal emotion perception. Because modulation processing of the TAE is also important in speech perception, researchers have begun to focus on the role of TAE modulation components. A previous study suggested that modulation frequency components contribute to vocal emotion perception, but which components are important remains unclear. This study aims to identify the modulation frequency components that are important for vocal emotion perception.

Method: Two experiments on vocal emotion perception using NVS were conducted with 10 native Japanese speakers (two women and eight men). In NVS generation, a modulation filterbank (MFB) was used to simulate modulation perception in the auditory system. The modulation frequency components of the TAE were bandpass and bandstop filtered using the filterbank, and the contribution of each component was evaluated by comparing the emotion recognition rates for the resulting NVS.
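
The bandpass and bandstop manipulation of the envelope can be illustrated with a minimal sketch. It is not the authors' modulation filterbank: it assumes the TAE has already been extracted and sampled at a known rate, and it stands in a simple DFT mask for a real filterbank. The function names (`dft`, `idft_real`, `filter_modulation`), the 100-Hz envelope sampling rate, and the synthetic 4-Hz and 32-Hz test components are all hypothetical choices made for illustration.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform, O(N^2); adequate for short envelopes."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft_real(spec):
    """Inverse DFT that returns only the real part of each sample."""
    n = len(spec)
    return [sum(spec[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def filter_modulation(envelope, fs, lo_hz, hi_hz, mode="bandpass"):
    """Keep (bandpass) or remove (bandstop) modulation content in [lo_hz, hi_hz].

    Assumption: an ideal brick-wall mask in the DFT domain replaces the
    smoother filters a real modulation filterbank would use.
    """
    if mode not in ("bandpass", "bandstop"):
        raise ValueError("mode must be 'bandpass' or 'bandstop'")
    n = len(envelope)
    masked = []
    for k, coeff in enumerate(dft(envelope)):
        # Bins above n/2 mirror negative frequencies, so fold them back.
        freq = min(k, n - k) * fs / n
        in_band = lo_hz <= freq <= hi_hz
        keep = in_band if mode == "bandpass" else not in_band
        masked.append(coeff if keep else 0j)
    return idft_real(masked)


# Synthetic 1-s envelope sampled at 100 Hz (hypothetical values):
# a DC offset plus a slow 4-Hz and a fast 32-Hz modulation.
fs = 100
times = [i / fs for i in range(fs)]
slow = [1.0 + 0.5 * math.sin(2 * math.pi * 4 * t) for t in times]
fast = [0.3 * math.sin(2 * math.pi * 32 * t) for t in times]
envelope = [s + f for s, f in zip(slow, fast)]

kept_0_16 = filter_modulation(envelope, fs, 0, 16, mode="bandpass")
removed_0_16 = filter_modulation(envelope, fs, 0, 16, mode="bandstop")
```

In this toy case the bandpass output retains the DC offset and the 4-Hz component, while the bandstop output retains only the 32-Hz component. A bandstop-filtered envelope can dip below zero, so a real vocoder would need to handle that before using it to modulate a noise carrier; the sketch leaves that step out.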

Results: The results indicate that using an MFB does not affect emotion perception in NVS. Modulation frequency components within the 0- to 16-Hz band are important for each emotion individually, as well as for all emotions collectively. The important modulation frequency components may differ slightly between positive and negative emotions; however, this observation should be interpreted cautiously and requires further verification because the number of emotional categories in this study was imbalanced.

Conclusion: This study investigated the modulation frequency components of the TAE that contribute to vocal emotion perception. The findings suggest that components in the 0- to 16-Hz band are important for vocal emotion perception.
