Significance of natural elicitation in developing simulated full blown speech emotion databases

D. Pravena, S. Nandhakumar, D. Govind
{"title":"Significance of natural elicitation in developing simulated full blown speech emotion databases","authors":"D. Pravena, S. Nandhakumar, D. Govind","doi":"10.1109/TECHSYM.2016.7872693","DOIUrl":null,"url":null,"abstract":"The work presented in this paper investigates the significance of natural elicitation of emotions during the development of simulated full blown emotion speech databases emotion analysis. A subset of primary emotions such as anger, happy and sad emotions along with neutral utterances are used in the present work. The first part of the work discusses the development of a simulated full blown emotion database by selecting 50 emotionally biased prompts for the recording the emotional speech data in Tamil language. For the comparative study, another simulated emotion database is developed by recording 50 neutral utterances for recording the emotion speech from the same speakers. The second part of the work is the comparison of emotion recognition performance of the simulated emotion speech databases using the basic Gaussian mixture model (GMM) based system with mel frequency cepstral coefficients (MFCC). A significant variations in the recognition rates of different emotions are observed for both the databases with emotionally biased utterances and emotionally neutral emotion utterances. Where the emotionally biased utterances observed to be more effective in discriminating emotions than emotionally neutral simulated emotion database. Also, the emotion recognition rates obtained for the simulated emotionally neutral emotion utterances follow the same trend as that of the classical German full blown simulated emotion database.","PeriodicalId":403350,"journal":{"name":"2016 IEEE Students’ Technology Symposium (TechSym)","volume":"301 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Students’ Technology Symposium (TechSym)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TECHSYM.2016.7872693","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

The work presented in this paper investigates the significance of natural elicitation of emotions during the development of simulated full blown emotion speech databases emotion analysis. A subset of primary emotions such as anger, happy and sad emotions along with neutral utterances are used in the present work. The first part of the work discusses the development of a simulated full blown emotion database by selecting 50 emotionally biased prompts for the recording the emotional speech data in Tamil language. For the comparative study, another simulated emotion database is developed by recording 50 neutral utterances for recording the emotion speech from the same speakers. The second part of the work is the comparison of emotion recognition performance of the simulated emotion speech databases using the basic Gaussian mixture model (GMM) based system with mel frequency cepstral coefficients (MFCC). A significant variations in the recognition rates of different emotions are observed for both the databases with emotionally biased utterances and emotionally neutral emotion utterances. Where the emotionally biased utterances observed to be more effective in discriminating emotions than emotionally neutral simulated emotion database. Also, the emotion recognition rates obtained for the simulated emotionally neutral emotion utterances follow the same trend as that of the classical German full blown simulated emotion database.
自然启发在开发模拟完整语音情感数据库中的意义
本文研究了情感自然激发在模拟完整情感语音数据库情感分析开发中的重要意义。在本研究中使用了愤怒、快乐和悲伤等主要情绪的子集以及中性话语。本文的第一部分讨论了通过选择50个情感偏见提示来记录泰米尔语情感语音数据的模拟完整情感数据库的开发。为了进行对比研究,我们建立了另一个模拟情绪数据库,通过记录50个中性话语来记录来自同一说话者的情绪语音。第二部分是利用基于基本高斯混合模型(GMM)的mel倒谱系数(MFCC)系统对模拟情感语音数据库的情感识别性能进行比较。不同情绪的识别率在情绪偏颇和情绪中立两种数据库中均有显著差异。结果表明,情绪偏见话语比情绪中立话语在识别情绪方面更有效。模拟的情绪中性情绪话语的情绪识别率与经典的德国完全成熟模拟情绪数据库的情绪识别率趋势相同。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信