Emotion recognition using LP residual

Arun Chauhan, S. Koolagudi, Sabin Kafley, K. S. Rao
{"title":"Emotion recognition using LP residual","authors":"Arun Chauhan, S. Koolagudi, Sabin Kafley, K. S. Rao","doi":"10.1109/TECHSYM.2010.5469162","DOIUrl":null,"url":null,"abstract":"This paper explores the Linear Prediction (LP) residual of speech signal for characterizing the basic emotions. The emotions used in this study are anger, compassion, disgust, fear, happy, neutral, sarcastic and surprise. LP residual is derived by inverse filtering of the speech signal, and the process is known as LP analysis. LP residual mainly contains higher order relations among the samples. For capturing the emotion specific information from these higher order relations, autoassociative neural network (AANN) and Gaussian mixture models (GMM) are used. The decrease in the error during training phase of the AANN's and the emotion recognition performance of the models, demonstrate that the excitation source component of speech contains emotion-specific information and is indeed being captured by the AANN and GMM models. IITKGP-Simulated Emotion Speech Corpus (IITKGP-SESC) is used as a database, for characterization and classification of emotions. The emotion recognition performance is observed to be about 56 %.","PeriodicalId":262830,"journal":{"name":"2010 IEEE Students Technology Symposium (TechSym)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Students Technology Symposium (TechSym)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TECHSYM.2010.5469162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28

Abstract

This paper explores the Linear Prediction (LP) residual of speech signal for characterizing the basic emotions. The emotions used in this study are anger, compassion, disgust, fear, happy, neutral, sarcastic and surprise. LP residual is derived by inverse filtering of the speech signal, and the process is known as LP analysis. LP residual mainly contains higher order relations among the samples. For capturing the emotion specific information from these higher order relations, autoassociative neural network (AANN) and Gaussian mixture models (GMM) are used. The decrease in the error during training phase of the AANN's and the emotion recognition performance of the models, demonstrate that the excitation source component of speech contains emotion-specific information and is indeed being captured by the AANN and GMM models. IITKGP-Simulated Emotion Speech Corpus (IITKGP-SESC) is used as a database, for characterization and classification of emotions. The emotion recognition performance is observed to be about 56 %.
基于LP残差的情绪识别
本文探讨了语音信号的线性预测残差(LP)用于基本情绪表征的方法。在这项研究中使用的情绪是愤怒、同情、厌恶、恐惧、快乐、中性、讽刺和惊讶。通过对语音信号进行反滤波得到低噪声残差,这个过程被称为低噪声分析。LP残差主要包含样本间的高阶关系。为了从这些高阶关系中捕获情感特定信息,使用了自关联神经网络(AANN)和高斯混合模型(GMM)。在训练阶段误差的减小和模型的情绪识别性能,证明了语音的激励源成分包含情绪特定信息,并且确实被AANN和GMM模型捕获。iitkgp -模拟情绪语音语料库(IITKGP-SESC)被用作一个数据库,用于表征和分类情绪。观察到情绪识别性能约为56%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信