Emotion recognition based on EMD-Wavelet analysis of speech signals
C. Shahnaz, S. Sultana, S. Fattah, R. H. M. Rafi, I. Ahmmed, Weiping Zhu, M. Ahmad
2015 IEEE International Conference on Digital Signal Processing (DSP), 2015. DOI: 10.1109/ICDSP.2015.7251881
Citations: 7
Abstract
In this paper, a speech emotion recognition method is proposed based on wavelet analysis of decomposed speech data obtained via empirical mode decomposition (EMD). Instead of analyzing the given speech signal directly, the intrinsic mode functions (IMFs) are first extracted using EMD, and the discrete wavelet transform (DWT) is then applied only to the selected dominant IMFs. Both the approximation and detail DWT coefficients of the dominant IMFs are taken into consideration. It is found that certain higher-order statistics of these EMD-DWT coefficients exhibit distinguishing characteristics across different emotions, and these statistical parameters are chosen as the desired features. For classification, a K-nearest neighbor (KNN) classifier is employed along with hierarchical clustering. Extensive simulations are carried out on the widely used EMO-DB speech emotion database using four emotion classes, namely angry, happy, sad, and neutral. Simulation results show that the proposed EMD-Wavelet-based features provide quite satisfactory recognition performance with a reduced feature dimension.
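Below is a minimal sketch of the EMD-DWT feature pipeline the abstract describes, assuming the PyEMD, PyWavelets, SciPy, and scikit-learn packages. The wavelet choice (`db4`), the energy-based selection of dominant IMFs, the particular higher-order statistics, and the value of k are illustrative assumptions rather than the paper's exact settings, and the hierarchical-clustering stage mentioned in the abstract is omitted for brevity.

```python
"""Sketch of EMD-DWT feature extraction with KNN classification (assumptions noted above)."""
import numpy as np
import pywt
from PyEMD import EMD
from scipy.stats import skew, kurtosis
from sklearn.neighbors import KNeighborsClassifier


def emd_dwt_features(signal, n_dominant_imfs=2, wavelet="db4"):
    """Decompose a speech signal with EMD, apply a single-level DWT to the
    dominant IMFs, and return higher-order statistics of the coefficients."""
    imfs = EMD()(signal)                      # rows are IMFs, high to low frequency
    # Treat the highest-energy IMFs as "dominant" (an assumption).
    energies = np.sum(imfs ** 2, axis=1)
    dominant = imfs[np.argsort(energies)[::-1][:n_dominant_imfs]]

    feats = []
    for imf in dominant:
        cA, cD = pywt.dwt(imf, wavelet)       # approximation and detail coefficients
        for coeffs in (cA, cD):
            feats.extend([skew(coeffs), kurtosis(coeffs),
                          np.var(coeffs), np.mean(np.abs(coeffs))])
    return np.asarray(feats)


def classify(train_signals, train_labels, test_signals, k=5):
    """train_signals/test_signals: lists of 1-D speech waveforms (e.g. EMO-DB
    utterances); train_labels: emotion labels such as angry/happy/sad/neutral."""
    F_train = np.vstack([emd_dwt_features(x) for x in train_signals])
    F_test = np.vstack([emd_dwt_features(x) for x in test_signals])
    knn = KNeighborsClassifier(n_neighbors=k)
    knn.fit(F_train, train_labels)
    return knn.predict(F_test)
```

With two dominant IMFs and four statistics per coefficient band, this sketch yields a 16-dimensional feature vector per utterance, which illustrates the reduced feature dimension the abstract refers to.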