{"title":"Recognizing Emotional State Changes Using Speech Processing","authors":"Reza Ashrafidoost, S. Setayeshi, A. Sharifi","doi":"10.1109/EMS.2016.017","DOIUrl":null,"url":null,"abstract":"Research on understanding human emotion in speech seeks to determine the mood of an utterance by analyzing cognitive attributes extracted from the acoustic speech signal. Speech contains rich patterns that are altered by the speaker's mood. This paper examines speech drawn from a standard database and from long-term recordings to analyze how an individual speaker's mood changes over the course of extended speech. We introduce a learning method based on a statistical model that classifies the emotional states and moods of an utterance and tracks their changes. To this end, the perceptual background of the individual speaker is analyzed and classified during speech to extract the patterns embedded in the speech signal. The proposed method classifies the emotions of an utterance into seven standard classes: happiness, anger, boredom, fear, disgust, neutral, and sadness. We use the standard EmoDB speech corpus for the training phase of this approach. After pre-processing, speech patterns and meaningful attributes are extracted with the Mel-Frequency Cepstral Coefficients (MFCC) method and selected by Sequential Forward Selection (SFS); we then apply a statistical classification approach, LGMM, to categorize the resulting features and, finally, illustrate the trend of changes in the emotional states.","PeriodicalId":446936,"journal":{"name":"2016 European Modelling Symposium (EMS)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 European Modelling Symposium (EMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EMS.2016.017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
Research on understanding human emotion in speech seeks to determine the mood of an utterance by analyzing cognitive attributes extracted from the acoustic speech signal. Speech contains rich patterns that are altered by the speaker's mood. This paper examines speech drawn from a standard database and from long-term recordings to analyze how an individual speaker's mood changes over the course of extended speech. We introduce a learning method based on a statistical model that classifies the emotional states and moods of an utterance and tracks their changes. To this end, the perceptual background of the individual speaker is analyzed and classified during speech to extract the patterns embedded in the speech signal. The proposed method classifies the emotions of an utterance into seven standard classes: happiness, anger, boredom, fear, disgust, neutral, and sadness. We use the standard EmoDB speech corpus for the training phase of this approach. After pre-processing, speech patterns and meaningful attributes are extracted with the Mel-Frequency Cepstral Coefficients (MFCC) method and selected by Sequential Forward Selection (SFS); we then apply a statistical classification approach, LGMM, to categorize the resulting features and, finally, illustrate the trend of changes in the emotional states.
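The pipeline described above extracts MFCC features and then prunes them with Sequential Forward Selection before classification. The greedy SFS step can be sketched as follows — a minimal pure-Python illustration on toy data, where a simple nearest-centroid scorer stands in (as an assumption for illustration only) for the paper's LGMM classifier, and all variable names are hypothetical:

```python
def nearest_centroid_accuracy(X, y, feats):
    """Score a feature subset: classify each sample by its nearest
    class centroid (resubstitution accuracy) using only `feats`."""
    classes = sorted(set(y))
    centroids = {}
    for c in classes:
        rows = [x for x, label in zip(X, y) if label == c]
        centroids[c] = [sum(r[f] for r in rows) / len(rows) for f in feats]
    correct = 0
    for x, label in zip(X, y):
        pred = min(classes, key=lambda c: sum(
            (x[f] - m) ** 2 for f, m in zip(feats, centroids[c])))
        correct += pred == label
    return correct / len(X)

def sequential_forward_selection(X, y, k):
    """Greedy SFS: repeatedly add the single feature that most
    improves the subset score, until k features are selected."""
    selected, remaining = [], list(range(len(X[0])))
    while len(selected) < k and remaining:
        best = max(remaining,
                   key=lambda f: nearest_centroid_accuracy(X, y, selected + [f]))
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy data: feature 0 separates the two classes; features 1-2 are noise.
X = [[0.0, 5.0, 1.0], [0.1, 4.0, 9.0], [0.2, 5.5, 3.0],
     [1.0, 5.1, 2.0], [1.1, 4.2, 8.0], [1.2, 5.4, 4.0]]
y = [0, 0, 0, 1, 1, 1]
print(sequential_forward_selection(X, y, 2))  # the informative feature 0 is picked first
```

In the paper's setting, each row of `X` would instead hold the MFCC attributes of one speech frame or utterance, and the scorer would be the statistical (LGMM) classifier rather than this toy nearest-centroid rule.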