Real-time Emotion Detection System using Speech: Multi-modal Fusion of Different Timescale Features

2007 IEEE 9th Workshop on Multimedia Signal Processing Pub Date : 2007-10-01 DOI:10.1109/MMSP.2007.4412815

Samuel Kim, P. Georgiou, Sungbok Lee, Shrikanth S. Narayanan

引用次数: 102

Abstract

The goal of this work is to build a real-time emotion detection system which utilizes multi-modal fusion of different timescale features of speech. Conventional spectral and prosody features are used for intra-frame and supra-frame features respectively, and a new information fusion algorithm which takes care of the characteristics of each machine learning algorithm is introduced. In this framework, the proposed system can be associated with additional features, such as lexical or discourse information, in later steps. To verify the realtime system performance, binary decision tasks on angry and neutral emotion are performed using concatenated speech signal simulating realtime conditions.

查看原文本刊更多论文

语音实时情感检测系统:不同时间尺度特征的多模态融合

本文的目标是建立一个利用语音不同时间尺度特征的多模态融合的实时情感检测系统。帧内特征和帧外特征分别采用传统的谱特征和韵律特征，并引入了一种新的信息融合算法，该算法兼顾了每种机器学习算法的特点。在这个框架中，建议的系统可以在后面的步骤中与其他特征相关联，例如词汇或话语信息。为了验证系统的实时性，利用模拟实时条件的串联语音信号对愤怒情绪和中性情绪进行二元决策任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2007 IEEE 9th Workshop on Multimedia Signal Processing

自引率

0.00%

发文量