Speaker- and Corpus-Independent Methods for Affect Classification in Computational Paralinguistics

Proceedings of the 16th International Conference on Multimodal Interaction Pub Date : 2014-11-12 DOI:10.1145/2663204.2666284

Heysem Kaya

引用次数: 1

Abstract

The analysis of spoken emotions is of increasing interest in human computer interaction, in order to drive the machine communication into a humane manner. It has manifold applications ranging from intelligent tutoring systems to affect sensitive robots, from smart call centers to patient telemonitoring. In general the study of computational paralinguistics, which covers the analysis of speaker states and traits, faces with real life challenges of inter-speaker and inter-corpus variability. In this paper, a brief summary of the progress and future directions of my PhD study titled Adaptive Mixture Models for Speech Emotion Recognition that targets these challenges are given. An automatic mixture model selection method for Mixture of Factor Analyzers is proposed for modeling high dimensional data. To provide the mentioned statistical method a compact set of potent features, novel feature selection methods based on Canonical Correlation Analysis are introduced.

查看原文本刊更多论文

计算副语言学中独立于说话人和语料库的情感分类方法

为了使机器交流更人性化，对言语情感的分析在人机交互中日益引起人们的兴趣。它有多种应用，从智能辅导系统到影响敏感的机器人，从智能呼叫中心到患者远程监控。计算副语言学的研究涵盖了说话人状态和特征的分析，在现实生活中面临着说话人之间和语料库之间变化的挑战。在本文中，简要总结了我的博士研究的进展和未来的方向，题为自适应混合模型的语音情感识别，针对这些挑战。针对高维数据建模问题，提出了一种混合因子分析仪自动混合模型选择方法。为了给上述统计方法提供一个紧凑的有效特征集，提出了一种基于典型相关分析的特征选择方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 16th International Conference on Multimodal Interaction

自引率

0.00%

发文量