Deep Learning for Continuous Multiple Time Series Annotations

Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop Pub Date : 2018-10-15 DOI:10.1145/3266302.3266305

Jian Huang, Ya Li, J. Tao, Zheng Lian, Mingyue Niu, Minghao Yang

引用次数: 4

Abstract

Learning from multiple annotations is an increasingly important research topic. Compared with conventional classification or regression problems, it faces more challenges because time-continuous annotations would result in noisy and temporal lags problems for continuous emotion recognition. In this paper, we address the problem by deep learning for continuous multiple time series annotations. We attach a novel crowd layer to the output layer of basic continuous emotion recognition system, which learns directly from the noisy labels of multiple annotators with end-to-end manner. The inputs of the system are multimodal features and the targets are multiple annotations, with the intention of learning an annotator-specific mapping. Our proposed method considers the ground truth as latent variables and multiple annotations are variant of ground truth by linear mapping. The experimental results show that our system can achieve superior performance and capture the reliabilities and biases of different annotators.

查看原文本刊更多论文

连续多时间序列注释的深度学习

从多个注释中学习是一个越来越重要的研究课题。与传统的分类或回归问题相比，时间连续注释会导致持续的情绪识别存在噪声和时间滞后问题，因此面临着更多的挑战。在本文中，我们通过深度学习来解决连续多时间序列注释的问题。我们在基本连续情感识别系统的输出层上附加了一个新的人群层，该系统以端到端的方式直接从多个标注者的噪声标签中学习。系统的输入是多模态特征，目标是多个注释，目的是学习特定于注释器的映射。该方法将真值作为潜在变量，通过线性映射将多个标注作为真值的变体。实验结果表明，该系统能够很好地捕捉到不同标注者的可靠性和偏差。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop

自引率

0.00%

发文量