Estimation of Affective Level in the Wild with Multiple Memory Networks

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Pub Date : 2017-07-21 DOI:10.1109/CVPRW.2017.244

Jianshu Li, Yunpeng Chen, Shengtao Xiao, Jian Zhao, S. Roy, Jiashi Feng, Shuicheng Yan, T. Sim

引用次数: 15

Abstract

This paper presents the proposed solution to the "affect in the wild" challenge, which aims to estimate the affective level, i.e. the valence and arousal values, of every frame in a video. A carefully designed deep convolutional neural network (a variation of residual network) for affective level estimation of facial expressions is first implemented as a baseline. Next we use multiple memory networks to model the temporal relations between the frames. Finally ensemble models are used to combine the predictions from multiple memory networks. Our proposed solution outperforms the baseline model by a factor of 10.62% in terms of mean square error (MSE).

查看原文本刊更多论文

基于多记忆网络的野外情感水平估计

本文提出了“野外情感”挑战的解决方案，该挑战旨在估计视频中每帧的情感水平，即价值和唤醒值。首先将精心设计的深度卷积神经网络(残差网络的一种变体)用于面部表情的情感水平估计作为基线。接下来，我们使用多个记忆网络来建模帧之间的时间关系。最后，采用集成模型对多个记忆网络的预测结果进行组合。我们提出的解决方案在均方误差(MSE)方面优于基线模型10.62%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

自引率

0.00%

发文量