Emotion recognition and reaction prediction in videos

Nimish Ronghe, Sayali S. Nakashe, A. Pawar, S. Bobde
{"title":"Emotion recognition and reaction prediction in videos","authors":"Nimish Ronghe, Sayali S. Nakashe, A. Pawar, S. Bobde","doi":"10.1109/ICRCICN.2017.8234476","DOIUrl":null,"url":null,"abstract":"Facial analysis in videos and images has been a relatively tough task for machine learning models. Recent use of deep learning approaches has demonstrated substantial improvement in results and reliability and can be used for problems such as face recognition, emotion recognition and emotion reaction prediction. In the case of emotion reaction, relevant information of emotions in individual frames often must be aggregated over a variable length sequence of frames and speech signal to produce an appreciable prediction. Emotion reaction prediction is a subset of sequence analysis task and heavily relies on dynamic temporal and spectral features. Convolution neural networks (CNNs) have been extensively used for emotion recognition problems and have produced reliable results. However, they lack the ability to extract time-series information from a sequence of inputs and cannot model an emotion transaction. Recurrent neural networks (RNNs) are being used profoundly due to their ability to yield impressive results on a variety of tasks in the field of sequence analysis. In this work, we propose a system for emotion recognition and reaction prediction in videos. The primary focus is experimental analysis of a hybrid CNN-RNN architecture for emotion transaction analysis that can recognize the emotion in a frame in a video and predict its appropriate reaction.","PeriodicalId":166298,"journal":{"name":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2017.8234476","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Facial analysis in videos and images has been a relatively tough task for machine learning models. Recent use of deep learning approaches has demonstrated substantial improvement in results and reliability and can be used for problems such as face recognition, emotion recognition and emotion reaction prediction. In the case of emotion reaction, relevant information of emotions in individual frames often must be aggregated over a variable length sequence of frames and speech signal to produce an appreciable prediction. Emotion reaction prediction is a subset of sequence analysis task and heavily relies on dynamic temporal and spectral features. Convolution neural networks (CNNs) have been extensively used for emotion recognition problems and have produced reliable results. However, they lack the ability to extract time-series information from a sequence of inputs and cannot model an emotion transaction. Recurrent neural networks (RNNs) are being used profoundly due to their ability to yield impressive results on a variety of tasks in the field of sequence analysis. In this work, we propose a system for emotion recognition and reaction prediction in videos. The primary focus is experimental analysis of a hybrid CNN-RNN architecture for emotion transaction analysis that can recognize the emotion in a frame in a video and predict its appropriate reaction.
视频中的情绪识别与反应预测
对于机器学习模型来说,视频和图像中的面部分析一直是一项相对艰巨的任务。最近深度学习方法的使用已经证明在结果和可靠性方面有了实质性的改进,可以用于人脸识别、情绪识别和情绪反应预测等问题。在情绪反应的情况下,个体帧中的情绪相关信息通常必须在可变长度的帧序列和语音信号中进行汇总,以产生可感知的预测。情绪反应预测是序列分析任务的一个子集,严重依赖于动态的时间和频谱特征。卷积神经网络(cnn)已广泛应用于情绪识别问题,并产生了可靠的结果。然而,它们缺乏从输入序列中提取时间序列信息的能力,也不能为情感交易建模。递归神经网络(RNNs)由于其在序列分析领域的各种任务中产生令人印象深刻的结果的能力而被广泛使用。在这项工作中,我们提出了一个视频中的情绪识别和反应预测系统。本文的主要重点是对用于情感事务分析的混合CNN-RNN架构进行实验分析,该架构可以识别视频帧中的情感并预测其适当的反应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信