Offset or Onset Frame: A Multi-Stream Convolutional Neural Network with CapsuleNet Module for Micro-expression Recognition

Nian Liu, Xinyu Liu, Zhihao Zhang, Xueming Xu, Tong Chen
{"title":"Offset or Onset Frame: A Multi-Stream Convolutional Neural Network with CapsuleNet Module for Micro-expression Recognition","authors":"Nian Liu, Xinyu Liu, Zhihao Zhang, Xueming Xu, Tong Chen","doi":"10.1109/ICIIBMS50712.2020.9336412","DOIUrl":null,"url":null,"abstract":"Micro-expression is a spontaneous facial expression, which may reveal people's real emotions. The micro-expression recognition has recently attracted much attention in psychology and computer vision community. In this paper, we designed a multi-stream Convolutional Neural Network (CNN) combined with the Capsule Network(CapsNet) module.named CNNCapsNet, to improve the performance of micro-expression recognition. Firstly, both vertical and horizontal optical flow are computed from the onset to the apex, and from the apex to the offset frame respectively, which is the first time that the offset frame information has been taken into account in the field of micro-expression recognition. Secondly, these four optical flow images and the grayscale image of apex frame are input into the five-stream CNN model to extract features. Finally, CapsNet completes micro-expression recognition by learning the features extracted by CNN. The method proposed in this paper are evaluated using the Leave-One-Subject-Out (LOSO) cross-validation protocol on CASME II. The results show that the offset information, which is often neglected, is more important than onset information for the recognition task. Our CNNCapsNet framework can achieve the accuracy of 64.63% for the five-class micro-expression classification.","PeriodicalId":243033,"journal":{"name":"2020 5th International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","volume":"192 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIBMS50712.2020.9336412","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Micro-expression is a spontaneous facial expression, which may reveal people's real emotions. The micro-expression recognition has recently attracted much attention in psychology and computer vision community. In this paper, we designed a multi-stream Convolutional Neural Network (CNN) combined with the Capsule Network(CapsNet) module.named CNNCapsNet, to improve the performance of micro-expression recognition. Firstly, both vertical and horizontal optical flow are computed from the onset to the apex, and from the apex to the offset frame respectively, which is the first time that the offset frame information has been taken into account in the field of micro-expression recognition. Secondly, these four optical flow images and the grayscale image of apex frame are input into the five-stream CNN model to extract features. Finally, CapsNet completes micro-expression recognition by learning the features extracted by CNN. The method proposed in this paper are evaluated using the Leave-One-Subject-Out (LOSO) cross-validation protocol on CASME II. The results show that the offset information, which is often neglected, is more important than onset information for the recognition task. Our CNNCapsNet framework can achieve the accuracy of 64.63% for the five-class micro-expression classification.
偏移或开始帧:微表情识别的多流卷积神经网络与荚膜模块
微表情是一种自发的面部表情,它可以揭示人们的真实情绪。微表情识别近年来受到了心理学和计算机视觉学界的广泛关注。本文设计了一种多流卷积神经网络(CNN)与胶囊网络(CapsNet)模块相结合,命名为CNNCapsNet,以提高微表情识别的性能。首先,从起点到顶点分别计算垂直光流和水平光流,从顶点到偏移帧分别计算垂直光流和水平光流,这是微表情识别领域首次考虑偏移帧信息。其次,将这四幅光流图像和顶点帧的灰度图像输入到五流CNN模型中提取特征;最后,CapsNet通过学习CNN提取的特征完成微表情识别。在CASME II上使用留一受试者(LOSO)交叉验证协议对本文提出的方法进行了评估。结果表明,对于识别任务来说,经常被忽略的偏移信息比起始信息更重要。我们的CNNCapsNet框架对五类微表情分类准确率达到64.63%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信