不平衡样本下基于欧拉视频放大和三维残差网络的微表情识别

Liangyu Zhu, Yujun He, Xiaoqing Yang, Hui Li, Xiangqian Long
{"title":"不平衡样本下基于欧拉视频放大和三维残差网络的微表情识别","authors":"Liangyu Zhu, Yujun He, Xiaoqing Yang, Hui Li, Xiangqian Long","doi":"10.1088/2631-8695/ad5f14","DOIUrl":null,"url":null,"abstract":"\n A student's verbal behavior plays a crucial role in education, while nonverbal behavior, such as micro-expressions, significantly improves teaching quality. To address the problem of small facial expression movements, imbalanced data categories, and lack of temporal information in static expressions, a micro-expression recognition method is proposed based on Eulerian Video Magnification (EVM) and a 3D Residual Network (3D ResNet) under imbalanced samples. Firstly, face detection in the Dlib library is used to locate the face in the micro-expression video sample and crop it. Secondly, the EVM is used to magnify the motion features in micro-expressions. Then, the 3D ResNet is used to extract spatio-temporal information from micro-expression video samples, and the Cyclical Focal Loss (CFL) function is introduced in the network training process to solve the class imbalance problem in micro-expression datasets. Finally, the roles of the EVM and the CFL function in recognizing micro-expressions by the 3D ResNet are analyzed. The experimental results on the Spontaneous Micro-expression Database (SMIC) and Chinese Academy of Sciences Micro-expression Database II (CASME II) demonstrate the effectiveness and superiority of this method. The proposed method can assist in teaching evaluation and promote the development of smart classrooms, and further research is needed on the storage and computing of the proposed method on devices.","PeriodicalId":505725,"journal":{"name":"Engineering Research Express","volume":"13 S15","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Micro-expression recognition based on euler video magnification and 3D residual network under imbalanced sample\",\"authors\":\"Liangyu Zhu, Yujun He, Xiaoqing Yang, Hui Li, Xiangqian Long\",\"doi\":\"10.1088/2631-8695/ad5f14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n A student's verbal behavior plays a crucial role in education, while nonverbal behavior, such as micro-expressions, significantly improves teaching quality. To address the problem of small facial expression movements, imbalanced data categories, and lack of temporal information in static expressions, a micro-expression recognition method is proposed based on Eulerian Video Magnification (EVM) and a 3D Residual Network (3D ResNet) under imbalanced samples. Firstly, face detection in the Dlib library is used to locate the face in the micro-expression video sample and crop it. Secondly, the EVM is used to magnify the motion features in micro-expressions. Then, the 3D ResNet is used to extract spatio-temporal information from micro-expression video samples, and the Cyclical Focal Loss (CFL) function is introduced in the network training process to solve the class imbalance problem in micro-expression datasets. Finally, the roles of the EVM and the CFL function in recognizing micro-expressions by the 3D ResNet are analyzed. The experimental results on the Spontaneous Micro-expression Database (SMIC) and Chinese Academy of Sciences Micro-expression Database II (CASME II) demonstrate the effectiveness and superiority of this method. The proposed method can assist in teaching evaluation and promote the development of smart classrooms, and further research is needed on the storage and computing of the proposed method on devices.\",\"PeriodicalId\":505725,\"journal\":{\"name\":\"Engineering Research Express\",\"volume\":\"13 S15\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Research Express\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1088/2631-8695/ad5f14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Research Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2631-8695/ad5f14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

学生的语言行为在教育中起着至关重要的作用,而微表情等非语言行为则能显著提高教学质量。针对面部表情动作小、数据类别不平衡、静态表情缺乏时间信息等问题,提出了一种基于欧拉视频放大(EVM)和不平衡样本下的三维残差网络(3D ResNet)的微表情识别方法。首先,使用 Dlib 库中的人脸检测功能来定位微表情视频样本中的人脸并对其进行裁剪。其次,使用 EVM 放大微表情中的运动特征。然后,使用 3D ResNet 从微表情视频样本中提取时空信息,并在网络训练过程中引入循环焦点损失(CFL)函数,以解决微表情数据集中的类不平衡问题。最后,分析了 EVM 和 CFL 函数在三维 ResNet 识别微表情中的作用。在自发微表情数据库(SMIC)和中国科学院微表情数据库 II(CASME II)上的实验结果证明了该方法的有效性和优越性。所提出的方法可以辅助教学评价,促进智慧课堂的发展,而所提出的方法在设备上的存储和计算还需要进一步研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Micro-expression recognition based on euler video magnification and 3D residual network under imbalanced sample
A student's verbal behavior plays a crucial role in education, while nonverbal behavior, such as micro-expressions, significantly improves teaching quality. To address the problem of small facial expression movements, imbalanced data categories, and lack of temporal information in static expressions, a micro-expression recognition method is proposed based on Eulerian Video Magnification (EVM) and a 3D Residual Network (3D ResNet) under imbalanced samples. Firstly, face detection in the Dlib library is used to locate the face in the micro-expression video sample and crop it. Secondly, the EVM is used to magnify the motion features in micro-expressions. Then, the 3D ResNet is used to extract spatio-temporal information from micro-expression video samples, and the Cyclical Focal Loss (CFL) function is introduced in the network training process to solve the class imbalance problem in micro-expression datasets. Finally, the roles of the EVM and the CFL function in recognizing micro-expressions by the 3D ResNet are analyzed. The experimental results on the Spontaneous Micro-expression Database (SMIC) and Chinese Academy of Sciences Micro-expression Database II (CASME II) demonstrate the effectiveness and superiority of this method. The proposed method can assist in teaching evaluation and promote the development of smart classrooms, and further research is needed on the storage and computing of the proposed method on devices.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信