不平衡样本下基于欧拉视频放大和三维残差网络的微表情识别

Engineering Research Express Pub Date : 2024-07-03 DOI:10.1088/2631-8695/ad5f14

Liangyu Zhu, Yujun He, Xiaoqing Yang, Hui Li, Xiangqian Long

{"title":"不平衡样本下基于欧拉视频放大和三维残差网络的微表情识别","authors":"Liangyu Zhu, Yujun He, Xiaoqing Yang, Hui Li, Xiangqian Long","doi":"10.1088/2631-8695/ad5f14","DOIUrl":null,"url":null,"abstract":"\n A student's verbal behavior plays a crucial role in education, while nonverbal behavior, such as micro-expressions, significantly improves teaching quality. To address the problem of small facial expression movements, imbalanced data categories, and lack of temporal information in static expressions, a micro-expression recognition method is proposed based on Eulerian Video Magnification (EVM) and a 3D Residual Network (3D ResNet) under imbalanced samples. Firstly, face detection in the Dlib library is used to locate the face in the micro-expression video sample and crop it. Secondly, the EVM is used to magnify the motion features in micro-expressions. Then, the 3D ResNet is used to extract spatio-temporal information from micro-expression video samples, and the Cyclical Focal Loss (CFL) function is introduced in the network training process to solve the class imbalance problem in micro-expression datasets. Finally, the roles of the EVM and the CFL function in recognizing micro-expressions by the 3D ResNet are analyzed. The experimental results on the Spontaneous Micro-expression Database (SMIC) and Chinese Academy of Sciences Micro-expression Database II (CASME II) demonstrate the effectiveness and superiority of this method. The proposed method can assist in teaching evaluation and promote the development of smart classrooms, and further research is needed on the storage and computing of the proposed method on devices.","PeriodicalId":505725,"journal":{"name":"Engineering Research Express","volume":"13 S15","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Micro-expression recognition based on euler video magnification and 3D residual network under imbalanced sample\",\"authors\":\"Liangyu Zhu, Yujun He, Xiaoqing Yang, Hui Li, Xiangqian Long\",\"doi\":\"10.1088/2631-8695/ad5f14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n A student's verbal behavior plays a crucial role in education, while nonverbal behavior, such as micro-expressions, significantly improves teaching quality. To address the problem of small facial expression movements, imbalanced data categories, and lack of temporal information in static expressions, a micro-expression recognition method is proposed based on Eulerian Video Magnification (EVM) and a 3D Residual Network (3D ResNet) under imbalanced samples. Firstly, face detection in the Dlib library is used to locate the face in the micro-expression video sample and crop it. Secondly, the EVM is used to magnify the motion features in micro-expressions. Then, the 3D ResNet is used to extract spatio-temporal information from micro-expression video samples, and the Cyclical Focal Loss (CFL) function is introduced in the network training process to solve the class imbalance problem in micro-expression datasets. Finally, the roles of the EVM and the CFL function in recognizing micro-expressions by the 3D ResNet are analyzed. The experimental results on the Spontaneous Micro-expression Database (SMIC) and Chinese Academy of Sciences Micro-expression Database II (CASME II) demonstrate the effectiveness and superiority of this method. The proposed method can assist in teaching evaluation and promote the development of smart classrooms, and further research is needed on the storage and computing of the proposed method on devices.\",\"PeriodicalId\":505725,\"journal\":{\"name\":\"Engineering Research Express\",\"volume\":\"13 S15\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Research Express\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1088/2631-8695/ad5f14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Research Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2631-8695/ad5f14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

学生的语言行为在教育中起着至关重要的作用，而微表情等非语言行为则能显著提高教学质量。针对面部表情动作小、数据类别不平衡、静态表情缺乏时间信息等问题，提出了一种基于欧拉视频放大（EVM）和不平衡样本下的三维残差网络（3D ResNet）的微表情识别方法。首先，使用 Dlib 库中的人脸检测功能来定位微表情视频样本中的人脸并对其进行裁剪。其次，使用 EVM 放大微表情中的运动特征。然后，使用 3D ResNet 从微表情视频样本中提取时空信息，并在网络训练过程中引入循环焦点损失（CFL）函数，以解决微表情数据集中的类不平衡问题。最后，分析了 EVM 和 CFL 函数在三维 ResNet 识别微表情中的作用。在自发微表情数据库（SMIC）和中国科学院微表情数据库 II（CASME II）上的实验结果证明了该方法的有效性和优越性。所提出的方法可以辅助教学评价，促进智慧课堂的发展，而所提出的方法在设备上的存储和计算还需要进一步研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Micro-expression recognition based on euler video magnification and 3D residual network under imbalanced sample

A student's verbal behavior plays a crucial role in education, while nonverbal behavior, such as micro-expressions, significantly improves teaching quality. To address the problem of small facial expression movements, imbalanced data categories, and lack of temporal information in static expressions, a micro-expression recognition method is proposed based on Eulerian Video Magnification (EVM) and a 3D Residual Network (3D ResNet) under imbalanced samples. Firstly, face detection in the Dlib library is used to locate the face in the micro-expression video sample and crop it. Secondly, the EVM is used to magnify the motion features in micro-expressions. Then, the 3D ResNet is used to extract spatio-temporal information from micro-expression video samples, and the Cyclical Focal Loss (CFL) function is introduced in the network training process to solve the class imbalance problem in micro-expression datasets. Finally, the roles of the EVM and the CFL function in recognizing micro-expressions by the 3D ResNet are analyzed. The experimental results on the Spontaneous Micro-expression Database (SMIC) and Chinese Academy of Sciences Micro-expression Database II (CASME II) demonstrate the effectiveness and superiority of this method. The proposed method can assist in teaching evaluation and promote the development of smart classrooms, and further research is needed on the storage and computing of the proposed method on devices.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Research Express

自引率

0.00%

发文量