Fuli Zhang , Yu Liu , Xiaoling Yu , Zhichen Wang , Qi Zhang , Jing Wang , Qionghua Zhang
{"title":"利用改进的多模态集合学习方法实现面部微表情检测和分类","authors":"Fuli Zhang , Yu Liu , Xiaoling Yu , Zhichen Wang , Qi Zhang , Jing Wang , Qionghua Zhang","doi":"10.1016/j.inffus.2024.102735","DOIUrl":null,"url":null,"abstract":"<div><div>A micro-expression is a fleeting, delicate and localized facial gesture. It can expose the true feelings that someone is trying to hide and is seen to be a crucial indicator for spotting lies. Because of its possible applications in a variety of sectors, micro-expression research has garnered a lot of attention. The accuracy of micro-expression recognition still needs to be improved, though, because of the brief and weak motions that make up micro-expressions. In recent years, Deep convolution neural methods have depicted a higher degree of efficiency for complex challenge of face detection. Although several attempts were made for micro-expression recognition (MER), the problem is far from being resolved problem which is portrayed by the lowest accuracy rate depicted by the other models. In this study, present a Facial Micro-Expression Detection and Classification using Modified Multimodal Ensemble Learning (FMEDC-MMEL) approach. The major intention of the FMEDC-MMEL technique lies in the proficient identification of MEs that exist in the facial images. As a pre-processing phase, the FMEDC-MMEL technique exploits histogram equalization (HE) approach to improve the contrast level of the image. In the FMEDC-MMEL technique, improved densely connected networks (DenseNet) model is used for learning feature patterns from the pre-processed images. To enhance the proficiency of the improved DenseNet model, stochastic gradient descent (SGD) approach is used for hyperparameter selection process. For facial ME detection, the FMEDC-MMEL technique follows an ensemble of three classifiers namely bi-directional gated recurrent unit (Bi-GRU), long short-term memory (LSTM) and extreme learning machine (ELM). A tailored ensemble learning approach is shown, which combines many machine learning models to improve classification performance and detection accuracy. Sophisticated feature extraction methods are utilized to extract the subtle aspects of micro-expressions, and precision is maintained by optimizations that minimize computing cost. Empirical findings reveal that this methodology notably surpasses conventional techniques, providing enhanced precision and resilience on a variety of complex and demanding datasets. In addition to pushing the boundaries of micro-expression analysis research, the proposed strategy has potential uses in the real world in fields including security, psychology testing, and human-computer interaction.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"115 ","pages":"Article 102735"},"PeriodicalIF":14.7000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards facial micro-expression detection and classification using modified multimodal ensemble learning approach\",\"authors\":\"Fuli Zhang , Yu Liu , Xiaoling Yu , Zhichen Wang , Qi Zhang , Jing Wang , Qionghua Zhang\",\"doi\":\"10.1016/j.inffus.2024.102735\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A micro-expression is a fleeting, delicate and localized facial gesture. It can expose the true feelings that someone is trying to hide and is seen to be a crucial indicator for spotting lies. Because of its possible applications in a variety of sectors, micro-expression research has garnered a lot of attention. The accuracy of micro-expression recognition still needs to be improved, though, because of the brief and weak motions that make up micro-expressions. In recent years, Deep convolution neural methods have depicted a higher degree of efficiency for complex challenge of face detection. Although several attempts were made for micro-expression recognition (MER), the problem is far from being resolved problem which is portrayed by the lowest accuracy rate depicted by the other models. In this study, present a Facial Micro-Expression Detection and Classification using Modified Multimodal Ensemble Learning (FMEDC-MMEL) approach. The major intention of the FMEDC-MMEL technique lies in the proficient identification of MEs that exist in the facial images. As a pre-processing phase, the FMEDC-MMEL technique exploits histogram equalization (HE) approach to improve the contrast level of the image. In the FMEDC-MMEL technique, improved densely connected networks (DenseNet) model is used for learning feature patterns from the pre-processed images. To enhance the proficiency of the improved DenseNet model, stochastic gradient descent (SGD) approach is used for hyperparameter selection process. For facial ME detection, the FMEDC-MMEL technique follows an ensemble of three classifiers namely bi-directional gated recurrent unit (Bi-GRU), long short-term memory (LSTM) and extreme learning machine (ELM). A tailored ensemble learning approach is shown, which combines many machine learning models to improve classification performance and detection accuracy. Sophisticated feature extraction methods are utilized to extract the subtle aspects of micro-expressions, and precision is maintained by optimizations that minimize computing cost. Empirical findings reveal that this methodology notably surpasses conventional techniques, providing enhanced precision and resilience on a variety of complex and demanding datasets. In addition to pushing the boundaries of micro-expression analysis research, the proposed strategy has potential uses in the real world in fields including security, psychology testing, and human-computer interaction.</div></div>\",\"PeriodicalId\":50367,\"journal\":{\"name\":\"Information Fusion\",\"volume\":\"115 \",\"pages\":\"Article 102735\"},\"PeriodicalIF\":14.7000,\"publicationDate\":\"2024-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Fusion\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S156625352400513X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156625352400513X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
微表情是一种稍纵即逝、细腻而局部的面部动作。它可以暴露一个人试图隐藏的真实情感,被视为识破谎言的关键指标。由于微表情可能应用于多个领域,因此微表情研究受到了广泛关注。不过,由于微表情的动作短暂而微弱,因此微表情识别的准确性仍有待提高。近年来,深度卷积神经方法在应对复杂的人脸检测挑战时表现出了更高的效率。虽然人们对微表情识别(MER)进行了多次尝试,但问题远未解决,其他模型的准确率最低就说明了这一点。本研究提出了一种使用修正多模态集合学习(FMEDC-MMEL)的面部微表情检测和分类方法。FMEDC-MMEL 技术的主要目的在于熟练识别面部图像中存在的微表情。作为预处理阶段,FMEDC-MMEL 技术利用直方图均衡化(HE)方法来提高图像的对比度。在 FMEDC-MMEL 技术中,改进的密集连接网络(DenseNet)模型用于从预处理图像中学习特征模式。为了提高改进型 DenseNet 模型的能力,在超参数选择过程中使用了随机梯度下降(SGD)方法。对于面部 ME 检测,FMEDC-MMEL 技术采用了三种分类器的集合,即双向门控递归单元(Bi-GRU)、长短期记忆(LSTM)和极端学习机(ELM)。图中展示了一种量身定制的集合学习方法,它结合了多种机器学习模型,以提高分类性能和检测准确率。复杂的特征提取方法用于提取微表情的细微特征,并通过优化计算成本来保持精确度。实证研究结果表明,这种方法明显超越了传统技术,在各种复杂和高要求的数据集上提供了更高的精度和弹性。除了推动微表情分析研究的发展,所提出的策略在现实世界中还有潜在用途,包括安全、心理测试和人机交互等领域。
Towards facial micro-expression detection and classification using modified multimodal ensemble learning approach
A micro-expression is a fleeting, delicate and localized facial gesture. It can expose the true feelings that someone is trying to hide and is seen to be a crucial indicator for spotting lies. Because of its possible applications in a variety of sectors, micro-expression research has garnered a lot of attention. The accuracy of micro-expression recognition still needs to be improved, though, because of the brief and weak motions that make up micro-expressions. In recent years, Deep convolution neural methods have depicted a higher degree of efficiency for complex challenge of face detection. Although several attempts were made for micro-expression recognition (MER), the problem is far from being resolved problem which is portrayed by the lowest accuracy rate depicted by the other models. In this study, present a Facial Micro-Expression Detection and Classification using Modified Multimodal Ensemble Learning (FMEDC-MMEL) approach. The major intention of the FMEDC-MMEL technique lies in the proficient identification of MEs that exist in the facial images. As a pre-processing phase, the FMEDC-MMEL technique exploits histogram equalization (HE) approach to improve the contrast level of the image. In the FMEDC-MMEL technique, improved densely connected networks (DenseNet) model is used for learning feature patterns from the pre-processed images. To enhance the proficiency of the improved DenseNet model, stochastic gradient descent (SGD) approach is used for hyperparameter selection process. For facial ME detection, the FMEDC-MMEL technique follows an ensemble of three classifiers namely bi-directional gated recurrent unit (Bi-GRU), long short-term memory (LSTM) and extreme learning machine (ELM). A tailored ensemble learning approach is shown, which combines many machine learning models to improve classification performance and detection accuracy. Sophisticated feature extraction methods are utilized to extract the subtle aspects of micro-expressions, and precision is maintained by optimizations that minimize computing cost. Empirical findings reveal that this methodology notably surpasses conventional techniques, providing enhanced precision and resilience on a variety of complex and demanding datasets. In addition to pushing the boundaries of micro-expression analysis research, the proposed strategy has potential uses in the real world in fields including security, psychology testing, and human-computer interaction.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.