Williams Contreras-Higuera, Lucrezia Crescenzi-Lanna
{"title":"The Role of Time in Facial Dynamics and Challenges in Automatic Emotion Recognition (2019–2024)","authors":"Williams Contreras-Higuera, Lucrezia Crescenzi-Lanna","doi":"10.1155/hbe2/7777949","DOIUrl":null,"url":null,"abstract":"<p>Based on a comprehensive literature review, this study highlights the critical role of the temporal dimension of facial dynamics in understanding facial expressions and improving the accuracy and robustness of automatic emotion recognition systems (machine-FER). While deep learning (DL) techniques like convolutional neural networks (CNNs) and long short-term memory (LSTM) networks offer significant advances, they face challenges such as gradient vanishing and overfitting, particularly in long and complex sequences. Vision transformers (ViTs) show promise but require integration with algorithms to mitigate spatial noise. Conventional machine learning (CML) methods like support vector machine (SVM) remain robust, especially in smaller datasets. The study underscores the importance of multimodal data synchronization (e.g., video, voice) in classifying emotions more accurately, reflecting both human and machine learning capabilities. It also addresses the limitations of current models, including cultural biases and the need for large, diverse datasets. The findings suggest that future research should focus on real-world conditions, integrating sequential multimodal data and employing supervised models based on theoretical and empirical frameworks. This approach is aimed at enhancing the understanding and classification of facial emotions, ensuring data quality and acceptable results through systematic human observations. The study provides valuable insights for selecting appropriate algorithms that are tailored to specific research objectives and contexts.</p>","PeriodicalId":36408,"journal":{"name":"Human Behavior and Emerging Technologies","volume":"2025 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/hbe2/7777949","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Behavior and Emerging Technologies","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/hbe2/7777949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Based on a comprehensive literature review, this study highlights the critical role of the temporal dimension of facial dynamics in understanding facial expressions and improving the accuracy and robustness of automatic emotion recognition systems (machine-FER). While deep learning (DL) techniques like convolutional neural networks (CNNs) and long short-term memory (LSTM) networks offer significant advances, they face challenges such as gradient vanishing and overfitting, particularly in long and complex sequences. Vision transformers (ViTs) show promise but require integration with algorithms to mitigate spatial noise. Conventional machine learning (CML) methods like support vector machine (SVM) remain robust, especially in smaller datasets. The study underscores the importance of multimodal data synchronization (e.g., video, voice) in classifying emotions more accurately, reflecting both human and machine learning capabilities. It also addresses the limitations of current models, including cultural biases and the need for large, diverse datasets. The findings suggest that future research should focus on real-world conditions, integrating sequential multimodal data and employing supervised models based on theoretical and empirical frameworks. This approach is aimed at enhancing the understanding and classification of facial emotions, ensuring data quality and acceptable results through systematic human observations. The study provides valuable insights for selecting appropriate algorithms that are tailored to specific research objectives and contexts.
期刊介绍:
Human Behavior and Emerging Technologies is an interdisciplinary journal dedicated to publishing high-impact research that enhances understanding of the complex interactions between diverse human behavior and emerging digital technologies.