Optimizing Class Imbalance in Facial Expression Recognition Using Dynamic Intra-Class Clustering.

IF 3.4 3区医学 Q1 ENGINEERING, MULTIDISCIPLINARY

Biomimetics Pub Date : 2025-05-08 DOI:10.3390/biomimetics10050296

Qingdu Li, Keting Fu, Jian Liu, Yishan Li, Qinze Ren, Kang Xu, Junxiu Fu, Na Liu, Ye Yuan

{"title":"Optimizing Class Imbalance in Facial Expression Recognition Using Dynamic Intra-Class Clustering.","authors":"Qingdu Li, Keting Fu, Jian Liu, Yishan Li, Qinze Ren, Kang Xu, Junxiu Fu, Na Liu, Ye Yuan","doi":"10.3390/biomimetics10050296","DOIUrl":null,"url":null,"abstract":"<p><p>While deep neural networks demonstrate robust performance in visual tasks, the long-tail distribution of real-world data leads to significant recognition accuracy degradation in critical scenarios such as medical human-robot affective interaction, particularly the misidentification of low-frequency negative emotions (e.g., fear and disgust) that may trigger psychological resistance in patients. Here, we propose a method based on dynamic intra-class clustering (DICC) to optimize the class imbalance problem in facial expression recognition tasks. The DICC method dynamically adjusts the distribution of majority classes by clustering them into subclasses and generating pseudo-labels, which helps the model learn more discriminative features and improve classification accuracy. By comparing with existing methods, we demonstrate that the DICC method can help the model achieve superior performance across various facial expression datasets. In this study, we conducted an in-depth evaluation of the DICC method against baseline methods using the FER2013, MMAFEDB, and Emotion-Domestic datasets, achieving improvements in classification accuracy of 1.73%, 1.97%, and 5.48%, respectively. This indicates that the DICC method can effectively enhance classification precision, especially in the recognition of minority class samples. This approach provides a novel perspective for addressing the class imbalance challenge in facial expression recognition and offers a reference for future research and applications in related fields.</p>","PeriodicalId":8907,"journal":{"name":"Biomimetics","volume":"10 5","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12109554/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomimetics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/biomimetics10050296","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

While deep neural networks demonstrate robust performance in visual tasks, the long-tail distribution of real-world data leads to significant recognition accuracy degradation in critical scenarios such as medical human-robot affective interaction, particularly the misidentification of low-frequency negative emotions (e.g., fear and disgust) that may trigger psychological resistance in patients. Here, we propose a method based on dynamic intra-class clustering (DICC) to optimize the class imbalance problem in facial expression recognition tasks. The DICC method dynamically adjusts the distribution of majority classes by clustering them into subclasses and generating pseudo-labels, which helps the model learn more discriminative features and improve classification accuracy. By comparing with existing methods, we demonstrate that the DICC method can help the model achieve superior performance across various facial expression datasets. In this study, we conducted an in-depth evaluation of the DICC method against baseline methods using the FER2013, MMAFEDB, and Emotion-Domestic datasets, achieving improvements in classification accuracy of 1.73%, 1.97%, and 5.48%, respectively. This indicates that the DICC method can effectively enhance classification precision, especially in the recognition of minority class samples. This approach provides a novel perspective for addressing the class imbalance challenge in facial expression recognition and offers a reference for future research and applications in related fields.

查看原文本刊更多论文

基于动态类内聚类优化面部表情识别中的类不平衡。

虽然深度神经网络在视觉任务中表现出强大的性能，但现实世界数据的长尾分布导致在关键场景（如医疗人机情感交互）中识别精度显著下降，特别是对低频负面情绪（如恐惧和厌恶）的错误识别，这可能引发患者的心理抵抗。在此，我们提出了一种基于动态类内聚类（DICC）的方法来优化面部表情识别任务中的类不平衡问题。DICC方法通过将多数类聚类为子类并生成伪标签来动态调整多数类的分布，有助于模型学习更多的判别特征，提高分类精度。通过与现有方法的比较，我们证明了DICC方法可以帮助模型在各种面部表情数据集上获得更好的性能。在本研究中，我们使用FER2013、MMAFEDB和情绪-国内数据集对DICC方法与基线方法进行了深入评估，分类准确率分别提高了1.73%、1.97%和5.48%。这表明DICC方法可以有效地提高分类精度，特别是对少数类样本的识别。该方法为解决面部表情识别中的类不平衡挑战提供了一个新的视角，为今后相关领域的研究和应用提供了参考。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊