An Efficient Ensemble Learning Model Integrating Multi-Branch Sub-Networks for Facial Expression Recognition

IF 1.2 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Cognitive Computation and Systems Pub Date : 2025-06-10 DOI:10.1049/ccs2.70000

Golam Jilani, Samara Paul, Sadia Sultana

{"title":"An Efficient Ensemble Learning Model Integrating Multi-Branch Sub-Networks for Facial Expression Recognition","authors":"Golam Jilani, Samara Paul, Sadia Sultana","doi":"10.1049/ccs2.70000","DOIUrl":null,"url":null,"abstract":"<p>Accurate facial expression recognition is still challenging due to occlusion and location variability. Reducing computing overhead is also important because facial expression detection systems may be used in real-time applications. This research provides an effective ensemble learning architecture for facial emotion identification using advanced data augmentation and transfer learning techniques. The architecture uses a multi-branch sub-network framework. We chose the EfficientNet-B0, RegNet_Y_800MF and MobileNetV2 for ensembling because they are significantly smaller in terms of FLOPs and number of parameters than other variations, such as the EfficientNet-B7 and RegNet_Y_800MF. We included data augmentation methods such as Mixup and CutMix to make our system more resilient to overfitting. As demonstrated by our proposed approach, combining smaller models is more efficient than using a single large model. The proposed architecture achieves state-of-the-art results with an accuracy of 96.42% and 97.55% on the SUFEDB and KDEF datasets, respectively.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"7 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.70000","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation and Systems","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.70000","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate facial expression recognition is still challenging due to occlusion and location variability. Reducing computing overhead is also important because facial expression detection systems may be used in real-time applications. This research provides an effective ensemble learning architecture for facial emotion identification using advanced data augmentation and transfer learning techniques. The architecture uses a multi-branch sub-network framework. We chose the EfficientNet-B0, RegNet_Y_800MF and MobileNetV2 for ensembling because they are significantly smaller in terms of FLOPs and number of parameters than other variations, such as the EfficientNet-B7 and RegNet_Y_800MF. We included data augmentation methods such as Mixup and CutMix to make our system more resilient to overfitting. As demonstrated by our proposed approach, combining smaller models is more efficient than using a single large model. The proposed architecture achieves state-of-the-art results with an accuracy of 96.42% and 97.55% on the SUFEDB and KDEF datasets, respectively.

Abstract Image

查看原文本刊更多论文

基于多分支子网络的面部表情识别高效集成学习模型

由于遮挡和位置的变化，准确的面部表情识别仍然具有挑战性。减少计算开销也很重要，因为面部表情检测系统可以用于实时应用。本研究利用先进的数据增强和迁移学习技术，为面部情绪识别提供了一个有效的集成学习架构。该体系结构使用多分支子网框架。我们选择了效率网- b0、RegNet_Y_800MF和MobileNetV2进行集成，因为它们在FLOPs和参数数量方面明显小于其他版本，如效率网- b7和RegNet_Y_800MF。我们加入了数据增强方法，如Mixup和CutMix，使我们的系统对过拟合更有弹性。正如我们提出的方法所证明的那样，组合较小的模型比使用单个大模型更有效。所提出的体系结构在SUFEDB和KDEF数据集上分别获得了96.42%和97.55%的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊