{"title":"An Efficient Ensemble Learning Model Integrating Multi-Branch Sub-Networks for Facial Expression Recognition","authors":"Golam Jilani, Samara Paul, Sadia Sultana","doi":"10.1049/ccs2.70000","DOIUrl":null,"url":null,"abstract":"<p>Accurate facial expression recognition is still challenging due to occlusion and location variability. Reducing computing overhead is also important because facial expression detection systems may be used in real-time applications. This research provides an effective ensemble learning architecture for facial emotion identification using advanced data augmentation and transfer learning techniques. The architecture uses a multi-branch sub-network framework. We chose the EfficientNet-B0, RegNet_Y_800MF and MobileNetV2 for ensembling because they are significantly smaller in terms of FLOPs and number of parameters than other variations, such as the EfficientNet-B7 and RegNet_Y_800MF. We included data augmentation methods such as Mixup and CutMix to make our system more resilient to overfitting. As demonstrated by our proposed approach, combining smaller models is more efficient than using a single large model. The proposed architecture achieves state-of-the-art results with an accuracy of 96.42% and 97.55% on the SUFEDB and KDEF datasets, respectively.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"7 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.70000","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation and Systems","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.70000","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate facial expression recognition is still challenging due to occlusion and location variability. Reducing computing overhead is also important because facial expression detection systems may be used in real-time applications. This research provides an effective ensemble learning architecture for facial emotion identification using advanced data augmentation and transfer learning techniques. The architecture uses a multi-branch sub-network framework. We chose the EfficientNet-B0, RegNet_Y_800MF and MobileNetV2 for ensembling because they are significantly smaller in terms of FLOPs and number of parameters than other variations, such as the EfficientNet-B7 and RegNet_Y_800MF. We included data augmentation methods such as Mixup and CutMix to make our system more resilient to overfitting. As demonstrated by our proposed approach, combining smaller models is more efficient than using a single large model. The proposed architecture achieves state-of-the-art results with an accuracy of 96.42% and 97.55% on the SUFEDB and KDEF datasets, respectively.