Design Methodology for Single-Channel CNN-Based FER Systems

Dorfell Parra, Carlos Camargo
{"title":"Design Methodology for Single-Channel CNN-Based FER Systems","authors":"Dorfell Parra, Carlos Camargo","doi":"10.1109/ICICT58900.2023.00022","DOIUrl":null,"url":null,"abstract":"Facial Expression Recognition (FER) systems classify emotions by using geometrical approaches or Machine Learning (ML) algorithms such as Convolutional Neural Networks (CNNs). Due to their complexity, these FER systems need to be implemented on high-performance hardware, which makes them unsuitable for embedded devices. To address this challenge, we propose a methodology for the design of low-complexity, CNN-based FER systems. Our methodology includes data preprocessing, Local Binary Pattern (LBP) implementation, Data Augmentation (DA), and CNN design. Here, we also introduce the Model M6, a single-channel CNN that reaches an accuracy of 94% in less than 30 epochs. M6 has 306,182 parameters that correspond to 1.17 MB of memory. Therefore, our methodology and M6 model are feasible for implementation onto embedded systems capable of computing floating point operations. We validated our methodology and M6 model using 66 tests with 6 CNN models and 4 training parameters (batch size, learning rate, number of epochs, optimizer). This validation was performed using the Japanese Female Facial Expression (JAFFE) dataset and TensorFlow. In each test, the relationship between parameters, layers, overfitting, and underfitting was studied. Moreover, we present a step-by-step guideline on how to design the single-channel CNN and provide open-source code for readers interested in reproducing our work.","PeriodicalId":425057,"journal":{"name":"2023 6th International Conference on Information and Computer Technologies (ICICT)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Conference on Information and Computer Technologies (ICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT58900.2023.00022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Facial Expression Recognition (FER) systems classify emotions by using geometrical approaches or Machine Learning (ML) algorithms such as Convolutional Neural Networks (CNNs). Due to their complexity, these FER systems need to be implemented on high-performance hardware, which makes them unsuitable for embedded devices. To address this challenge, we propose a methodology for the design of low-complexity, CNN-based FER systems. Our methodology includes data preprocessing, Local Binary Pattern (LBP) implementation, Data Augmentation (DA), and CNN design. Here, we also introduce the Model M6, a single-channel CNN that reaches an accuracy of 94% in less than 30 epochs. M6 has 306,182 parameters that correspond to 1.17 MB of memory. Therefore, our methodology and M6 model are feasible for implementation onto embedded systems capable of computing floating point operations. We validated our methodology and M6 model using 66 tests with 6 CNN models and 4 training parameters (batch size, learning rate, number of epochs, optimizer). This validation was performed using the Japanese Female Facial Expression (JAFFE) dataset and TensorFlow. In each test, the relationship between parameters, layers, overfitting, and underfitting was studied. Moreover, we present a step-by-step guideline on how to design the single-channel CNN and provide open-source code for readers interested in reproducing our work.
基于cnn的单通道FER系统设计方法
面部表情识别(FER)系统通过几何方法或机器学习(ML)算法(如卷积神经网络(cnn))对情绪进行分类。由于其复杂性,这些FER系统需要在高性能硬件上实现,这使得它们不适合嵌入式设备。为了应对这一挑战,我们提出了一种设计低复杂度、基于cnn的FER系统的方法。我们的方法包括数据预处理、局部二值模式(LBP)实现、数据增强(DA)和CNN设计。在这里,我们还介绍了M6模型,这是一种单通道CNN,在不到30个epoch的时间内达到94%的精度。M6有306,182个参数,对应于1.17 MB的内存。因此,我们的方法和M6模型在能够计算浮点运算的嵌入式系统上是可行的。我们使用6个CNN模型和4个训练参数(批大小、学习率、epoch数、优化器)进行了66次测试,验证了我们的方法和M6模型。该验证使用日本女性面部表情(JAFFE)数据集和TensorFlow进行。在每次测试中,研究了参数、层、过拟合和欠拟合之间的关系。此外,我们还提供了一个关于如何设计单通道CNN的逐步指南,并为有兴趣复制我们工作的读者提供了开源代码。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信