Face expression recognition based on improved convolutional neural network

International Conference on Artificial Intelligence and Pattern Recognition Pub Date : 2019-08-16 DOI:10.1145/3357254.3357275

Quanming Liu, Jing Zhang, Y. Xin

{"title":"Face expression recognition based on improved convolutional neural network","authors":"Quanming Liu, Jing Zhang, Y. Xin","doi":"10.1145/3357254.3357275","DOIUrl":null,"url":null,"abstract":"Aiming at the problems of huge parameters and network degradation caused by simple linear stacked convolution layers or continuous full connection layers in traditional expression recognition methods, two convolution neural network models are designed through depth separation convolution and residual module respectively to widen and deepen the network. Firstly, model A adopts depth separation convolution instead of regular convolution layer, and the global average pooling layer replaces the final full connection layer, utilizes the methods of dropout, batch normalization, activation function of PReLU and image augmentation to avoid over-fitting effectively. Model B adopts pre-trained ResNet50 model to extract facial features, magnifies the images twice by the SRGAN method. Using ensemble method to fuse model A and B, the accuracy is further improved. To verify the feasibility of the method, the model was tested on the FER2013 facial expression dataset, and the performance was compared with the other facial expression recognition algorithms. The final results showed the improved convolutional neural network (CNN) reached the advanced precision of 73.244% in FER2013 dataset, and the experiment data and the number of model parameters all proved the effectiveness of this method.","PeriodicalId":361892,"journal":{"name":"International Conference on Artificial Intelligence and Pattern Recognition","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3357254.3357275","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Aiming at the problems of huge parameters and network degradation caused by simple linear stacked convolution layers or continuous full connection layers in traditional expression recognition methods, two convolution neural network models are designed through depth separation convolution and residual module respectively to widen and deepen the network. Firstly, model A adopts depth separation convolution instead of regular convolution layer, and the global average pooling layer replaces the final full connection layer, utilizes the methods of dropout, batch normalization, activation function of PReLU and image augmentation to avoid over-fitting effectively. Model B adopts pre-trained ResNet50 model to extract facial features, magnifies the images twice by the SRGAN method. Using ensemble method to fuse model A and B, the accuracy is further improved. To verify the feasibility of the method, the model was tested on the FER2013 facial expression dataset, and the performance was compared with the other facial expression recognition algorithms. The final results showed the improved convolutional neural network (CNN) reached the advanced precision of 73.244% in FER2013 dataset, and the experiment data and the number of model parameters all proved the effectiveness of this method.

查看原文本刊更多论文

基于改进卷积神经网络的人脸表情识别

针对传统表情识别方法中简单的线性堆叠卷积层或连续全连接层导致的参数巨大和网络退化问题，分别通过深度分离卷积和残差模块设计了两种卷积神经网络模型，对网络进行了加宽和深化。首先，模型A采用深度分离卷积代替规则卷积层，用全局平均池化层代替最终的全连接层，利用dropout、批归一化、PReLU激活函数和图像增强等方法有效避免过拟合。模型B采用预训练的ResNet50模型提取人脸特征，通过SRGAN方法将图像放大2倍。采用集成方法对模型A和模型B进行融合，进一步提高了精度。为了验证该方法的可行性，在FER2013面部表情数据集上对该模型进行了测试，并与其他面部表情识别算法进行了性能比较。最终结果表明，改进后的卷积神经网络(CNN)在FER2013数据集中达到了73.244%的高级精度，实验数据和模型参数数量都证明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Artificial Intelligence and Pattern Recognition

自引率

0.00%

发文量