多任务级联密集连接卷积网络在人脸检测和面部表情识别系统中的应用

Kuan-Yu Chou, Yi-Wen Cheng, Wei-Ren Chen, Yon-Ping Chen
{"title":"多任务级联密集连接卷积网络在人脸检测和面部表情识别系统中的应用","authors":"Kuan-Yu Chou, Yi-Wen Cheng, Wei-Ren Chen, Yon-Ping Chen","doi":"10.1109/CACS47674.2019.9024357","DOIUrl":null,"url":null,"abstract":"Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.","PeriodicalId":247039,"journal":{"name":"2019 International Automatic Control Conference (CACS)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Multi-task Cascaded and Densely Connected Convolutional Networks Applied to Human Face Detection and Facial Expression Recognition System\",\"authors\":\"Kuan-Yu Chou, Yi-Wen Cheng, Wei-Ren Chen, Yon-Ping Chen\",\"doi\":\"10.1109/CACS47674.2019.9024357\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.\",\"PeriodicalId\":247039,\"journal\":{\"name\":\"2019 International Automatic Control Conference (CACS)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Automatic Control Conference (CACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CACS47674.2019.9024357\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Automatic Control Conference (CACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CACS47674.2019.9024357","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

人脸检测与识别是计算机视觉和人机交互领域的一个重要课题和难点。近年来,随着深度学习的发展,人脸检测和面部表情识别(FER)的相关技术被提出,其中最突出的是卷积神经网络。本文将多任务级联卷积神经网络应用于人脸检测,设计了基于密集连接卷积网络(DenseNet)的实时人脸识别系统。该系统首先将输入图像缩放为图像金字塔,然后使用分层网络确定候选窗口是否包含人脸。如果存在,则将候选窗口发送到FER系统。由于DenseNet具有特征重用的特性,可以有效地减少参数量和计算量,有利于实时系统的开发。为了捕捉面部肌肉在不同表情下的变化,该架构采用步长为1的卷积运算,并尝试不同数量的密集块。通过实验,该系统可以在30FPS的速度下实现实时识别,识别精度优于人眼。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-task Cascaded and Densely Connected Convolutional Networks Applied to Human Face Detection and Facial Expression Recognition System
Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信