Residual Neural Network and Wing Loss for Face Alignment Network

Li Wang, Wei Xiang
{"title":"Residual Neural Network and Wing Loss for Face Alignment Network","authors":"Li Wang, Wei Xiang","doi":"10.1109/ISKE47853.2019.9170374","DOIUrl":null,"url":null,"abstract":"Face recognition has made great progress due to the development of convolutional neural network (CNN), and face alignment is an important part of recognition, it is easily affected by gestures and occlusion. In this paper, we propose Residual Neural Network and Wing Loss for Face Alignment Network (RWAN), which consists of a plurality of stages, each stage ameliorates the position of facial landmarks estimated from the previous stage. Unlike the traditional cascading model, the network model uses the Residual Neural Network, which is easily to optimize and can improve accuracy by adding considerable depth, the internal residual block uses shortcut to alleviate the gradient explosion problem caused by increasing depth in deep neural network. Enter the face images and extract features from the entire image by introducing landmark heatmaps to obtain more accurate positioning. Using wing loss in the loss function part not only focuses on the landmark of large error points, but also on the small and medium errors of landmarks, it aims to improve the training ability of deep neural network with small and medium range error. The problem of unbalanced data not only confuses classification tasks, but also affects the accuracy of the model when face samples of different attitudes are not balanced in face key detection tasks. a simple but effective data enhancement method is proposed to deal with the problem, which solves the problem of unbalanced data processing by randomly rotating the training samples, amplifying and the like. The experimental results obtained by our method on the 300W dataset indicate the advantages of this method.","PeriodicalId":399084,"journal":{"name":"2019 IEEE 14th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 14th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISKE47853.2019.9170374","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Face recognition has made great progress due to the development of convolutional neural network (CNN), and face alignment is an important part of recognition, it is easily affected by gestures and occlusion. In this paper, we propose Residual Neural Network and Wing Loss for Face Alignment Network (RWAN), which consists of a plurality of stages, each stage ameliorates the position of facial landmarks estimated from the previous stage. Unlike the traditional cascading model, the network model uses the Residual Neural Network, which is easily to optimize and can improve accuracy by adding considerable depth, the internal residual block uses shortcut to alleviate the gradient explosion problem caused by increasing depth in deep neural network. Enter the face images and extract features from the entire image by introducing landmark heatmaps to obtain more accurate positioning. Using wing loss in the loss function part not only focuses on the landmark of large error points, but also on the small and medium errors of landmarks, it aims to improve the training ability of deep neural network with small and medium range error. The problem of unbalanced data not only confuses classification tasks, but also affects the accuracy of the model when face samples of different attitudes are not balanced in face key detection tasks. a simple but effective data enhancement method is proposed to deal with the problem, which solves the problem of unbalanced data processing by randomly rotating the training samples, amplifying and the like. The experimental results obtained by our method on the 300W dataset indicate the advantages of this method.
面对齐网络的残差神经网络和翼损
由于卷积神经网络(CNN)的发展,人脸识别取得了很大的进步,而人脸对齐是识别的重要组成部分,它容易受到手势和遮挡的影响。在本文中,我们提出了残差神经网络和机翼损失的人脸对齐网络(RWAN),该网络由多个阶段组成,每个阶段都改善了从前一阶段估计的面部标志的位置。与传统的级联模型不同,该网络模型采用残差神经网络,易于优化,通过增加相当大的深度可以提高精度,内部残差块采用捷径,缓解了深度神经网络中增加深度引起的梯度爆炸问题。输入人脸图像,通过引入地标热图从整个图像中提取特征,以获得更准确的定位。在损失函数部分使用机翼损失,既关注大误差点的地标,也关注地标的中小型误差,旨在提高具有中小型误差的深度神经网络的训练能力。在人脸关键字检测任务中,当不同态度的人脸样本不平衡时,数据不平衡问题不仅会混淆分类任务,而且会影响模型的准确性。针对这一问题,提出了一种简单有效的数据增强方法,通过随机旋转训练样本、放大等方法解决了数据处理不平衡的问题。在300W数据集上的实验结果表明了该方法的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信