Attribute-Enhanced Face Recognition with Neural Tensor Fusion Networks

2017 IEEE International Conference on Computer Vision (ICCV) Pub Date : 2017-12-25 DOI:10.1109/ICCV.2017.404

Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, S. Mukherjee, Timothy M. Hospedales, N. Robertson, Yongxin Yang

{"title":"Attribute-Enhanced Face Recognition with Neural Tensor Fusion Networks","authors":"Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, S. Mukherjee, Timothy M. Hospedales, N. Robertson, Yongxin Yang","doi":"10.1109/ICCV.2017.404","DOIUrl":null,"url":null,"abstract":"Deep learning has achieved great success in face recognition, however deep-learned features still have limited invariance to strong intra-personal variations such as large pose changes. It is observed that some facial attributes (e.g. eyebrow thickness, gender) are robust to such variations. We present the first work to systematically explore how the fusion of face recognition features (FRF) and facial attribute features (FAF) can enhance face recognition performance in various challenging scenarios. Despite the promise of FAF, we find that in practice existing fusion methods fail to leverage FAF to boost face recognition performance in some challenging scenarios. Thus, we develop a powerful tensor-based framework which formulates feature fusion as a tensor optimisation problem. It is nontrivial to directly optimise this tensor due to the large number of parameters to optimise. To solve this problem, we establish a theoretical equivalence between low-rank tensor optimisation and a two-stream gated neural network. This equivalence allows tractable learning using standard neural network optimisation tools, leading to accurate and stable optimisation. Experimental results show the fused feature works better than individual features, thus proving for the first time that facial attributes aid face recognition. We achieve state-of-the-art performance on three popular databases: MultiPIE (cross pose, lighting and expression), CASIA NIR-VIS2.0 (cross-modality environment) and LFW (uncontrolled environment).","PeriodicalId":6559,"journal":{"name":"2017 IEEE International Conference on Computer Vision (ICCV)","volume":"31 1","pages":"3764-3773"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"67","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Computer Vision (ICCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV.2017.404","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 67

Abstract

Deep learning has achieved great success in face recognition, however deep-learned features still have limited invariance to strong intra-personal variations such as large pose changes. It is observed that some facial attributes (e.g. eyebrow thickness, gender) are robust to such variations. We present the first work to systematically explore how the fusion of face recognition features (FRF) and facial attribute features (FAF) can enhance face recognition performance in various challenging scenarios. Despite the promise of FAF, we find that in practice existing fusion methods fail to leverage FAF to boost face recognition performance in some challenging scenarios. Thus, we develop a powerful tensor-based framework which formulates feature fusion as a tensor optimisation problem. It is nontrivial to directly optimise this tensor due to the large number of parameters to optimise. To solve this problem, we establish a theoretical equivalence between low-rank tensor optimisation and a two-stream gated neural network. This equivalence allows tractable learning using standard neural network optimisation tools, leading to accurate and stable optimisation. Experimental results show the fused feature works better than individual features, thus proving for the first time that facial attributes aid face recognition. We achieve state-of-the-art performance on three popular databases: MultiPIE (cross pose, lighting and expression), CASIA NIR-VIS2.0 (cross-modality environment) and LFW (uncontrolled environment).

查看原文本刊更多论文

基于神经张量融合网络的属性增强人脸识别

深度学习在人脸识别方面取得了很大的成功，但是深度学习的特征对强烈的个人内部变化(如大的姿势变化)的不变性仍然有限。可以观察到，某些面部属性(如眉毛粗细、性别)对这种变化具有鲁棒性。我们首次系统地探讨了人脸识别特征(FRF)和人脸属性特征(FAF)的融合如何在各种具有挑战性的场景中提高人脸识别性能。尽管FAF很有前景，但我们发现在实践中，现有的融合方法无法在一些具有挑战性的场景中利用FAF来提高人脸识别性能。因此，我们开发了一个强大的基于张量的框架，它将特征融合表述为张量优化问题。由于需要优化的参数很多，直接优化这个张量是不平凡的。为了解决这个问题，我们建立了低秩张量优化和双流门控神经网络之间的理论等价。这种等效性允许使用标准神经网络优化工具进行可处理的学习，从而实现准确和稳定的优化。实验结果表明，融合特征比单个特征效果更好，首次证明了人脸属性对人脸识别的辅助作用。我们在三个流行的数据库上实现了最先进的性能:MultiPIE(交叉姿势，灯光和表情)，CASIA NIR-VIS2.0(交叉模态环境)和LFW(非受控环境)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE International Conference on Computer Vision (ICCV)

自引率

0.00%

发文量