学习用于鲁棒RGBD人脸识别的多模态3D人脸嵌入

Journal of Integrated and Advanced Engineering (JIAE) Pub Date : 2023-03-09 DOI:10.51662/jiae.v3i1.84

Ahmed Rimaz Faizabadi, Hasan Firdaus Mohd Zaki, Z. Zainal Abidin, M. A. Husman, Nik Nur Wahidah Nik Hashim

{"title":"学习用于鲁棒RGBD人脸识别的多模态3D人脸嵌入","authors":"Ahmed Rimaz Faizabadi, Hasan Firdaus Mohd Zaki, Z. Zainal Abidin, M. A. Husman, Nik Nur Wahidah Nik Hashim","doi":"10.51662/jiae.v3i1.84","DOIUrl":null,"url":null,"abstract":"Machine vision will play a significant role in the next generation of IR 4.0 systems. Recognition and analysis of faces are essential in many vision-based applications. Deep Learning provides the thrust for the advancement in visual recognition. An important tool for visual recognition tasks is Convolution Neural networks (CNN). However, the 2D methods for machine vision suffer from Pose, Illumination, and Expression (PIE) challenges and occlusions. The 3D Race Recognition (3DFR) is very promising for dealing with PIE and a certain degree of occlusions and is suitable for unconstrained environments. However, the 3D data is highly irregular, affecting the performance of deep networks. Most of the 3D Face recognition models are implemented from a research aspect and rarely find a complete 3DFR application. This work attempts to implement a complete end-to-end robust 3DFR pipeline. For this purpose, we implemented a CuteFace3D. This face recognition model is trained on the most challenging dataset, where the state-of-the-art model had below 95% accuracy. An accuracy of 98.89% is achieved on the intellifusion test dataset. Further, for open world and unseen domain adaptation, embeddings learning is achieved using KNN. Then a complete FR pipeline for RGBD face recognition is implemented using a RealSense D435 depth camera. With the KNN classifier and k-fold validation, we achieved 99.997% for the open set RGBD pipeline on registered users. The proposed method with early fusion four-channel input is found to be more robust and has achieved higher accuracy in the benchmark dataset.","PeriodicalId":424190,"journal":{"name":"Journal of Integrated and Advanced Engineering (JIAE)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Learning a Multimodal 3D Face Embedding for Robust RGBD Face Recognition\",\"authors\":\"Ahmed Rimaz Faizabadi, Hasan Firdaus Mohd Zaki, Z. Zainal Abidin, M. A. Husman, Nik Nur Wahidah Nik Hashim\",\"doi\":\"10.51662/jiae.v3i1.84\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine vision will play a significant role in the next generation of IR 4.0 systems. Recognition and analysis of faces are essential in many vision-based applications. Deep Learning provides the thrust for the advancement in visual recognition. An important tool for visual recognition tasks is Convolution Neural networks (CNN). However, the 2D methods for machine vision suffer from Pose, Illumination, and Expression (PIE) challenges and occlusions. The 3D Race Recognition (3DFR) is very promising for dealing with PIE and a certain degree of occlusions and is suitable for unconstrained environments. However, the 3D data is highly irregular, affecting the performance of deep networks. Most of the 3D Face recognition models are implemented from a research aspect and rarely find a complete 3DFR application. This work attempts to implement a complete end-to-end robust 3DFR pipeline. For this purpose, we implemented a CuteFace3D. This face recognition model is trained on the most challenging dataset, where the state-of-the-art model had below 95% accuracy. An accuracy of 98.89% is achieved on the intellifusion test dataset. Further, for open world and unseen domain adaptation, embeddings learning is achieved using KNN. Then a complete FR pipeline for RGBD face recognition is implemented using a RealSense D435 depth camera. With the KNN classifier and k-fold validation, we achieved 99.997% for the open set RGBD pipeline on registered users. The proposed method with early fusion four-channel input is found to be more robust and has achieved higher accuracy in the benchmark dataset.\",\"PeriodicalId\":424190,\"journal\":{\"name\":\"Journal of Integrated and Advanced Engineering (JIAE)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Integrated and Advanced Engineering (JIAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.51662/jiae.v3i1.84\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Integrated and Advanced Engineering (JIAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.51662/jiae.v3i1.84","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

机器视觉将在下一代工业革命4.0系统中发挥重要作用。人脸识别和分析在许多基于视觉的应用中是必不可少的。深度学习为视觉识别的进步提供了推动力。卷积神经网络(CNN)是视觉识别任务的一个重要工具。然而，用于机器视觉的2D方法受到姿势，照明和表达(PIE)挑战和遮挡的影响。3D种族识别(3DFR)在处理PIE和一定程度的遮挡方面非常有前途，适用于无约束的环境。然而，三维数据是高度不规则的，影响了深度网络的性能。大多数3D人脸识别模型都是从研究的角度来实现的，很少有完整的3D人脸识别应用。这项工作试图实现一个完整的端到端健壮的3DFR管道。为此，我们实现了一个CuteFace3D。这个人脸识别模型是在最具挑战性的数据集上训练的，其中最先进的模型的准确率低于95%。在智力测试数据集上，准确率达到98.89%。此外，对于开放世界和未知领域的自适应，使用KNN实现了嵌入学习。然后，使用RealSense D435深度相机实现了RGBD人脸识别的完整FR流水线。使用KNN分类器和k-fold验证，我们在注册用户上实现了99.997%的开放集RGBD管道。实验结果表明，采用早期融合四通道输入的方法具有更好的鲁棒性和更高的精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning a Multimodal 3D Face Embedding for Robust RGBD Face Recognition

Machine vision will play a significant role in the next generation of IR 4.0 systems. Recognition and analysis of faces are essential in many vision-based applications. Deep Learning provides the thrust for the advancement in visual recognition. An important tool for visual recognition tasks is Convolution Neural networks (CNN). However, the 2D methods for machine vision suffer from Pose, Illumination, and Expression (PIE) challenges and occlusions. The 3D Race Recognition (3DFR) is very promising for dealing with PIE and a certain degree of occlusions and is suitable for unconstrained environments. However, the 3D data is highly irregular, affecting the performance of deep networks. Most of the 3D Face recognition models are implemented from a research aspect and rarely find a complete 3DFR application. This work attempts to implement a complete end-to-end robust 3DFR pipeline. For this purpose, we implemented a CuteFace3D. This face recognition model is trained on the most challenging dataset, where the state-of-the-art model had below 95% accuracy. An accuracy of 98.89% is achieved on the intellifusion test dataset. Further, for open world and unseen domain adaptation, embeddings learning is achieved using KNN. Then a complete FR pipeline for RGBD face recognition is implemented using a RealSense D435 depth camera. With the KNN classifier and k-fold validation, we achieved 99.997% for the open set RGBD pipeline on registered users. The proposed method with early fusion four-channel input is found to be more robust and has achieved higher accuracy in the benchmark dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Integrated and Advanced Engineering (JIAE)

自引率

0.00%

发文量