A New Design of Occlusion-Invariant Face Recognition Using Optimal Pattern Extraction and CNN with GRU-Based Architecture

Int. J. Image Graph. Pub Date : 2022-05-11 DOI:10.1142/s0219467823500298

Pankaj, P. K. Bharti, B. Kumar

{"title":"A New Design of Occlusion-Invariant Face Recognition Using Optimal Pattern Extraction and CNN with GRU-Based Architecture","authors":"Pankaj, P. K. Bharti, B. Kumar","doi":"10.1142/s0219467823500298","DOIUrl":null,"url":null,"abstract":"Face detection is a computer technology being used in a variety of applications that identify human faces in digital images. In many face recognition challenges, Convolutional Neural Networks (CNNs) are regarded as a problem solver. Occlusion is determined as the most common challenge of face recognition in realistic applications. Several studies are undergoing to obtain face recognition without any challenges. However, the occurrence of noise and occlusion in the image reduces the achievement of face recognition. Hence, various researches and studies are carried out to solve the challenges involved with the occurrence of occlusion and noise in the image, and more clarification is needed to acquire high accuracy. Hence, a deep learning model is intended to be developed in this paper using the meta-heuristic approach. The proposed model covers four main steps: (a) data acquisition, (b) pre-processing, (c) pattern extraction and (d) classification. The benchmark datasets regarding the face image with occlusion are gathered from a public source. Further, the pre-processing of the images is performed by contrast enhancement and Gabor filtering. With these pre-processed images, pattern extraction is done by the optimal local mesh ternary pattern. Here, the hybrid Whale–Galactic Swarm Optimization (WGSO) algorithm is used for developing the optimal local mesh ternary pattern extraction. By inputting the pattern-extracted image, the new deep learning model namely “CNN with Gated Recurrent Unit (GRU)” network performs the recognition process to maximize the accuracy, and also it is used to enhance the face recognition model. From the results, in terms of accuracy, the proposed WGSO-[Formula: see text] model is better by 4.02%, 3.76% and 2.17% than the CNN, SVM and SRC, respectively. The experimental results are presented by performing their comparative analysis on a standard dataset, and they assure the efficiency of the proposed model. However, many challenging problems related to face recognition still exist, which offer excellent opportunities to face recognition researchers in the future.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Image Graph.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219467823500298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Face detection is a computer technology being used in a variety of applications that identify human faces in digital images. In many face recognition challenges, Convolutional Neural Networks (CNNs) are regarded as a problem solver. Occlusion is determined as the most common challenge of face recognition in realistic applications. Several studies are undergoing to obtain face recognition without any challenges. However, the occurrence of noise and occlusion in the image reduces the achievement of face recognition. Hence, various researches and studies are carried out to solve the challenges involved with the occurrence of occlusion and noise in the image, and more clarification is needed to acquire high accuracy. Hence, a deep learning model is intended to be developed in this paper using the meta-heuristic approach. The proposed model covers four main steps: (a) data acquisition, (b) pre-processing, (c) pattern extraction and (d) classification. The benchmark datasets regarding the face image with occlusion are gathered from a public source. Further, the pre-processing of the images is performed by contrast enhancement and Gabor filtering. With these pre-processed images, pattern extraction is done by the optimal local mesh ternary pattern. Here, the hybrid Whale–Galactic Swarm Optimization (WGSO) algorithm is used for developing the optimal local mesh ternary pattern extraction. By inputting the pattern-extracted image, the new deep learning model namely “CNN with Gated Recurrent Unit (GRU)” network performs the recognition process to maximize the accuracy, and also it is used to enhance the face recognition model. From the results, in terms of accuracy, the proposed WGSO-[Formula: see text] model is better by 4.02%, 3.76% and 2.17% than the CNN, SVM and SRC, respectively. The experimental results are presented by performing their comparative analysis on a standard dataset, and they assure the efficiency of the proposed model. However, many challenging problems related to face recognition still exist, which offer excellent opportunities to face recognition researchers in the future.

查看原文本刊更多论文

一种基于最优模式提取和基于gru架构的CNN的无遮挡人脸识别新设计

人脸检测是一种计算机技术，用于识别数字图像中的人脸。在许多人脸识别挑战中，卷积神经网络(cnn)被认为是一个解决问题的方法。在现实应用中，遮挡被认为是人脸识别最常见的挑战。一些研究正在进行中，以获得无任何挑战的人脸识别。然而，图像中噪声和遮挡的出现降低了人脸识别的效果。因此，为了解决图像中遮挡和噪声的出现所带来的挑战，人们进行了各种各样的研究和研究，需要更多的澄清以获得更高的精度。因此，本文打算使用元启发式方法开发一个深度学习模型。提出的模型包括四个主要步骤:(a)数据采集，(b)预处理，(c)模式提取和(d)分类。关于遮挡的人脸图像的基准数据集是从公开来源收集的。进一步，通过对比度增强和Gabor滤波对图像进行预处理。利用这些预处理后的图像，利用最优的局部网格三元模式进行模式提取。本文采用鲸-银河群混合优化算法(WGSO)进行最优局部网格三元模式提取。通过输入模式提取后的图像，新的深度学习模型即“CNN with Gated Recurrent Unit (GRU)”网络进行识别过程，使识别精度最大化，并用于增强人脸识别模型。从结果来看，在准确率方面，本文提出的WGSO-[公式:见文本]模型比CNN、SVM和SRC分别提高4.02%、3.76%和2.17%。通过对标准数据集的对比分析，给出了实验结果，验证了所提模型的有效性。然而，人脸识别仍然存在许多具有挑战性的问题，这为未来的人脸识别研究提供了很好的机会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. Image Graph.

自引率

0.00%

发文量