Exploring the Potential of A-ResNet in Person-Independent Face Recognition and Classification

International Journal of Advanced Network, Monitoring and Controls Pub Date : 2023-06-01 DOI:10.2478/ijanmc-2023-0052

A. Obaid, Aws Saad Shawkat, Nazar Salih Abdulhussein

{"title":"Exploring the Potential of A-ResNet in Person-Independent Face Recognition and Classification","authors":"A. Obaid, Aws Saad Shawkat, Nazar Salih Abdulhussein","doi":"10.2478/ijanmc-2023-0052","DOIUrl":null,"url":null,"abstract":"Abstract This study offers a novel face recognition and classification method based on classifiers that use statistical local features. The use of ResNet has generated growing interest in a variety of areas of image processing and computer vision in recent years and demonstrated its usefulness in several applications, especially for facial image analysis, which includes tasks as varied as face detection, face recognition, facial expression analysis, demographic classification, etc. This paper is divided into two steps i.e. face recognition and classification. The first step in face recognition is automatic data cleansing which is done with the help of Multi-Task Cascaded Convolutional Neural Networks (MTCNNs) and face.evoLVe, followed by parameter changes in MTCNN to prevent dirty data. The authors next trained two models: Inception-ResNetV1, which had pre-trained weights, and Altered-ResNet (A-ResNet), which used Conv2d layers in ResNet for feature extraction and pooling and softmax layers for classifications. The authors use the best optimizer after comparing a number of them during the training phase, along with various combinations of batch and epoch. A-ResNet, the top model overall, detects 86/104 Labelled Faces in the Wild (LFW) dataset images in 0.50 seconds. The proposed approach was evaluated and received an accuracy of 91.7%. Along with this, the system achieved a training accuracy of 98.53% and a testing accuracy of 99.15% for masked face recognition. The proposed method exhibits competitive outcomes when measured against other cutting-edge algorithms and models. Finally, when it comes to why the suggested model is superior to ResNet, it may be because the A-ResNet is simpler thus it can perform at its best with little data, whereas deeper networks require higher data size.","PeriodicalId":193299,"journal":{"name":"International Journal of Advanced Network, Monitoring and Controls","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Advanced Network, Monitoring and Controls","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/ijanmc-2023-0052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract This study offers a novel face recognition and classification method based on classifiers that use statistical local features. The use of ResNet has generated growing interest in a variety of areas of image processing and computer vision in recent years and demonstrated its usefulness in several applications, especially for facial image analysis, which includes tasks as varied as face detection, face recognition, facial expression analysis, demographic classification, etc. This paper is divided into two steps i.e. face recognition and classification. The first step in face recognition is automatic data cleansing which is done with the help of Multi-Task Cascaded Convolutional Neural Networks (MTCNNs) and face.evoLVe, followed by parameter changes in MTCNN to prevent dirty data. The authors next trained two models: Inception-ResNetV1, which had pre-trained weights, and Altered-ResNet (A-ResNet), which used Conv2d layers in ResNet for feature extraction and pooling and softmax layers for classifications. The authors use the best optimizer after comparing a number of them during the training phase, along with various combinations of batch and epoch. A-ResNet, the top model overall, detects 86/104 Labelled Faces in the Wild (LFW) dataset images in 0.50 seconds. The proposed approach was evaluated and received an accuracy of 91.7%. Along with this, the system achieved a training accuracy of 98.53% and a testing accuracy of 99.15% for masked face recognition. The proposed method exhibits competitive outcomes when measured against other cutting-edge algorithms and models. Finally, when it comes to why the suggested model is superior to ResNet, it may be because the A-ResNet is simpler thus it can perform at its best with little data, whereas deeper networks require higher data size.

查看原文本刊更多论文

探索A-ResNet在独立于人的人脸识别和分类中的潜力

摘要提出了一种基于统计局部特征的分类器的人脸识别分类方法。近年来，ResNet的使用在图像处理和计算机视觉的各个领域引起了越来越多的兴趣，并在几个应用中证明了它的有用性，特别是在面部图像分析方面，其中包括面部检测，面部识别，面部表情分析，人口统计分类等各种任务。本文分为两个步骤，即人脸识别和分类。人脸识别的第一步是自动数据清洗，这是在多任务级联卷积神经网络(MTCNNs)和人脸的帮助下完成的。然后在MTCNN中更改参数以防止脏数据。作者接下来训练了两个模型:Inception-ResNetV1，它具有预训练的权重，以及Altered-ResNet (A-ResNet)，它使用ResNet中的Conv2d层进行特征提取和池化，使用softmax层进行分类。作者在训练阶段比较了许多优化器，以及批处理和epoch的各种组合后，使用了最好的优化器。A-ResNet是顶级模型，在0.50秒内检测到86/104张野外(LFW)数据集图像中的标记面孔。对该方法进行了评估，准确率为91.7%。与此同时，该系统在蒙面人脸识别方面的训练准确率为98.53%，测试准确率为99.15%。当与其他前沿算法和模型进行比较时，所提出的方法显示出具有竞争力的结果。最后，当谈到为什么建议的模型优于ResNet时，可能是因为A-ResNet更简单，因此它可以在少量数据下发挥最佳效果，而更深层次的网络需要更高的数据量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Advanced Network, Monitoring and Controls

自引率

0.00%

发文量