Optimal latent space for Low-shot Face Recognition

2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI) Pub Date : 2023-08-01 DOI:10.1109/IRI58017.2023.00029

Anvaya Rai, B. Lall, Astha Zalani, Raghawendra Prakash Singh, Shikha Srivastava

{"title":"Optimal latent space for Low-shot Face Recognition","authors":"Anvaya Rai, B. Lall, Astha Zalani, Raghawendra Prakash Singh, Shikha Srivastava","doi":"10.1109/IRI58017.2023.00029","DOIUrl":null,"url":null,"abstract":"The ability of humans to learn to classify objects after seeing a very few examples of them in the past, has given rise to the field of Low-shot learning. The idea is to be able to train a deep learning model to differentiate between same and different pairs and then generalise these ideas to evaluate new categories. As shapes, structure and low level visual features of human faces are similar in nature, so we can make use of extensive public face data sets to initially train a deep neural network (DNN), to learn generalised features of human face. We call this face space as Latent Feature Space. Then we demonstrate the use of probablistic interpretation of principal component analysis (PPCA) along with Extreme Learning Machine (ELM) algorithms, as an efficient technique to transform this space for representing our novel dataset classes with limited number of available samples. We avoid any kind of network re-training, while enforcing the network to learn a distance function between images rather than explicitly classifying them. The proposed algorithm couples a deep neural network (DNN) based feature representation with a low-dimensional manifold extraction to address the Low-shot classification and verification problems. We call this low-dimensional subspace as Feature Transformed Latent Space. Also, in addition to providing performance improvements in terms of accuracy, the suggested approach provides significant advantages in terms of memory, computation and speed during classification/ verification tasks, while being agnostic to occlusion, pose, expression and illumination conditions.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI58017.2023.00029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The ability of humans to learn to classify objects after seeing a very few examples of them in the past, has given rise to the field of Low-shot learning. The idea is to be able to train a deep learning model to differentiate between same and different pairs and then generalise these ideas to evaluate new categories. As shapes, structure and low level visual features of human faces are similar in nature, so we can make use of extensive public face data sets to initially train a deep neural network (DNN), to learn generalised features of human face. We call this face space as Latent Feature Space. Then we demonstrate the use of probablistic interpretation of principal component analysis (PPCA) along with Extreme Learning Machine (ELM) algorithms, as an efficient technique to transform this space for representing our novel dataset classes with limited number of available samples. We avoid any kind of network re-training, while enforcing the network to learn a distance function between images rather than explicitly classifying them. The proposed algorithm couples a deep neural network (DNN) based feature representation with a low-dimensional manifold extraction to address the Low-shot classification and verification problems. We call this low-dimensional subspace as Feature Transformed Latent Space. Also, in addition to providing performance improvements in terms of accuracy, the suggested approach provides significant advantages in terms of memory, computation and speed during classification/ verification tasks, while being agnostic to occlusion, pose, expression and illumination conditions.

查看原文本刊更多论文

低镜头人脸识别的最优潜在空间

在过去，人类在看到很少的例子后就学会对物体进行分类的能力，已经引起了Low-shot学习领域的兴起。这个想法是能够训练一个深度学习模型来区分相同和不同的对，然后概括这些想法来评估新的类别。由于人脸的形状、结构和低级视觉特征在本质上是相似的，因此我们可以利用大量的公共人脸数据集来初始训练深度神经网络(DNN)，以学习人脸的泛化特征。我们把这个面空间称为潜在特征空间。然后，我们演示了主成分分析(PPCA)的概率解释以及极限学习机(ELM)算法的使用，作为一种有效的技术来转换这个空间，以表示有限数量的可用样本的新数据集类。我们避免了任何类型的网络再训练，同时强制网络学习图像之间的距离函数，而不是明确地对它们进行分类。该算法将基于深度神经网络(DNN)的特征表示与低维流形提取相结合，以解决Low-shot分类和验证问题。我们把这个低维子空间称为特征变换潜空间。此外，除了在准确性方面提供性能改进外，建议的方法在分类/验证任务期间在内存，计算和速度方面具有显着优势，同时不受遮挡，姿势，表情和照明条件的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)

自引率

0.00%

发文量