A two-stage algorithm for heterogeneous face recognition using Deep Stacked PCA Descriptor (DSPD) and Coupled Discriminant Neighbourhood Embedding (CDNE)

Neural Computing and Applications Pub Date : 2024-08-14 DOI:10.1007/s00521-024-10272-5

Shubhobrata Bhattacharya

{"title":"A two-stage algorithm for heterogeneous face recognition using Deep Stacked PCA Descriptor (DSPD) and Coupled Discriminant Neighbourhood Embedding (CDNE)","authors":"Shubhobrata Bhattacharya","doi":"10.1007/s00521-024-10272-5","DOIUrl":null,"url":null,"abstract":"<p>Automatic face recognition has made significant progress in recent decades, particularly in controlled environments. However, recognizing faces across different modalities, known as Heterogeneous Face Recognition, presents challenges due to variations in modality gaps. This paper addresses the problem of HFR by proposing a two-stage algorithm. In the first stage, a deep stacked PCA descriptor (DSPD) is introduced to extract domain-invariant features from face images of different modalities. The DSPD utilizes multiple convolution layers of domain-trained PCA filters, and the features extracted from each layer are concatenated to obtain a final feature representation. Additionally, pre-processing steps are applied to input images to enhance the prominence of facial edges, making the features more distinctive. The obtained DSPD features can be directly used for recognition using nearest neighbour algorithms. To further improve recognition robustness, a coupled subspace called coupled discriminant neighbourhood embedding (CDNE) is proposed in the second stage. CDNE is trained with a limited number of data samples and can project DSPD features from different modalities onto a common subspace. In this subspace, data points representing the same subjects from different modalities are positioned closely, while those of different subjects are positioned apart. This spatial arrangement enhances the recognition of heterogeneous faces using nearest neighbour algorithms. Experimental results demonstrate the effectiveness of the proposed algorithm on various HFR scenarios, including VIS-NIR, VIS-Sketch, and VIS-Thermal face pairs from respective databases. The algorithm shows promising performance in addressing the challenges posed by the modality gap, providing a potential solution for accurate and robust Heterogeneous Face Recognition.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Computing and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00521-024-10272-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Automatic face recognition has made significant progress in recent decades, particularly in controlled environments. However, recognizing faces across different modalities, known as Heterogeneous Face Recognition, presents challenges due to variations in modality gaps. This paper addresses the problem of HFR by proposing a two-stage algorithm. In the first stage, a deep stacked PCA descriptor (DSPD) is introduced to extract domain-invariant features from face images of different modalities. The DSPD utilizes multiple convolution layers of domain-trained PCA filters, and the features extracted from each layer are concatenated to obtain a final feature representation. Additionally, pre-processing steps are applied to input images to enhance the prominence of facial edges, making the features more distinctive. The obtained DSPD features can be directly used for recognition using nearest neighbour algorithms. To further improve recognition robustness, a coupled subspace called coupled discriminant neighbourhood embedding (CDNE) is proposed in the second stage. CDNE is trained with a limited number of data samples and can project DSPD features from different modalities onto a common subspace. In this subspace, data points representing the same subjects from different modalities are positioned closely, while those of different subjects are positioned apart. This spatial arrangement enhances the recognition of heterogeneous faces using nearest neighbour algorithms. Experimental results demonstrate the effectiveness of the proposed algorithm on various HFR scenarios, including VIS-NIR, VIS-Sketch, and VIS-Thermal face pairs from respective databases. The algorithm shows promising performance in addressing the challenges posed by the modality gap, providing a potential solution for accurate and robust Heterogeneous Face Recognition.

Abstract Image

查看原文本刊更多论文

使用深度堆积 PCA 描述符 (DSPD) 和耦合判别邻域嵌入 (CDNE) 的两阶段异构人脸识别算法

近几十年来，自动人脸识别技术取得了长足进步，尤其是在受控环境中。然而，由于模态间隙的差异，识别不同模态的人脸（即异构人脸识别）面临着挑战。本文通过提出一种两阶段算法来解决 HFR 问题。在第一阶段，引入深度堆叠 PCA 描述符（DSPD），从不同模态的人脸图像中提取域不变特征。DSPD 利用多层领域训练 PCA 过滤器的卷积层，将从每一层提取的特征串联起来，以获得最终的特征表示。此外，还对输入图像进行预处理，以增强面部边缘的突出度，使特征更加鲜明。获得的 DSPD 特征可直接用于使用近邻算法进行识别。为了进一步提高识别鲁棒性，第二阶段提出了一种称为耦合判别邻域嵌入（CDNE）的耦合子空间。CDNE 使用有限的数据样本进行训练，可以将不同模态的 DSPD 特征投射到一个共同的子空间。在这个子空间中，来自不同模态的代表相同受试者的数据点被紧密定位，而来自不同受试者的数据点则被分开定位。这种空间安排提高了使用近邻算法识别异质人脸的能力。实验结果证明了所提算法在各种 HFR 场景下的有效性，包括来自各自数据库的 VIS-NIR、VIS-Sketch 和 VIS-Thermal 人脸对。该算法在应对模态差距带来的挑战方面表现出了良好的性能，为准确、稳健的异构人脸识别提供了潜在的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neural Computing and Applications

自引率

0.00%

发文量