Mitigating Catastrophic Interference using Unsupervised Multi-Part Attention for RGB-IR Face Recognition

Kshitij Nikhal, Nkiruka Uzuegbunam, Bridget Kennedy, B. Riggan
{"title":"Mitigating Catastrophic Interference using Unsupervised Multi-Part Attention for RGB-IR Face Recognition","authors":"Kshitij Nikhal, Nkiruka Uzuegbunam, Bridget Kennedy, B. Riggan","doi":"10.1109/CVPRW59228.2023.00039","DOIUrl":null,"url":null,"abstract":"Modern algorithms for RGB-IR facial recognition— a challenging problem where infrared probe images are matched with visible gallery images—leverage precise and accurate guidance from curated (i.e., labeled) data to bridge large spectral differences. However, supervised cross-spectral face recognition methods are often extremely sensitive due to over-fitting to labels, performing well in some settings but not in others. Moreover, when fine-tuning on data from additional settings, supervised cross-spectral face recognition are prone to catastrophic forgetting. Therefore, we propose a novel unsupervised framework for RGB-IR face recognition to minimize the cost and time inefficiencies pertaining to labeling large-scale, multispectral data required to train supervised cross-spectral recognition methods and to alleviate the effect of forgetting by removing over dependence on hard labels to bridge such large spectral differences. The proposed framework integrates an efficient backbone network architecture with part-based attention models, which collectively enhances common information between visible and infrared faces. Then, the framework is optimized using pseudo-labels and a new cross-spectral memory bank loss. This framework is evaluated on the ARL-VTF and TUFTS datasets, achieving 98.55% and 43.28% true accept rate, respectively. 
Additionally, we analyze effects of forgetting and show that our framework is less prone to these effects.","PeriodicalId":355438,"journal":{"name":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPRW59228.2023.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Modern algorithms for RGB-IR facial recognition, a challenging problem in which infrared probe images are matched against visible-spectrum gallery images, leverage precise and accurate guidance from curated (i.e., labeled) data to bridge large spectral differences. However, supervised cross-spectral face recognition methods are often extremely sensitive due to over-fitting to labels, performing well in some settings but not in others. Moreover, when fine-tuned on data from additional settings, supervised cross-spectral face recognition methods are prone to catastrophic forgetting. Therefore, we propose a novel unsupervised framework for RGB-IR face recognition that minimizes the cost and time of labeling the large-scale, multispectral data required to train supervised cross-spectral recognition methods, and that alleviates forgetting by removing over-dependence on hard labels to bridge such large spectral differences. The proposed framework integrates an efficient backbone network architecture with part-based attention models, which collectively enhance the information common to visible and infrared faces. The framework is then optimized using pseudo-labels and a new cross-spectral memory bank loss. It is evaluated on the ARL-VTF and TUFTS datasets, achieving true accept rates of 98.55% and 43.28%, respectively. Additionally, we analyze the effects of forgetting and show that our framework is less prone to them.
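To make the "cross-spectral memory bank loss" idea concrete, the following is a minimal NumPy sketch of one plausible form of such a loss: each spectrum maintains a bank of per-pseudo-identity centroids, and features from one spectrum are scored against the *opposite* spectrum's bank, so matching pseudo-labels across RGB and IR pulls the embeddings together. All names (`CrossSpectralMemoryBank`, the momentum and temperature values) are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale rows of x to unit L2 norm."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

class CrossSpectralMemoryBank:
    """Running centroid per pseudo-identity for each spectrum (hypothetical sketch).

    Features from one spectrum are compared against the other spectrum's
    bank, which encourages spectrum-invariant embeddings.
    """

    def __init__(self, num_ids, dim, momentum=0.2, temperature=0.05):
        rng = np.random.default_rng(0)
        self.bank = {
            "rgb": l2_normalize(rng.normal(size=(num_ids, dim))),
            "ir": l2_normalize(rng.normal(size=(num_ids, dim))),
        }
        self.momentum = momentum
        self.temperature = temperature

    def loss(self, feats, pseudo_labels, spectrum):
        """Softmax cross-entropy of features against the opposite bank."""
        other = "ir" if spectrum == "rgb" else "rgb"
        feats = l2_normalize(feats)
        logits = feats @ self.bank[other].T / self.temperature  # (B, num_ids)
        logits -= logits.max(axis=1, keepdims=True)             # numerical stability
        log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
        return -log_probs[np.arange(len(feats)), pseudo_labels].mean()

    def update(self, feats, pseudo_labels, spectrum):
        """Exponential-moving-average update of this spectrum's centroids."""
        feats = l2_normalize(feats)
        for f, y in zip(feats, pseudo_labels):
            c = (1 - self.momentum) * self.bank[spectrum][y] + self.momentum * f
            self.bank[spectrum][y] = l2_normalize(c)
```

In a training loop one would compute the loss for an RGB batch against the IR bank (and vice versa), backpropagate through the encoder, then call `update` with the detached features; the bank itself is not differentiated through.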