CoNAN：用于无约束远距离生物特征融合的条件神经聚合网络

IF 5

IEEE transactions on biometrics, behavior, and identity science Pub Date : 2024-06-06 DOI:10.1109/TBIOM.2024.3410311

Bhavin Jawade;Deen Dayal Mohan;Prajwal Shetty;Dennis Fedorishin;Srirangaraj Setlur;Venu Govindaraju

{"title":"CoNAN：用于无约束远距离生物特征融合的条件神经聚合网络","authors":"Bhavin Jawade;Deen Dayal Mohan;Prajwal Shetty;Dennis Fedorishin;Srirangaraj Setlur;Venu Govindaraju","doi":"10.1109/TBIOM.2024.3410311","DOIUrl":null,"url":null,"abstract":"Person recognition from image sets acquired under unregulated and uncontrolled settings, such as at large distances, low resolutions, varying viewpoints, illumination, pose, and atmospheric conditions, is challenging. Feature aggregation, which involves aggregating a set of N feature representations present in a template into a single global representation, plays a pivotal role in such recognition systems. Existing works in traditional face feature aggregation either utilize metadata or high-dimensional intermediate feature representations to estimate feature quality for aggregation. However, generating high-quality metadata or style information is not feasible for extremely low-resolution faces captured in long-range and high altitude settings. To overcome these limitations, we propose a feature distribution conditioning approach called CoNAN for template aggregation. Specifically, our method aims to learn a context vector conditioned over the distribution information of the incoming feature set, which is utilized to weigh the features based on their estimated informativeness. The proposed method produces state-of-the-art results on long-range unconstrained face recognition datasets such as BTS, and DroneSURF, validating the advantages of such an aggregation strategy. We show that CoNAN generalizes present CoNAN’s results on other modalities such as body features and gait. We also produce extensive qualitative and quantitative experiments on different components of CoNAN.","PeriodicalId":73307,"journal":{"name":"IEEE transactions on biometrics, behavior, and identity science","volume":"6 4","pages":"602-612"},"PeriodicalIF":5.0000,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CoNAN: Conditional Neural Aggregation Network for Unconstrained Long Range Biometric Feature Fusion\",\"authors\":\"Bhavin Jawade;Deen Dayal Mohan;Prajwal Shetty;Dennis Fedorishin;Srirangaraj Setlur;Venu Govindaraju\",\"doi\":\"10.1109/TBIOM.2024.3410311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Person recognition from image sets acquired under unregulated and uncontrolled settings, such as at large distances, low resolutions, varying viewpoints, illumination, pose, and atmospheric conditions, is challenging. Feature aggregation, which involves aggregating a set of N feature representations present in a template into a single global representation, plays a pivotal role in such recognition systems. Existing works in traditional face feature aggregation either utilize metadata or high-dimensional intermediate feature representations to estimate feature quality for aggregation. However, generating high-quality metadata or style information is not feasible for extremely low-resolution faces captured in long-range and high altitude settings. To overcome these limitations, we propose a feature distribution conditioning approach called CoNAN for template aggregation. Specifically, our method aims to learn a context vector conditioned over the distribution information of the incoming feature set, which is utilized to weigh the features based on their estimated informativeness. The proposed method produces state-of-the-art results on long-range unconstrained face recognition datasets such as BTS, and DroneSURF, validating the advantages of such an aggregation strategy. We show that CoNAN generalizes present CoNAN’s results on other modalities such as body features and gait. We also produce extensive qualitative and quantitative experiments on different components of CoNAN.\",\"PeriodicalId\":73307,\"journal\":{\"name\":\"IEEE transactions on biometrics, behavior, and identity science\",\"volume\":\"6 4\",\"pages\":\"602-612\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2024-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on biometrics, behavior, and identity science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10551380/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biometrics, behavior, and identity science","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10551380/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

从在不规范和不受控的环境下获取的图像集（如远距离、低分辨率、不同视角、光照、姿势和大气条件）中进行人物识别具有挑战性。特征聚合是指将模板中的 N 个特征表征集合成一个单一的全局表征，在此类识别系统中起着至关重要的作用。现有的传统人脸特征聚合工作要么利用元数据，要么利用高维中间特征表示来估计特征质量，以便进行聚合。然而，对于在远距离和高海拔环境下捕捉到的分辨率极低的人脸，生成高质量的元数据或样式信息是不可行的。为了克服这些限制，我们提出了一种用于模板聚合的名为 CoNAN 的特征分布调节方法。具体来说，我们的方法旨在学习一个以输入特征集的分布信息为条件的上下文向量，并根据估计的信息量对特征进行权衡。所提出的方法在 BTS 和 DroneSURF 等远程无约束人脸识别数据集上取得了一流的结果，验证了这种聚合策略的优势。我们的研究表明，CoNAN 可以在身体特征和步态等其他模态上推广 CoNAN 的成果。我们还对 CoNAN 的不同组件进行了广泛的定性和定量实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CoNAN: Conditional Neural Aggregation Network for Unconstrained Long Range Biometric Feature Fusion

Person recognition from image sets acquired under unregulated and uncontrolled settings, such as at large distances, low resolutions, varying viewpoints, illumination, pose, and atmospheric conditions, is challenging. Feature aggregation, which involves aggregating a set of N feature representations present in a template into a single global representation, plays a pivotal role in such recognition systems. Existing works in traditional face feature aggregation either utilize metadata or high-dimensional intermediate feature representations to estimate feature quality for aggregation. However, generating high-quality metadata or style information is not feasible for extremely low-resolution faces captured in long-range and high altitude settings. To overcome these limitations, we propose a feature distribution conditioning approach called CoNAN for template aggregation. Specifically, our method aims to learn a context vector conditioned over the distribution information of the incoming feature set, which is utilized to weigh the features based on their estimated informativeness. The proposed method produces state-of-the-art results on long-range unconstrained face recognition datasets such as BTS, and DroneSURF, validating the advantages of such an aggregation strategy. We show that CoNAN generalizes present CoNAN’s results on other modalities such as body features and gait. We also produce extensive qualitative and quantitative experiments on different components of CoNAN.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE transactions on biometrics, behavior, and identity science

CiteScore

10.90

自引率

0.00%

发文量