Using Adaptive Trackers for Video Face Recognition from a Single Sample Per Person

2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA) Pub Date : 2018-11-01 DOI:10.1109/IPTA.2018.8608163

Francis Charette Migneault, Eric Granger, F. Mokhayeri

{"title":"Using Adaptive Trackers for Video Face Recognition from a Single Sample Per Person","authors":"Francis Charette Migneault, Eric Granger, F. Mokhayeri","doi":"10.1109/IPTA.2018.8608163","DOIUrl":null,"url":null,"abstract":"Still-to-video face recognition (FR) is an important function in many video surveillance applications, allowing to recognize target individuals of interest appearing over a distributed network of cameras. Systems for still-to-video FR match faces captured in videos under challenging conditions against facial models, often based on a single reference still per individual. To improve robustness to intra-class variations, an adaptive visual tracker is considered for learning of a diversified face trajectory model for each person appearing in the scene. These appearance models are updated along a trajectory, and matched against the reference gallery stills of each individual enrolled to the system. Matching scores per individual are thereby accumulated over successive frames for robust spatio-temporal recognition. In a specific implementation, face trajectory models learned with a STRUCK tracker are compared to reference stills using an ensemble of SVMs per individual that are trained a priori to discriminate target reference faces (in gallery stills) versus non-target faces (in videos from the operational domain). To represent common pose and illumination variations, domain-specific face synthesis is employed to augment the number of reference stills. Experimental results obtained with this implementation on the Chokepoint video dataset indicate that the proposed system can maintain a comparably high level of accuracy versus state-of-the-art systems, yet requires a lower complexity.","PeriodicalId":272294,"journal":{"name":"2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"134 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPTA.2018.8608163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Still-to-video face recognition (FR) is an important function in many video surveillance applications, allowing to recognize target individuals of interest appearing over a distributed network of cameras. Systems for still-to-video FR match faces captured in videos under challenging conditions against facial models, often based on a single reference still per individual. To improve robustness to intra-class variations, an adaptive visual tracker is considered for learning of a diversified face trajectory model for each person appearing in the scene. These appearance models are updated along a trajectory, and matched against the reference gallery stills of each individual enrolled to the system. Matching scores per individual are thereby accumulated over successive frames for robust spatio-temporal recognition. In a specific implementation, face trajectory models learned with a STRUCK tracker are compared to reference stills using an ensemble of SVMs per individual that are trained a priori to discriminate target reference faces (in gallery stills) versus non-target faces (in videos from the operational domain). To represent common pose and illumination variations, domain-specific face synthesis is employed to augment the number of reference stills. Experimental results obtained with this implementation on the Chokepoint video dataset indicate that the proposed system can maintain a comparably high level of accuracy versus state-of-the-art systems, yet requires a lower complexity.

查看原文本刊更多论文

使用自适应跟踪器对每个人的单个样本进行视频人脸识别

静止到视频的人脸识别(FR)在许多视频监控应用中是一项重要功能，它允许识别分布式摄像机网络中出现的目标个人。静止到视频FR系统在具有挑战性的条件下与面部模型匹配视频中捕获的面部，通常基于每个人的单个参考图像。为了提高对类内变化的鲁棒性，考虑了一种自适应视觉跟踪器，用于学习场景中出现的每个人的多样化面部轨迹模型。这些外观模型沿着轨迹更新，并与系统中登记的每个人的参考画廊剧照相匹配。因此，每个个体的匹配分数在连续的帧中累积，以实现鲁棒的时空识别。在具体实现中，使用STRUCK跟踪器学习的面部轨迹模型与参考静态图像进行比较，使用每个个体的svm集合进行先验训练，以区分目标参考面部(在画廊静态图像中)与非目标面部(在来自操作域的视频中)。为了表示常见的姿势和照明变化，采用特定领域的人脸合成来增加参考静态图像的数量。在阻塞点视频数据集上实现的实验结果表明，与最先进的系统相比，所提出的系统可以保持相当高的精度，但需要更低的复杂性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA)

自引率

0.00%

发文量