Using Adaptive Trackers for Video Face Recognition from a Single Sample Per Person

Francis Charette Migneault, Eric Granger, F. Mokhayeri
{"title":"Using Adaptive Trackers for Video Face Recognition from a Single Sample Per Person","authors":"Francis Charette Migneault, Eric Granger, F. Mokhayeri","doi":"10.1109/IPTA.2018.8608163","DOIUrl":null,"url":null,"abstract":"Still-to-video face recognition (FR) is an important function in many video surveillance applications, allowing to recognize target individuals of interest appearing over a distributed network of cameras. Systems for still-to-video FR match faces captured in videos under challenging conditions against facial models, often based on a single reference still per individual. To improve robustness to intra-class variations, an adaptive visual tracker is considered for learning of a diversified face trajectory model for each person appearing in the scene. These appearance models are updated along a trajectory, and matched against the reference gallery stills of each individual enrolled to the system. Matching scores per individual are thereby accumulated over successive frames for robust spatio-temporal recognition. In a specific implementation, face trajectory models learned with a STRUCK tracker are compared to reference stills using an ensemble of SVMs per individual that are trained a priori to discriminate target reference faces (in gallery stills) versus non-target faces (in videos from the operational domain). To represent common pose and illumination variations, domain-specific face synthesis is employed to augment the number of reference stills. Experimental results obtained with this implementation on the Chokepoint video dataset indicate that the proposed system can maintain a comparably high level of accuracy versus state-of-the-art systems, yet requires a lower complexity.","PeriodicalId":272294,"journal":{"name":"2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"134 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPTA.2018.8608163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Still-to-video face recognition (FR) is an important function in many video surveillance applications, allowing to recognize target individuals of interest appearing over a distributed network of cameras. Systems for still-to-video FR match faces captured in videos under challenging conditions against facial models, often based on a single reference still per individual. To improve robustness to intra-class variations, an adaptive visual tracker is considered for learning of a diversified face trajectory model for each person appearing in the scene. These appearance models are updated along a trajectory, and matched against the reference gallery stills of each individual enrolled to the system. Matching scores per individual are thereby accumulated over successive frames for robust spatio-temporal recognition. In a specific implementation, face trajectory models learned with a STRUCK tracker are compared to reference stills using an ensemble of SVMs per individual that are trained a priori to discriminate target reference faces (in gallery stills) versus non-target faces (in videos from the operational domain). To represent common pose and illumination variations, domain-specific face synthesis is employed to augment the number of reference stills. Experimental results obtained with this implementation on the Chokepoint video dataset indicate that the proposed system can maintain a comparably high level of accuracy versus state-of-the-art systems, yet requires a lower complexity.
使用自适应跟踪器对每个人的单个样本进行视频人脸识别
静止到视频的人脸识别(FR)在许多视频监控应用中是一项重要功能,它允许识别分布式摄像机网络中出现的目标个人。静止到视频FR系统在具有挑战性的条件下与面部模型匹配视频中捕获的面部,通常基于每个人的单个参考图像。为了提高对类内变化的鲁棒性,考虑了一种自适应视觉跟踪器,用于学习场景中出现的每个人的多样化面部轨迹模型。这些外观模型沿着轨迹更新,并与系统中登记的每个人的参考画廊剧照相匹配。因此,每个个体的匹配分数在连续的帧中累积,以实现鲁棒的时空识别。在具体实现中,使用STRUCK跟踪器学习的面部轨迹模型与参考静态图像进行比较,使用每个个体的svm集合进行先验训练,以区分目标参考面部(在画廊静态图像中)与非目标面部(在来自操作域的视频中)。为了表示常见的姿势和照明变化,采用特定领域的人脸合成来增加参考静态图像的数量。在阻塞点视频数据集上实现的实验结果表明,与最先进的系统相比,所提出的系统可以保持相当高的精度,但需要更低的复杂性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信