上下文感知人脸验证和识别的一次性相似性预训练分类器

2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA) Pub Date : 2017-02-01 DOI:10.1109/ISBA.2017.7947687

Monika Sharma, R. Hebbalaguppe, L. Vig

{"title":"上下文感知人脸验证和识别的一次性相似性预训练分类器","authors":"Monika Sharma, R. Hebbalaguppe, L. Vig","doi":"10.1109/ISBA.2017.7947687","DOIUrl":null,"url":null,"abstract":"Most affect based systems analyse facial expressions for emotion detection, and utilize face detection and recognition methods in order to do effective affect analysis. Recent work has demonstrated the efficacy of deep architectures for face recognition by training as classifiers on voluminous datasets. Some architectures are trained as classifiers, and some directly learn an embedding via a triplet loss function. In this paper, we consider the case of one shot prediction from the feature space learnt initially via classification, i.e. we consider the situation where we have a pre-trained model, but do not have access to the training data and are required to make predictions on novel faces with just one training image per identity. We utilize the one shot similarity metric in order to compute similarity scores and compare it with the state-of-the-art results on the Youtube videos face dataset (YTF). We demonstrate the effect of temporal context on frame wise face recognition, and use a probabilistic majority voting scheme over past frames to determine current frame identity. Additionally, we found a number of labelling errors in the Youtube face dataset that were not published in the errata, and have published the same online for the benefit of the community.","PeriodicalId":436086,"journal":{"name":"2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Pre-trained classifiers with One Shot Similarity for context aware face verification and identification\",\"authors\":\"Monika Sharma, R. Hebbalaguppe, L. Vig\",\"doi\":\"10.1109/ISBA.2017.7947687\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most affect based systems analyse facial expressions for emotion detection, and utilize face detection and recognition methods in order to do effective affect analysis. Recent work has demonstrated the efficacy of deep architectures for face recognition by training as classifiers on voluminous datasets. Some architectures are trained as classifiers, and some directly learn an embedding via a triplet loss function. In this paper, we consider the case of one shot prediction from the feature space learnt initially via classification, i.e. we consider the situation where we have a pre-trained model, but do not have access to the training data and are required to make predictions on novel faces with just one training image per identity. We utilize the one shot similarity metric in order to compute similarity scores and compare it with the state-of-the-art results on the Youtube videos face dataset (YTF). We demonstrate the effect of temporal context on frame wise face recognition, and use a probabilistic majority voting scheme over past frames to determine current frame identity. Additionally, we found a number of labelling errors in the Youtube face dataset that were not published in the errata, and have published the same online for the benefit of the community.\",\"PeriodicalId\":436086,\"journal\":{\"name\":\"2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISBA.2017.7947687\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISBA.2017.7947687","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

大多数基于情感的系统通过分析面部表情来进行情感检测，并利用人脸检测和识别方法来进行有效的情感分析。最近的工作已经通过在大量数据集上训练分类器来证明深度架构对人脸识别的有效性。有些架构被训练成分类器，有些则直接通过三元损失函数学习嵌入。在本文中，我们考虑了从最初通过分类学习的特征空间中进行一次预测的情况，即我们考虑了我们有一个预训练模型，但无法访问训练数据，并且需要对每个身份只有一个训练图像的新面孔进行预测的情况。我们利用单镜头相似度度量来计算相似度分数，并将其与Youtube视频面部数据集(YTF)上的最新结果进行比较。我们展示了时间上下文对帧智能人脸识别的影响，并使用过去帧的概率多数投票方案来确定当前帧身份。此外，我们在Youtube人脸数据集中发现了许多标记错误，这些错误没有在勘误表中发布，并且为了社区的利益，我们在网上发布了相同的标签错误。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Pre-trained classifiers with One Shot Similarity for context aware face verification and identification

Most affect based systems analyse facial expressions for emotion detection, and utilize face detection and recognition methods in order to do effective affect analysis. Recent work has demonstrated the efficacy of deep architectures for face recognition by training as classifiers on voluminous datasets. Some architectures are trained as classifiers, and some directly learn an embedding via a triplet loss function. In this paper, we consider the case of one shot prediction from the feature space learnt initially via classification, i.e. we consider the situation where we have a pre-trained model, but do not have access to the training data and are required to make predictions on novel faces with just one training image per identity. We utilize the one shot similarity metric in order to compute similarity scores and compare it with the state-of-the-art results on the Youtube videos face dataset (YTF). We demonstrate the effect of temporal context on frame wise face recognition, and use a probabilistic majority voting scheme over past frames to determine current frame identity. Additionally, we found a number of labelling errors in the Youtube face dataset that were not published in the errata, and have published the same online for the benefit of the community.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA)

自引率

0.00%

发文量