State-of-the-art face recognition performance using publicly available software and datasets

2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) Pub Date : 2018-03-21 DOI:10.1109/ATSIP.2018.8364450

Mohamed Amine Hmani, D. Petrovska-Delacrétaz

{"title":"State-of-the-art face recognition performance using publicly available software and datasets","authors":"Mohamed Amine Hmani, D. Petrovska-Delacrétaz","doi":"10.1109/ATSIP.2018.8364450","DOIUrl":null,"url":null,"abstract":"We are interested in the reproducibility of face recognition systems. By reproducibility we mean: is the scientific community, and are the researchers from different sides, capable of reproducing the last published results by a big company, that has at its disposal huge computational power and huge proprietary databases? With the constant advancements in GPU computation power and availability of open-source software, the reproducibility of published results should not be a problem. But, if architectures of the systems are private and databases are proprietary, the reproducibility of published results can not be easily attained. To tackle this problem, we focus on training and evaluation of face recognition systems on publicly available data and software. We are also interested in comparing the best Deep Neural Net (DNN) based results with a baseline “classical” system. This paper exploits the OpenFace open-source system to generate a deep convolutional neural network model using publicly available datasets. We study the impact of the size of the datasets, their quality and compare the performance to a classical face recognition approach. Our focus is to have a fully reproducible model. To this end, we used publicly available datasets (FRGC, MS-celeb-lM, MOBIO, LFW), as well publicly available software (OpenFace) to train our model in order to do face recognition. Our best trained model achieves 97.52% accuracy on the Labelled in the Wild dataset (LFW) dataset which is lower than Google's best reported results of 99.96% but slightly better than FaceBook's reported result of 97.35%. We also evaluated our best model on the challenging video dataset MOBIO and report competitive results with the best reported results on this database.","PeriodicalId":332253,"journal":{"name":"2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP.2018.8364450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

We are interested in the reproducibility of face recognition systems. By reproducibility we mean: is the scientific community, and are the researchers from different sides, capable of reproducing the last published results by a big company, that has at its disposal huge computational power and huge proprietary databases? With the constant advancements in GPU computation power and availability of open-source software, the reproducibility of published results should not be a problem. But, if architectures of the systems are private and databases are proprietary, the reproducibility of published results can not be easily attained. To tackle this problem, we focus on training and evaluation of face recognition systems on publicly available data and software. We are also interested in comparing the best Deep Neural Net (DNN) based results with a baseline “classical” system. This paper exploits the OpenFace open-source system to generate a deep convolutional neural network model using publicly available datasets. We study the impact of the size of the datasets, their quality and compare the performance to a classical face recognition approach. Our focus is to have a fully reproducible model. To this end, we used publicly available datasets (FRGC, MS-celeb-lM, MOBIO, LFW), as well publicly available software (OpenFace) to train our model in order to do face recognition. Our best trained model achieves 97.52% accuracy on the Labelled in the Wild dataset (LFW) dataset which is lower than Google's best reported results of 99.96% but slightly better than FaceBook's reported result of 97.35%. We also evaluated our best model on the challenging video dataset MOBIO and report competitive results with the best reported results on this database.

查看原文本刊更多论文

最先进的面部识别性能使用公开可用的软件和数据集

我们对人脸识别系统的再现性很感兴趣。我们所说的可重复性是指:科学界，以及来自不同方面的研究人员，是否有能力重现一家拥有巨大计算能力和庞大专有数据库的大公司最近发表的结果?随着GPU计算能力的不断提高和开源软件的可用性，发布结果的可重复性应该不是问题。但是，如果系统的体系结构是私有的，并且数据库是专有的，那么发布结果的再现性就不容易获得。为了解决这个问题，我们专注于在公开可用的数据和软件上训练和评估人脸识别系统。我们也有兴趣将基于深度神经网络(DNN)的最佳结果与基线“经典”系统进行比较。本文利用OpenFace开源系统，利用公开可用的数据集生成深度卷积神经网络模型。我们研究了数据集的大小及其质量的影响，并将其性能与经典的人脸识别方法进行了比较。我们的重点是建立一个完全可复制的模型。为此，我们使用公开可用的数据集(FRGC, ms - celebrity - lm, MOBIO, LFW)以及公开可用的软件(OpenFace)来训练我们的模型，以便进行人脸识别。我们最好的训练模型在野生数据集(LFW)数据集上达到97.52%的准确率，低于谷歌报告的99.96%的最佳结果，但略好于FaceBook报告的97.35%的结果。我们还在具有挑战性的视频数据集MOBIO上评估了我们的最佳模型，并报告了与该数据库上的最佳报告结果相竞争的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

自引率

0.00%

发文量