Susceptibility to Image Resolution in Face Recognition and Training Strategies to Enhance Robustness

Martin Knoche, S. Hörmann, G. Rigoll
{"title":"Susceptibility to Image Resolution in Face Recognition and Training Strategies to Enhance Robustness","authors":"Martin Knoche, S. Hörmann, G. Rigoll","doi":"10.4230/LITES.8.1.1","DOIUrl":null,"url":null,"abstract":"Face recognition approaches often rely on equal image resolution for verifying faces on two images. However, in practical applications, those image resolutions are usually not in the same range due to different image capture mechanisms or sources. In this work, we first analyze the impact of image resolutions on face verification performance with a state-of-the-art face recognition model. For images synthetically reduced to $5\\,\\times\\,5$ px resolution, the verification performance drops from $99.23\\%$ increasingly down to almost $55\\%$. Especially for cross-resolution image pairs (one high- and one low-resolution image), the verification accuracy decreases even further. We investigate this behavior more in-depth by looking at the feature distances for every 2-image test pair. To tackle this problem, we propose the following two methods: 1) Train a state-of-the-art face-recognition model straightforwardly with $50\\%$ low-resolution images directly within each batch. 2) Train a siamese-network structure and add a cosine distance feature loss between high- and low-resolution features. Both methods show an improvement for cross-resolution scenarios and can increase the accuracy at very low resolution to approximately $70\\%$. However, a disadvantage is that a specific model needs to be trained for every resolution pair. Thus, we extend the aforementioned methods by training them with multiple image resolutions at once. The performances for particular testing image resolutions are slightly worse, but the advantage is that this model can be applied to arbitrary resolution images and achieves an overall better performance ($97.72\\%$ compared to $96.86\\%$). Due to the lack of a benchmark for arbitrary resolution images for the cross-resolution and equal-resolution task, we propose an evaluation protocol for five well-known datasets, focusing on high, mid, and low-resolution images.","PeriodicalId":376325,"journal":{"name":"Leibniz Trans. Embed. Syst.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Leibniz Trans. Embed. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LITES.8.1.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Face recognition approaches often rely on equal image resolution for verifying faces on two images. However, in practical applications, those image resolutions are usually not in the same range due to different image capture mechanisms or sources. In this work, we first analyze the impact of image resolutions on face verification performance with a state-of-the-art face recognition model. For images synthetically reduced to $5\,\times\,5$ px resolution, the verification performance drops from $99.23\%$ increasingly down to almost $55\%$. Especially for cross-resolution image pairs (one high- and one low-resolution image), the verification accuracy decreases even further. We investigate this behavior more in-depth by looking at the feature distances for every 2-image test pair. To tackle this problem, we propose the following two methods: 1) Train a state-of-the-art face-recognition model straightforwardly with $50\%$ low-resolution images directly within each batch. 2) Train a siamese-network structure and add a cosine distance feature loss between high- and low-resolution features. Both methods show an improvement for cross-resolution scenarios and can increase the accuracy at very low resolution to approximately $70\%$. However, a disadvantage is that a specific model needs to be trained for every resolution pair. Thus, we extend the aforementioned methods by training them with multiple image resolutions at once. The performances for particular testing image resolutions are slightly worse, but the advantage is that this model can be applied to arbitrary resolution images and achieves an overall better performance ($97.72\%$ compared to $96.86\%$). Due to the lack of a benchmark for arbitrary resolution images for the cross-resolution and equal-resolution task, we propose an evaluation protocol for five well-known datasets, focusing on high, mid, and low-resolution images.
人脸识别对图像分辨率的敏感性及增强鲁棒性的训练策略
人脸识别方法通常依赖于相等的图像分辨率来验证两张图像上的人脸。然而,在实际应用中,由于不同的图像捕获机制或来源,这些图像分辨率通常不在同一范围内。在这项工作中,我们首先用最先进的人脸识别模型分析了图像分辨率对人脸验证性能的影响。对于综合降低到$5\,\times\,5$ px分辨率的图像,验证性能从$ 99.23% $逐渐下降到$55\%$。特别是对于交叉分辨率图像对(一个高分辨率和一个低分辨率图像),验证精度进一步降低。我们通过查看每个2-图像测试对的特征距离来更深入地研究这种行为。为了解决这个问题,我们提出以下两种方法:1)直接在每个批次中使用$ 50%的低分辨率图像直接训练最先进的人脸识别模型。2)训练连体网络结构,在高分辨率和低分辨率特征之间添加余弦距离特征损失。两种方法都显示出跨分辨率场景的改进,并且可以在非常低的分辨率下将精度提高到大约70%。然而,缺点是需要为每个分辨率对训练特定的模型。因此,我们通过同时训练多个图像分辨率来扩展上述方法。特定测试图像分辨率的性能稍差,但优点是该模型可以应用于任意分辨率的图像,并且实现了更好的整体性能(97.72\%$比96.86\%$)。由于缺乏针对任意分辨率图像的跨分辨率和等分辨率任务的基准,我们提出了针对五个已知数据集的评估方案,重点关注高、中、低分辨率图像。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信