General Requirements on Synthetic Fingerprint Images for Biometric Authentication and Forensic Investigations

Proceedings of the 2021 ACM Workshop on Information Hiding and Multimedia Security. Pub Date: 2021-06-17. DOI: 10.1145/3437880.3460410

A. Makrushin, Christof Kauba, Simon Kirchgasser, Stefan Seidlitz, Christian Kraetzer, A. Uhl, J. Dittmann


Citations: 7

Abstract

The generation of synthetic biometric samples, such as fingerprint images, is becoming increasingly important, especially in view of recent cross-border regulations on the security of private data. Regulations such as the EU GDPR designate biometric data as a special category of private data, which makes sharing datasets of biometric samples nearly impossible, even for research purposes. The use of fingerprint images in forensic research faces the same challenge. Replacing real datasets with synthetic ones is the most advantageous and straightforward solution. It bears, however, the risk of generating "unrealistic" samples or "unrealistic distributions" of samples that may nonetheless appear visually realistic. Despite numerous efforts to generate high-quality fingerprints, there is still no common agreement on how to define "high-quality" or how to validate that generated samples are realistic enough. Here, we propose general requirements on synthetic biometric samples (which also apply to fingerprint images used in forensic application scenarios), together with formal metrics to validate whether the requirements are fulfilled. Validating the proposed requirements makes it possible to establish the quality of a generative model (informed evaluation) or even the quality of a dataset of generated samples (blind evaluation). Moreover, we demonstrate by example how the proposed evaluation concept can be applied to a comparison of real and synthetic datasets, with the aim of revealing whether the synthetic samples exhibit properties significantly different from those of real ones.
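As a rough illustration of what comparing a real and a synthetic dataset for "significantly different properties" might look like, the sketch below applies a two-sample Kolmogorov-Smirnov statistic with a permutation test to one scalar property per sample. This is not the metric set proposed in the paper: the choice of test, the function names `ks_statistic` and `permutation_p_value`, and the randomly generated placeholder scores are all assumptions made for the example.

```python
import random
from bisect import bisect_right


def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between empirical CDFs."""
    a_sorted, b_sorted = sorted(a), sorted(b)
    gap = 0.0
    # The largest gap between two step functions occurs at one of the sample points.
    for x in a_sorted + b_sorted:
        cdf_a = bisect_right(a_sorted, x) / len(a_sorted)
        cdf_b = bisect_right(b_sorted, x) / len(b_sorted)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


def permutation_p_value(a, b, rounds=500, seed=0):
    """Share of random relabelings whose KS statistic reaches the observed one."""
    rng = random.Random(seed)
    observed = ks_statistic(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(rounds):
        rng.shuffle(pooled)
        if ks_statistic(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (rounds + 1)


# Hypothetical placeholder scores (e.g. one quality value per image).
# These are randomly generated for illustration, NOT measurements from the paper.
rng = random.Random(42)
real_scores = [rng.gauss(50, 10) for _ in range(200)]
synthetic_scores = [rng.gauss(58, 10) for _ in range(200)]

p = permutation_p_value(real_scores, synthetic_scores)
print(f"KS statistic: {ks_statistic(real_scores, synthetic_scores):.3f}, p = {p:.3f}")
```

A small p-value would suggest that the synthetic samples differ from the real ones in the measured property. A large one only means that this particular property does not separate the two datasets, which is why a single test of this kind cannot by itself establish that a dataset is realistic.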



