基于傅立叶变换损失的并行全局和局部注意力视觉生成对抗网络生成假虹膜图像

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-05-12 DOI:10.1016/j.engappai.2025.111080

Jung Soo Kim, Jin Seong Hong, Seung Gu Kim, Kang Ryoung Park

{"title":"基于傅立叶变换损失的并行全局和局部注意力视觉生成对抗网络生成假虹膜图像","authors":"Jung Soo Kim, Jin Seong Hong, Seung Gu Kim, Kang Ryoung Park","doi":"10.1016/j.engappai.2025.111080","DOIUrl":null,"url":null,"abstract":"<div><div>The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.</div><div>When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"155 ","pages":"Article 111080"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Parallel-wise global and local attention vision transformer-based generative adversarial network using fourier transform loss for generating fake iris image\",\"authors\":\"Jung Soo Kim, Jin Seong Hong, Seung Gu Kim, Kang Ryoung Park\",\"doi\":\"10.1016/j.engappai.2025.111080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.</div><div>When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"155 \",\"pages\":\"Article 111080\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625010814\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625010814","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

生成对抗网络（GAN）的技术进步促进了高度逼真的伪造图像的创建。这反过来又为展示攻击铺平了道路，这种攻击被称为欺骗攻击，目标是一系列生物识别系统，包括面部、指纹、静脉和虹膜。为了降低这些风险，人们一直致力于开发呈现攻击检测（PAD）方法，专门用于识别由虹膜识别领域的GAN技术生成的假虹膜图像。然而，现有研究中使用的大多数方法都没有考虑频域的差异，尽管频域的差异使假虹膜图像更容易与真（实）虹膜图像区分。为了解决这些问题并创建更鲁棒的PAD模型，我们提出了一种基于并行全局和局部注意力视觉转换器的生成对抗网络（PGLAV-GAN），该网络考虑了全局和局部上下文，并可以通过傅里叶变换（FT）损失生成具有与真实（真实）图像相似频域特征的假虹膜图像。PGLAV-GAN中的生成器旨在通过将嵌套u型网络（UNet++）和深度监督作为骨干网架构，最大限度地减少特征信息的丢失，同时允许不同分辨率的特征信息得到有效利用。此外，通过应用各种额外的后处理技术来评估PAD的性能，确认所提出的方法优于最先进的（SOTA）呈现攻击（PA）方法。当使用两个开放数据库对所提出的模型进行实验验证时，对于活体检测-虹膜竞赛2017 (liveet -iris -2017)-Notre Dame数据集，PAD模型的平均分类错误率为1.51%，对于liveet -iris -2017- warsaw数据集，PAD模型的平均分类错误率为1.9588%。这表明该模型优于SOTA模型，生成的假虹膜图像对于PA来说更真实、更复杂，可以使用该方法生成的假虹膜图像构建更鲁棒的PAD模型，从而提高虹膜识别系统的安全级别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Parallel-wise global and local attention vision transformer-based generative adversarial network using fourier transform loss for generating fake iris image

The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.

When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.