Jung Soo Kim, Jin Seong Hong, Seung Gu Kim, Kang Ryoung Park
{"title":"基于傅立叶变换损失的并行全局和局部注意力视觉生成对抗网络生成假虹膜图像","authors":"Jung Soo Kim, Jin Seong Hong, Seung Gu Kim, Kang Ryoung Park","doi":"10.1016/j.engappai.2025.111080","DOIUrl":null,"url":null,"abstract":"<div><div>The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.</div><div>When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"155 ","pages":"Article 111080"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Parallel-wise global and local attention vision transformer-based generative adversarial network using fourier transform loss for generating fake iris image\",\"authors\":\"Jung Soo Kim, Jin Seong Hong, Seung Gu Kim, Kang Ryoung Park\",\"doi\":\"10.1016/j.engappai.2025.111080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.</div><div>When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"155 \",\"pages\":\"Article 111080\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625010814\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625010814","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Parallel-wise global and local attention vision transformer-based generative adversarial network using fourier transform loss for generating fake iris image
The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.
When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.