基于傅立叶变换损失的并行全局和局部注意力视觉生成对抗网络生成假虹膜图像

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Jung Soo Kim, Jin Seong Hong, Seung Gu Kim, Kang Ryoung Park
{"title":"基于傅立叶变换损失的并行全局和局部注意力视觉生成对抗网络生成假虹膜图像","authors":"Jung Soo Kim,&nbsp;Jin Seong Hong,&nbsp;Seung Gu Kim,&nbsp;Kang Ryoung Park","doi":"10.1016/j.engappai.2025.111080","DOIUrl":null,"url":null,"abstract":"<div><div>The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.</div><div>When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"155 ","pages":"Article 111080"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Parallel-wise global and local attention vision transformer-based generative adversarial network using fourier transform loss for generating fake iris image\",\"authors\":\"Jung Soo Kim,&nbsp;Jin Seong Hong,&nbsp;Seung Gu Kim,&nbsp;Kang Ryoung Park\",\"doi\":\"10.1016/j.engappai.2025.111080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.</div><div>When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"155 \",\"pages\":\"Article 111080\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625010814\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625010814","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

生成对抗网络(GAN)的技术进步促进了高度逼真的伪造图像的创建。这反过来又为展示攻击铺平了道路,这种攻击被称为欺骗攻击,目标是一系列生物识别系统,包括面部、指纹、静脉和虹膜。为了降低这些风险,人们一直致力于开发呈现攻击检测(PAD)方法,专门用于识别由虹膜识别领域的GAN技术生成的假虹膜图像。然而,现有研究中使用的大多数方法都没有考虑频域的差异,尽管频域的差异使假虹膜图像更容易与真(实)虹膜图像区分。为了解决这些问题并创建更鲁棒的PAD模型,我们提出了一种基于并行全局和局部注意力视觉转换器的生成对抗网络(PGLAV-GAN),该网络考虑了全局和局部上下文,并可以通过傅里叶变换(FT)损失生成具有与真实(真实)图像相似频域特征的假虹膜图像。PGLAV-GAN中的生成器旨在通过将嵌套u型网络(UNet++)和深度监督作为骨干网架构,最大限度地减少特征信息的丢失,同时允许不同分辨率的特征信息得到有效利用。此外,通过应用各种额外的后处理技术来评估PAD的性能,确认所提出的方法优于最先进的(SOTA)呈现攻击(PA)方法。当使用两个开放数据库对所提出的模型进行实验验证时,对于活体检测-虹膜竞赛2017 (liveet -iris -2017)-Notre Dame数据集,PAD模型的平均分类错误率为1.51%,对于liveet -iris -2017- warsaw数据集,PAD模型的平均分类错误率为1.9588%。这表明该模型优于SOTA模型,生成的假虹膜图像对于PA来说更真实、更复杂,可以使用该方法生成的假虹膜图像构建更鲁棒的PAD模型,从而提高虹膜识别系统的安全级别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Parallel-wise global and local attention vision transformer-based generative adversarial network using fourier transform loss for generating fake iris image
The technological progress in generative adversarial network (GAN) has facilitated the creation of highly realistic counterfeit images. This, in turn, has paved the way for presentation attacks, named as spoof attacks, targeting a range of biometric recognition systems, including those for faces, fingerprints, veins, and irises. To mitigate these risks, significant efforts have been directed toward developing presentation attack detection (PAD) methods specifically tailored to identify fake iris images generated by the technology of GAN in the field of iris recognition. However, most of them used in the existing research do not consider the difference in the frequency domain although the difference in frequency domain makes the fake iris image easier to distinguish from the true (real) one. To tackle these problems and create a more robust PAD model, we propose a parallel-wise global and local attention vision transformer-based generative adversarial network (PGLAV-GAN) that considers both global and local context, and can generate fake iris images with similar frequency domain features to true (real) ones through Fourier transform (FT) loss. The generator in PGLAV-GAN is designed to minimize the loss of feature information while allowing feature information at different resolutions to be effectively utilized, through the incorporation of nested U-shaped network (UNet++) and deep supervision as the backbone network architecture. Moreover, the performance of PAD was assessed by applying various additional post-processing techniques, confirming that the proposed method surpassed the state-of-the-art (SOTA) presentation attack (PA) methods.
When the proposed model was validated through an experiment using two open databases, the average classification error rate of the PAD model was 1.51 % for the liveness detection-iris competition 2017 (LiveDet-Iris-2017)-Notre Dame dataset and 1.9588 % for the LiveDet-Iris-2017-Warsaw dataset. This indicates that the proposed model outperformed SOTA models, and produced fake iris images that are more authentic and sophisticated for PA, enabling more robust PAD model to be constructed with the fake iris images generated by our method and enhancing the consequent security level of iris recognition system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信