{"title":"基于特征注意学习的人脸反欺骗动态残差蒸馏网络","authors":"Yan He;Fei Peng;Min Long","doi":"10.1109/TBIOM.2023.3312128","DOIUrl":null,"url":null,"abstract":"Currently, most face anti-spoofing methods target the generalization problem by relying on auxiliary information such as additional annotations and modalities. However, this auxiliary information is unavailable in practical scenarios, which potentially hinders the application of these methods. Meanwhile, the predetermined or fixed characteristics limit their generalization capability. To countermeasure these problems, a dynamic residual distillation network with feature attention learning (DRDN) is developed to adaptively search discriminative representation and embedding space without accessing any auxiliary information. Specifically, a pixel-level residual distillation module is first designed to obtain domain-irrelevant liveness representation by suppressing both the high-level semantic and low-frequency illumination factors, thus the domain divergence between the source and target domains can be adaptively mitigated. Secondly, a feature-level attention contrastive learning is proposed to construct a distance-aware asymmetrical embedding space to avoid the class boundary over-fitting. Finally, an attention enhancement backbone incorporated with attention blocks is designed for automatically capturing important regions and channels in feature extraction. Experimental results and analysis demonstrate that the proposed method outperforms the state-of-the-art anti-spoofing methods in both single-source and multi-source domain generalization scenarios.","PeriodicalId":73307,"journal":{"name":"IEEE transactions on biometrics, behavior, and identity science","volume":"5 4","pages":"579-592"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic Residual Distillation Network for Face Anti-Spoofing With Feature Attention Learning\",\"authors\":\"Yan He;Fei Peng;Min Long\",\"doi\":\"10.1109/TBIOM.2023.3312128\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Currently, most face anti-spoofing methods target the generalization problem by relying on auxiliary information such as additional annotations and modalities. However, this auxiliary information is unavailable in practical scenarios, which potentially hinders the application of these methods. Meanwhile, the predetermined or fixed characteristics limit their generalization capability. To countermeasure these problems, a dynamic residual distillation network with feature attention learning (DRDN) is developed to adaptively search discriminative representation and embedding space without accessing any auxiliary information. Specifically, a pixel-level residual distillation module is first designed to obtain domain-irrelevant liveness representation by suppressing both the high-level semantic and low-frequency illumination factors, thus the domain divergence between the source and target domains can be adaptively mitigated. Secondly, a feature-level attention contrastive learning is proposed to construct a distance-aware asymmetrical embedding space to avoid the class boundary over-fitting. Finally, an attention enhancement backbone incorporated with attention blocks is designed for automatically capturing important regions and channels in feature extraction. Experimental results and analysis demonstrate that the proposed method outperforms the state-of-the-art anti-spoofing methods in both single-source and multi-source domain generalization scenarios.\",\"PeriodicalId\":73307,\"journal\":{\"name\":\"IEEE transactions on biometrics, behavior, and identity science\",\"volume\":\"5 4\",\"pages\":\"579-592\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on biometrics, behavior, and identity science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10258471/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biometrics, behavior, and identity science","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10258471/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dynamic Residual Distillation Network for Face Anti-Spoofing With Feature Attention Learning
Currently, most face anti-spoofing methods target the generalization problem by relying on auxiliary information such as additional annotations and modalities. However, this auxiliary information is unavailable in practical scenarios, which potentially hinders the application of these methods. Meanwhile, the predetermined or fixed characteristics limit their generalization capability. To countermeasure these problems, a dynamic residual distillation network with feature attention learning (DRDN) is developed to adaptively search discriminative representation and embedding space without accessing any auxiliary information. Specifically, a pixel-level residual distillation module is first designed to obtain domain-irrelevant liveness representation by suppressing both the high-level semantic and low-frequency illumination factors, thus the domain divergence between the source and target domains can be adaptively mitigated. Secondly, a feature-level attention contrastive learning is proposed to construct a distance-aware asymmetrical embedding space to avoid the class boundary over-fitting. Finally, an attention enhancement backbone incorporated with attention blocks is designed for automatically capturing important regions and channels in feature extraction. Experimental results and analysis demonstrate that the proposed method outperforms the state-of-the-art anti-spoofing methods in both single-source and multi-source domain generalization scenarios.