Deep Guard：一种增强的混合集成分类器，用于人脸呈现攻击检测，将Gabor和二值化统计图像特征描述符与深度学习相结合

IF 4.9 3区计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Computers & Electrical Engineering Pub Date : 2025-07-31 DOI:10.1016/j.compeleceng.2025.110566

Aparna Santra Biswas , Somnath Dey , Sanskar Verma , Khushi Verma

{"title":"Deep Guard：一种增强的混合集成分类器，用于人脸呈现攻击检测，将Gabor和二值化统计图像特征描述符与深度学习相结合","authors":"Aparna Santra Biswas , Somnath Dey , Sanskar Verma , Khushi Verma","doi":"10.1016/j.compeleceng.2025.110566","DOIUrl":null,"url":null,"abstract":"<div><div>Facial recognition systems are widely used in various real-world applications due to their reliability and convenience. However, attackers exploit these systems by mimicking bona fide user traits to gain unauthorized access. This emphasizes the need for effective countermeasures to be integrated into face-based authentication systems. Face presentation attack detection methods encounter several challenges such as illumination variations and noisy input images which limit the performance of the attack detection methods, particularly on unseen data. In this paper, we introduce Deep Guard, a hybrid framework that combines handcrafted texture descriptors with advanced deep learning techniques. The framework utilizes an ensemble of different classifiers to leverage their complementary strengths. The first classifier applies Binarized Statistical Image Features (BSIF) and a Multilayer Perceptron (MLP) to capture fine-grained texture details. The second classifier combines EfficientNet-B0 with ConvMixer layers and a CBAM attention mechanism to enhance feature representation and improve perceptual capabilities. The third classifier uses Gabor filters as convolutional layers with a deep network which is used in second classifier to refine edges and increase robustness to illumination and noise. The outputs from these classifiers are fused using a soft voting mechanism to classify facial images as real or fake. We evaluate the proposed framework on six publicly available datasets CASIA-FASD, Replay-Attack, 3DMAD, ROSE-Youtu, OULU-NPU, and MSU-MFSD. The results demonstrate that Deep Guard outperforms most state-of-the-art methods in intra-dataset testing and achieves strong generalization performance in cross-dataset single source training and testing scenarios, with an average HTER of 25.78% for HybridNet I, which combines all three classifiers and 27.96% for HybridNet II, combining classifiers two and three. It also achieves an AUC of 98.65% for cross-dataset evaluation with multiple-source training and single-source testing (O&C&I <span><math><mo>→</mo></math></span> M).</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"127 ","pages":"Article 110566"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Guard: An enhanced hybrid ensemble classifier for face presentation attack detection integrating Gabor and binarized statistical image features descriptors with deep learning\",\"authors\":\"Aparna Santra Biswas , Somnath Dey , Sanskar Verma , Khushi Verma\",\"doi\":\"10.1016/j.compeleceng.2025.110566\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Facial recognition systems are widely used in various real-world applications due to their reliability and convenience. However, attackers exploit these systems by mimicking bona fide user traits to gain unauthorized access. This emphasizes the need for effective countermeasures to be integrated into face-based authentication systems. Face presentation attack detection methods encounter several challenges such as illumination variations and noisy input images which limit the performance of the attack detection methods, particularly on unseen data. In this paper, we introduce Deep Guard, a hybrid framework that combines handcrafted texture descriptors with advanced deep learning techniques. The framework utilizes an ensemble of different classifiers to leverage their complementary strengths. The first classifier applies Binarized Statistical Image Features (BSIF) and a Multilayer Perceptron (MLP) to capture fine-grained texture details. The second classifier combines EfficientNet-B0 with ConvMixer layers and a CBAM attention mechanism to enhance feature representation and improve perceptual capabilities. The third classifier uses Gabor filters as convolutional layers with a deep network which is used in second classifier to refine edges and increase robustness to illumination and noise. The outputs from these classifiers are fused using a soft voting mechanism to classify facial images as real or fake. We evaluate the proposed framework on six publicly available datasets CASIA-FASD, Replay-Attack, 3DMAD, ROSE-Youtu, OULU-NPU, and MSU-MFSD. The results demonstrate that Deep Guard outperforms most state-of-the-art methods in intra-dataset testing and achieves strong generalization performance in cross-dataset single source training and testing scenarios, with an average HTER of 25.78% for HybridNet I, which combines all three classifiers and 27.96% for HybridNet II, combining classifiers two and three. It also achieves an AUC of 98.65% for cross-dataset evaluation with multiple-source training and single-source testing (O&C&I <span><math><mo>→</mo></math></span> M).</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"127 \",\"pages\":\"Article 110566\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625005099\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625005099","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

摘要

人脸识别系统因其可靠性和便捷性被广泛应用于各种现实应用中。然而，攻击者通过模仿真正的用户特征来利用这些系统来获得未经授权的访问。这强调需要将有效的对策集成到基于人脸的身份验证系统中。人脸呈现攻击检测方法遇到了一些挑战，如光照变化和噪声输入图像，这些限制了攻击检测方法的性能，特别是在看不见的数据上。在本文中，我们介绍了Deep Guard，这是一个混合框架，将手工制作的纹理描述符与先进的深度学习技术相结合。该框架利用不同分类器的集合来利用它们的互补优势。第一个分类器使用二值化统计图像特征（BSIF）和多层感知器（MLP）来捕获细粒度纹理细节。第二个分类器将EfficientNet-B0与ConvMixer层和CBAM注意机制结合起来，以增强特征表示并提高感知能力。第三个分类器使用Gabor滤波器作为卷积层，并在第二个分类器中使用深度网络来细化边缘并增加对光照和噪声的鲁棒性。这些分类器的输出使用软投票机制进行融合，以将面部图像分类为真实或虚假。我们在六个公开可用的数据集CASIA-FASD、Replay-Attack、3DMAD、ROSE-Youtu、OULU-NPU和MSU-MFSD上评估了所提出的框架。结果表明，Deep Guard在数据集内测试中优于大多数最先进的方法，并在跨数据集单源训练和测试场景中实现了强大的泛化性能，混合了所有三个分类器的HybridNet I的平均HTER为25.78%，混合了分类器2和3的HybridNet II的平均HTER为27.96%。对于多源训练和单源测试（O&C&I→M）的跨数据集评估，它的AUC也达到了98.65%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep Guard: An enhanced hybrid ensemble classifier for face presentation attack detection integrating Gabor and binarized statistical image features descriptors with deep learning

Facial recognition systems are widely used in various real-world applications due to their reliability and convenience. However, attackers exploit these systems by mimicking bona fide user traits to gain unauthorized access. This emphasizes the need for effective countermeasures to be integrated into face-based authentication systems. Face presentation attack detection methods encounter several challenges such as illumination variations and noisy input images which limit the performance of the attack detection methods, particularly on unseen data. In this paper, we introduce Deep Guard, a hybrid framework that combines handcrafted texture descriptors with advanced deep learning techniques. The framework utilizes an ensemble of different classifiers to leverage their complementary strengths. The first classifier applies Binarized Statistical Image Features (BSIF) and a Multilayer Perceptron (MLP) to capture fine-grained texture details. The second classifier combines EfficientNet-B0 with ConvMixer layers and a CBAM attention mechanism to enhance feature representation and improve perceptual capabilities. The third classifier uses Gabor filters as convolutional layers with a deep network which is used in second classifier to refine edges and increase robustness to illumination and noise. The outputs from these classifiers are fused using a soft voting mechanism to classify facial images as real or fake. We evaluate the proposed framework on six publicly available datasets CASIA-FASD, Replay-Attack, 3DMAD, ROSE-Youtu, OULU-NPU, and MSU-MFSD. The results demonstrate that Deep Guard outperforms most state-of-the-art methods in intra-dataset testing and achieves strong generalization performance in cross-dataset single source training and testing scenarios, with an average HTER of 25.78% for HybridNet I, which combines all three classifiers and 27.96% for HybridNet II, combining classifiers two and three. It also achieves an AUC of 98.65% for cross-dataset evaluation with multiple-source training and single-source testing (O&C&I

\to

M).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers & Electrical Engineering 工程技术-工程：电子与电气

CiteScore

9.20

自引率

7.00%

发文量

661

审稿时长

47 days

期刊介绍： The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.