{"title":"Deep Guard:一种增强的混合集成分类器,用于人脸呈现攻击检测,将Gabor和二值化统计图像特征描述符与深度学习相结合","authors":"Aparna Santra Biswas , Somnath Dey , Sanskar Verma , Khushi Verma","doi":"10.1016/j.compeleceng.2025.110566","DOIUrl":null,"url":null,"abstract":"<div><div>Facial recognition systems are widely used in various real-world applications due to their reliability and convenience. However, attackers exploit these systems by mimicking bona fide user traits to gain unauthorized access. This emphasizes the need for effective countermeasures to be integrated into face-based authentication systems. Face presentation attack detection methods encounter several challenges such as illumination variations and noisy input images which limit the performance of the attack detection methods, particularly on unseen data. In this paper, we introduce Deep Guard, a hybrid framework that combines handcrafted texture descriptors with advanced deep learning techniques. The framework utilizes an ensemble of different classifiers to leverage their complementary strengths. The first classifier applies Binarized Statistical Image Features (BSIF) and a Multilayer Perceptron (MLP) to capture fine-grained texture details. The second classifier combines EfficientNet-B0 with ConvMixer layers and a CBAM attention mechanism to enhance feature representation and improve perceptual capabilities. The third classifier uses Gabor filters as convolutional layers with a deep network which is used in second classifier to refine edges and increase robustness to illumination and noise. The outputs from these classifiers are fused using a soft voting mechanism to classify facial images as real or fake. We evaluate the proposed framework on six publicly available datasets CASIA-FASD, Replay-Attack, 3DMAD, ROSE-Youtu, OULU-NPU, and MSU-MFSD. The results demonstrate that Deep Guard outperforms most state-of-the-art methods in intra-dataset testing and achieves strong generalization performance in cross-dataset single source training and testing scenarios, with an average HTER of 25.78% for HybridNet I, which combines all three classifiers and 27.96% for HybridNet II, combining classifiers two and three. It also achieves an AUC of 98.65% for cross-dataset evaluation with multiple-source training and single-source testing (O&C&I <span><math><mo>→</mo></math></span> M).</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"127 ","pages":"Article 110566"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Guard: An enhanced hybrid ensemble classifier for face presentation attack detection integrating Gabor and binarized statistical image features descriptors with deep learning\",\"authors\":\"Aparna Santra Biswas , Somnath Dey , Sanskar Verma , Khushi Verma\",\"doi\":\"10.1016/j.compeleceng.2025.110566\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Facial recognition systems are widely used in various real-world applications due to their reliability and convenience. However, attackers exploit these systems by mimicking bona fide user traits to gain unauthorized access. This emphasizes the need for effective countermeasures to be integrated into face-based authentication systems. Face presentation attack detection methods encounter several challenges such as illumination variations and noisy input images which limit the performance of the attack detection methods, particularly on unseen data. In this paper, we introduce Deep Guard, a hybrid framework that combines handcrafted texture descriptors with advanced deep learning techniques. The framework utilizes an ensemble of different classifiers to leverage their complementary strengths. The first classifier applies Binarized Statistical Image Features (BSIF) and a Multilayer Perceptron (MLP) to capture fine-grained texture details. The second classifier combines EfficientNet-B0 with ConvMixer layers and a CBAM attention mechanism to enhance feature representation and improve perceptual capabilities. The third classifier uses Gabor filters as convolutional layers with a deep network which is used in second classifier to refine edges and increase robustness to illumination and noise. The outputs from these classifiers are fused using a soft voting mechanism to classify facial images as real or fake. We evaluate the proposed framework on six publicly available datasets CASIA-FASD, Replay-Attack, 3DMAD, ROSE-Youtu, OULU-NPU, and MSU-MFSD. The results demonstrate that Deep Guard outperforms most state-of-the-art methods in intra-dataset testing and achieves strong generalization performance in cross-dataset single source training and testing scenarios, with an average HTER of 25.78% for HybridNet I, which combines all three classifiers and 27.96% for HybridNet II, combining classifiers two and three. It also achieves an AUC of 98.65% for cross-dataset evaluation with multiple-source training and single-source testing (O&C&I <span><math><mo>→</mo></math></span> M).</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"127 \",\"pages\":\"Article 110566\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625005099\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625005099","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Deep Guard: An enhanced hybrid ensemble classifier for face presentation attack detection integrating Gabor and binarized statistical image features descriptors with deep learning
Facial recognition systems are widely used in various real-world applications due to their reliability and convenience. However, attackers exploit these systems by mimicking bona fide user traits to gain unauthorized access. This emphasizes the need for effective countermeasures to be integrated into face-based authentication systems. Face presentation attack detection methods encounter several challenges such as illumination variations and noisy input images which limit the performance of the attack detection methods, particularly on unseen data. In this paper, we introduce Deep Guard, a hybrid framework that combines handcrafted texture descriptors with advanced deep learning techniques. The framework utilizes an ensemble of different classifiers to leverage their complementary strengths. The first classifier applies Binarized Statistical Image Features (BSIF) and a Multilayer Perceptron (MLP) to capture fine-grained texture details. The second classifier combines EfficientNet-B0 with ConvMixer layers and a CBAM attention mechanism to enhance feature representation and improve perceptual capabilities. The third classifier uses Gabor filters as convolutional layers with a deep network which is used in second classifier to refine edges and increase robustness to illumination and noise. The outputs from these classifiers are fused using a soft voting mechanism to classify facial images as real or fake. We evaluate the proposed framework on six publicly available datasets CASIA-FASD, Replay-Attack, 3DMAD, ROSE-Youtu, OULU-NPU, and MSU-MFSD. The results demonstrate that Deep Guard outperforms most state-of-the-art methods in intra-dataset testing and achieves strong generalization performance in cross-dataset single source training and testing scenarios, with an average HTER of 25.78% for HybridNet I, which combines all three classifiers and 27.96% for HybridNet II, combining classifiers two and three. It also achieves an AUC of 98.65% for cross-dataset evaluation with multiple-source training and single-source testing (O&C&I M).
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.