提高人脸识别性能:深度学习模型的综合评估以及带有超参数调整功能的新型集合方法

IF 3.1 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Jana Selvaganesan, B. Sudharani, S. N. Chandra Shekhar, K. Vaishnavi, K. Priyadarsini, K. Srujan Raju, T. Srinivasa Rao
{"title":"提高人脸识别性能:深度学习模型的综合评估以及带有超参数调整功能的新型集合方法","authors":"Jana Selvaganesan, B. Sudharani, S. N. Chandra Shekhar, K. Vaishnavi, K. Priyadarsini, K. Srujan Raju, T. Srinivasa Rao","doi":"10.1007/s00500-024-09954-y","DOIUrl":null,"url":null,"abstract":"<p>In response to growing security concerns and the increasing demand for face recognition (FR) technology in various sectors, this research explores the application of deep learning techniques, specifically pre-trained Convolutional Neural Network (CNN) models, in the field of FR. The study harnesses the power of five pre-trained CNN models—DenseNet201, ResNet152V2, MobileNetV2, SeResNeXt, and Xception—for robust feature extraction, followed by SoftMax classification. A novel weighted average ensemble model, meticulously optimized through a grid search technique, is introduced to augment feature extraction and classification efficacy. Emphasizing the significance of robust data pre-processing, encompassing resizing, data augmentation, splitting, and normalization, the research endeavors to fortify the reliability of FR systems. Methodologically, the study systematically investigates hyperparameters across deep learning models, fine-tuning network depth, learning rate, activation functions, and optimization methods. Comprehensive evaluations unfold across diverse datasets to discern the effectiveness of the proposed models. Key contributions of this work encompass the utilization of pre-trained CNN models for feature extraction, extensive evaluation across multiple datasets, the introduction of a weighted average ensemble model, emphasis on robust data pre-processing, systematic hyperparameter tuning, and the utilization of comprehensive evaluation metrics. The results, meticulously analyzed, unveil the superior performance of the proposed method, consistently outshining alternative models across pivotal metrics, including Recall, Precision, F1 Score, Matthews Correlation Coefficient (MCC), and Accuracy. Notably, the proposed method attains an exceptional accuracy of 99.48% on the labeled faces in the wild (LFW) dataset, surpassing erstwhile state-of-the-art benchmarks. This research represents a significant stride in FR technology, furnishing a dependable and accurate solution fortified by empirical substantiation. The proposed method showcases the potential of pre-trained CNN models, ensemble learning, robust data pre-processing, and hyperparameter tuning in augmenting the accuracy and reliability of FR systems, with far-reaching implications for real-world applications.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"103 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing face recognition performance: a comprehensive evaluation of deep learning models and a novel ensemble approach with hyperparameter tuning\",\"authors\":\"Jana Selvaganesan, B. Sudharani, S. N. Chandra Shekhar, K. Vaishnavi, K. Priyadarsini, K. Srujan Raju, T. Srinivasa Rao\",\"doi\":\"10.1007/s00500-024-09954-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In response to growing security concerns and the increasing demand for face recognition (FR) technology in various sectors, this research explores the application of deep learning techniques, specifically pre-trained Convolutional Neural Network (CNN) models, in the field of FR. The study harnesses the power of five pre-trained CNN models—DenseNet201, ResNet152V2, MobileNetV2, SeResNeXt, and Xception—for robust feature extraction, followed by SoftMax classification. A novel weighted average ensemble model, meticulously optimized through a grid search technique, is introduced to augment feature extraction and classification efficacy. Emphasizing the significance of robust data pre-processing, encompassing resizing, data augmentation, splitting, and normalization, the research endeavors to fortify the reliability of FR systems. Methodologically, the study systematically investigates hyperparameters across deep learning models, fine-tuning network depth, learning rate, activation functions, and optimization methods. Comprehensive evaluations unfold across diverse datasets to discern the effectiveness of the proposed models. Key contributions of this work encompass the utilization of pre-trained CNN models for feature extraction, extensive evaluation across multiple datasets, the introduction of a weighted average ensemble model, emphasis on robust data pre-processing, systematic hyperparameter tuning, and the utilization of comprehensive evaluation metrics. The results, meticulously analyzed, unveil the superior performance of the proposed method, consistently outshining alternative models across pivotal metrics, including Recall, Precision, F1 Score, Matthews Correlation Coefficient (MCC), and Accuracy. Notably, the proposed method attains an exceptional accuracy of 99.48% on the labeled faces in the wild (LFW) dataset, surpassing erstwhile state-of-the-art benchmarks. This research represents a significant stride in FR technology, furnishing a dependable and accurate solution fortified by empirical substantiation. The proposed method showcases the potential of pre-trained CNN models, ensemble learning, robust data pre-processing, and hyperparameter tuning in augmenting the accuracy and reliability of FR systems, with far-reaching implications for real-world applications.</p>\",\"PeriodicalId\":22039,\"journal\":{\"name\":\"Soft Computing\",\"volume\":\"103 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s00500-024-09954-y\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00500-024-09954-y","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

为了应对日益增长的安全问题以及各行各业对人脸识别技术日益增长的需求,本研究探索了深度学习技术,特别是预训练卷积神经网络(CNN)模型在人脸识别领域的应用。该研究利用了五个预训练 CNN 模型--DenseNet201、ResNet152V2、MobileNetV2、SeResNeXt 和 Xception--的强大功能来进行鲁棒特征提取,然后进行 SoftMax 分类。为了提高特征提取和分类的效率,还引入了一种通过网格搜索技术精心优化的新型加权平均集合模型。研究强调了稳健的数据预处理(包括调整大小、数据增强、分割和归一化)的重要性,致力于加强 FR 系统的可靠性。在方法上,该研究系统地研究了深度学习模型的超参数,微调了网络深度、学习率、激活函数和优化方法。在不同的数据集上展开综合评估,以辨别所建议模型的有效性。这项工作的主要贡献包括:利用预训练的 CNN 模型提取特征、在多个数据集上进行广泛评估、引入加权平均集合模型、强调稳健的数据预处理、系统的超参数调整以及利用综合评估指标。经过细致分析,结果揭示了所提方法的卓越性能,在召回率、精确度、F1 分数、马修斯相关系数 (MCC) 和准确率等关键指标上始终优于其他模型。值得注意的是,所提出的方法在标注了野生人脸(LFW)的数据集上达到了 99.48% 的超高准确率,超过了以往最先进的基准。这项研究标志着 FR 技术取得了重大进展,提供了一个可靠、准确的解决方案,并得到了经验证明。所提出的方法展示了预训练 CNN 模型、集合学习、稳健的数据预处理和超参数调整在提高 FR 系统的准确性和可靠性方面的潜力,对现实世界的应用具有深远影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Enhancing face recognition performance: a comprehensive evaluation of deep learning models and a novel ensemble approach with hyperparameter tuning

Enhancing face recognition performance: a comprehensive evaluation of deep learning models and a novel ensemble approach with hyperparameter tuning

In response to growing security concerns and the increasing demand for face recognition (FR) technology in various sectors, this research explores the application of deep learning techniques, specifically pre-trained Convolutional Neural Network (CNN) models, in the field of FR. The study harnesses the power of five pre-trained CNN models—DenseNet201, ResNet152V2, MobileNetV2, SeResNeXt, and Xception—for robust feature extraction, followed by SoftMax classification. A novel weighted average ensemble model, meticulously optimized through a grid search technique, is introduced to augment feature extraction and classification efficacy. Emphasizing the significance of robust data pre-processing, encompassing resizing, data augmentation, splitting, and normalization, the research endeavors to fortify the reliability of FR systems. Methodologically, the study systematically investigates hyperparameters across deep learning models, fine-tuning network depth, learning rate, activation functions, and optimization methods. Comprehensive evaluations unfold across diverse datasets to discern the effectiveness of the proposed models. Key contributions of this work encompass the utilization of pre-trained CNN models for feature extraction, extensive evaluation across multiple datasets, the introduction of a weighted average ensemble model, emphasis on robust data pre-processing, systematic hyperparameter tuning, and the utilization of comprehensive evaluation metrics. The results, meticulously analyzed, unveil the superior performance of the proposed method, consistently outshining alternative models across pivotal metrics, including Recall, Precision, F1 Score, Matthews Correlation Coefficient (MCC), and Accuracy. Notably, the proposed method attains an exceptional accuracy of 99.48% on the labeled faces in the wild (LFW) dataset, surpassing erstwhile state-of-the-art benchmarks. This research represents a significant stride in FR technology, furnishing a dependable and accurate solution fortified by empirical substantiation. The proposed method showcases the potential of pre-trained CNN models, ensemble learning, robust data pre-processing, and hyperparameter tuning in augmenting the accuracy and reliability of FR systems, with far-reaching implications for real-world applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Soft Computing
Soft Computing 工程技术-计算机:跨学科应用
CiteScore
8.10
自引率
9.80%
发文量
927
审稿时长
7.3 months
期刊介绍: Soft Computing is dedicated to system solutions based on soft computing techniques. It provides rapid dissemination of important results in soft computing technologies, a fusion of research in evolutionary algorithms and genetic programming, neural science and neural net systems, fuzzy set theory and fuzzy systems, and chaos theory and chaotic systems. Soft Computing encourages the integration of soft computing techniques and tools into both everyday and advanced applications. By linking the ideas and techniques of soft computing with other disciplines, the journal serves as a unifying platform that fosters comparisons, extensions, and new applications. As a result, the journal is an international forum for all scientists and engineers engaged in research and development in this fast growing field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信