CG-MCFNet：基于跨层制导的三维人脸识别多尺度相关融合网络

IF 3.5 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence Pub Date : 2025-01-03 DOI:10.1007/s10489-024-06221-3

Panzi Zhao, Yue Ming, Hui Yu, Yuting Hu, Jiangwan Zhou, Yuanan Liu

{"title":"CG-MCFNet：基于跨层制导的三维人脸识别多尺度相关融合网络","authors":"Panzi Zhao, Yue Ming, Hui Yu, Yuting Hu, Jiangwan Zhou, Yuanan Liu","doi":"10.1007/s10489-024-06221-3","DOIUrl":null,"url":null,"abstract":"<div><p>3D face recognition (FR) has been a popular field in recent years, which benefits from the advancement of 3D sensors and the application demands of video surveillance scenes. Existing 3D FR methods could show excellent performance when faces are complete. However, incomplete 3D faces, especially large poses and occluded, may prevent the model to learn effective and strong discriminative facial information adequately, resulting in unsatisfactory recognition results. To address this issue, we propose a cross-layer guidance-based multi-scale correlation fusion network (CG-MCFNet) for 3D FR. Firstly, we design a shallow feature enhancement extraction (SFE) module to obtain more effective facial detail information, and a deep feature enhancement extraction (DFE) module to learn more information with strong discrimination. Secondly, a novel multi-scale feature correlation fusion (MCF) module is proposed for fusing features from different layers, aiming to reduce the interference of redundant features and enhance the acquisition of discriminative features. Finally, the above three modules are integrated to form a new multi-scale local feature extraction (MLFE) module for capturing the face local information of rich and more strong discriminative. In addition, we introduce a global and local feature similarity weighted joint inference strategy, to further improve recognition accuracy. Extensive experiments on six challenging datasets, including three low-quality datasets (Lock3DFace, KinectFaceDB, and IIIT-D, where Lock3DFace is a video dataset), two high-quality datasets (UMB-DB, Bosphorus), and a cross-quality dataset synthesized by Bosphorus, prove that our CG-MCFNet achieves the best performance for incomplete 3D FR, which demonstrates the strong generalization ability of our model.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 4","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CG-MCFNet: cross-layer guidance-based multi-scale correlation fusion network for 3D face recognition\",\"authors\":\"Panzi Zhao, Yue Ming, Hui Yu, Yuting Hu, Jiangwan Zhou, Yuanan Liu\",\"doi\":\"10.1007/s10489-024-06221-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>3D face recognition (FR) has been a popular field in recent years, which benefits from the advancement of 3D sensors and the application demands of video surveillance scenes. Existing 3D FR methods could show excellent performance when faces are complete. However, incomplete 3D faces, especially large poses and occluded, may prevent the model to learn effective and strong discriminative facial information adequately, resulting in unsatisfactory recognition results. To address this issue, we propose a cross-layer guidance-based multi-scale correlation fusion network (CG-MCFNet) for 3D FR. Firstly, we design a shallow feature enhancement extraction (SFE) module to obtain more effective facial detail information, and a deep feature enhancement extraction (DFE) module to learn more information with strong discrimination. Secondly, a novel multi-scale feature correlation fusion (MCF) module is proposed for fusing features from different layers, aiming to reduce the interference of redundant features and enhance the acquisition of discriminative features. Finally, the above three modules are integrated to form a new multi-scale local feature extraction (MLFE) module for capturing the face local information of rich and more strong discriminative. In addition, we introduce a global and local feature similarity weighted joint inference strategy, to further improve recognition accuracy. Extensive experiments on six challenging datasets, including three low-quality datasets (Lock3DFace, KinectFaceDB, and IIIT-D, where Lock3DFace is a video dataset), two high-quality datasets (UMB-DB, Bosphorus), and a cross-quality dataset synthesized by Bosphorus, prove that our CG-MCFNet achieves the best performance for incomplete 3D FR, which demonstrates the strong generalization ability of our model.</p></div>\",\"PeriodicalId\":8041,\"journal\":{\"name\":\"Applied Intelligence\",\"volume\":\"55 4\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10489-024-06221-3\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-024-06221-3","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

得益于3D传感器的进步和视频监控场景的应用需求，3D人脸识别（FR）近年来成为一个热门领域。现有的3D人脸识别方法在人脸完整时表现优异。然而，不完整的三维人脸，特别是大的姿势和遮挡，可能会阻碍模型充分学习有效的、强判别性的人脸信息，导致识别结果不理想。为了解决这一问题，我们提出了一种基于跨层制导的3D人脸识别多尺度相关融合网络（CG-MCFNet）。首先，我们设计了一个浅层特征增强提取（SFE）模块，以获得更有效的人脸细节信息，然后设计了一个深层特征增强提取（DFE）模块，以获得更多的强分辨信息。其次，提出了一种新的多尺度特征相关融合（MCF）模块，用于融合来自不同层的特征，以减少冗余特征的干扰，增强判别特征的获取；最后，将以上三个模块集成，形成一个新的多尺度局部特征提取（MLFE）模块，用于捕获丰富且判别性更强的人脸局部信息。此外，我们引入了全局和局部特征相似度加权联合推理策略，进一步提高了识别精度。在3个低质量数据集（Lock3DFace、KinectFaceDB和IIIT-D，其中Lock3DFace为视频数据集）、2个高质量数据集（mb - db、Bosphorus）和Bosphorus合成的一个跨质量数据集等6个具有挑战性的数据集上进行的大量实验证明，CG-MCFNet在不完全3D FR上取得了最佳性能，证明了我们的模型具有较强的推广能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CG-MCFNet: cross-layer guidance-based multi-scale correlation fusion network for 3D face recognition

3D face recognition (FR) has been a popular field in recent years, which benefits from the advancement of 3D sensors and the application demands of video surveillance scenes. Existing 3D FR methods could show excellent performance when faces are complete. However, incomplete 3D faces, especially large poses and occluded, may prevent the model to learn effective and strong discriminative facial information adequately, resulting in unsatisfactory recognition results. To address this issue, we propose a cross-layer guidance-based multi-scale correlation fusion network (CG-MCFNet) for 3D FR. Firstly, we design a shallow feature enhancement extraction (SFE) module to obtain more effective facial detail information, and a deep feature enhancement extraction (DFE) module to learn more information with strong discrimination. Secondly, a novel multi-scale feature correlation fusion (MCF) module is proposed for fusing features from different layers, aiming to reduce the interference of redundant features and enhance the acquisition of discriminative features. Finally, the above three modules are integrated to form a new multi-scale local feature extraction (MLFE) module for capturing the face local information of rich and more strong discriminative. In addition, we introduce a global and local feature similarity weighted joint inference strategy, to further improve recognition accuracy. Extensive experiments on six challenging datasets, including three low-quality datasets (Lock3DFace, KinectFaceDB, and IIIT-D, where Lock3DFace is a video dataset), two high-quality datasets (UMB-DB, Bosphorus), and a cross-quality dataset synthesized by Bosphorus, prove that our CG-MCFNet achieves the best performance for incomplete 3D FR, which demonstrates the strong generalization ability of our model.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Intelligence 工程技术-计算机：人工智能

CiteScore

6.60

自引率

20.80%

发文量

1361

审稿时长

5.9 months

期刊介绍： With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.