{"title":"Replay Attention and Data Augmentation Network for 3-D Face and Object Reconstruction","authors":"Zhiyuan Zhou;Lei Li;Suping Wu;Xinyu Li;Kehua Ma;Xitie Zhang","doi":"10.1109/TBIOM.2023.3261272","DOIUrl":null,"url":null,"abstract":"3D face reconstruction from single-view images plays an important role in the field of biometrics, which is a long-standing challenging problem in the wild. Traditional 3DMM-based methods directly regressed parameters, which probably caused that the network learned the discriminative informative features insufficiently. In this paper, we propose a replay attention and data augmentation network (RADAN) for 3D dense alignment and face reconstruction. Unlike the traditional attention mechanism, our replay attention module aims to increase the sensitivity of the network to informative features by adaptively recalibrating the weight response in the attention, which typically reinforces the distinguishability of the learned feature representation. In this way, the network can further improve the accuracy of face reconstruction and dense alignment in unconstrained environments. Moreover, to improve the generalization performance of the model and the ability of the network to capture local details, we present a data augmentation strategy to preprocess the sample data, which generates the images that contain more local details and occluded face in cropping and pasting manner. Furthermore, we also apply the replay attention to 3D object reconstruction task to verify the commonality of this mechanism. Extensive experimental results on widely-evaluated datasets demonstrate that our approach achieves competitive performance compared to state-of-the-art methods. Code is available at \n<uri>https://github.com/zhouzhiyuan1/RADANet</uri>\n.","PeriodicalId":73307,"journal":{"name":"IEEE transactions on biometrics, behavior, and identity science","volume":"5 3","pages":"308-320"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biometrics, behavior, and identity science","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10081017/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
3D face reconstruction from single-view images plays an important role in biometrics and remains a long-standing, challenging problem in the wild. Traditional 3DMM-based methods directly regress model parameters, so the network may learn discriminative, informative features insufficiently. In this paper, we propose a replay attention and data augmentation network (RADAN) for 3D dense alignment and face reconstruction. Unlike traditional attention mechanisms, our replay attention module increases the network's sensitivity to informative features by adaptively recalibrating the weight responses within the attention, which reinforces the distinguishability of the learned feature representation. In this way, the network can further improve the accuracy of face reconstruction and dense alignment in unconstrained environments. Moreover, to improve the generalization of the model and the network's ability to capture local details, we present a data augmentation strategy that preprocesses the training samples, generating images with richer local details and occluded faces in a crop-and-paste manner. Furthermore, we also apply replay attention to the 3D object reconstruction task to verify the generality of this mechanism. Extensive experimental results on widely evaluated datasets demonstrate that our approach achieves competitive performance compared to state-of-the-art methods. Code is available at https://github.com/zhouzhiyuan1/RADANet.
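
To illustrate the idea of adaptively recalibrating attention weight responses, the following is a minimal sketch, not the paper's actual replay attention module (the released code at the repository above is authoritative). It assumes a squeeze-and-excitation-style channel recalibration; the class and parameter names are illustrative only.

```python
# Minimal sketch of channel-wise weight recalibration (an assumption, not RADAN's code).
import torch
import torch.nn as nn

class RecalibratedAttention(nn.Module):
    """Global context -> per-channel weights -> rescaled features."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)              # squeeze spatial dims
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),                                # weights in (0, 1)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w                                     # recalibrated feature map

if __name__ == "__main__":
    feat = torch.randn(2, 64, 56, 56)
    print(RecalibratedAttention(64)(feat).shape)         # torch.Size([2, 64, 56, 56])
```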
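Likewise, a possible form of the crop-and-paste augmentation mentioned in the abstract is sketched below: a random patch from one image is pasted onto another to simulate occlusion. The function name, patch-size fractions, and the assumption that both images share the same resolution are illustrative, not taken from the paper.

```python
# Hypothetical crop-and-paste occlusion augmentation (assumed details, not the authors' code).
import random
import numpy as np

def crop_and_paste(target: np.ndarray, source: np.ndarray,
                   min_frac: float = 0.1, max_frac: float = 0.3) -> np.ndarray:
    """Paste a random rectangular patch from `source` onto `target`.
    Both images are (H, W, C) arrays of the same size."""
    h, w = target.shape[:2]
    ph = random.randint(int(min_frac * h), int(max_frac * h))   # patch height
    pw = random.randint(int(min_frac * w), int(max_frac * w))   # patch width
    sy, sx = random.randint(0, h - ph), random.randint(0, w - pw)  # crop location
    ty, tx = random.randint(0, h - ph), random.randint(0, w - pw)  # paste location
    out = target.copy()
    out[ty:ty + ph, tx:tx + pw] = source[sy:sy + ph, sx:sx + pw]
    return out
```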