DifFace: Blind Face Restoration with Diffused Error Contraction.

IEEE transactions on pattern analysis and machine intelligence Pub Date : 2024-07-23 DOI:10.1109/TPAMI.2024.3432651

Zongsheng Yue, Chen Change Loy

{"title":"DifFace: Blind Face Restoration with Diffused Error Contraction.","authors":"Zongsheng Yue, Chen Change Loy","doi":"10.1109/TPAMI.2024.3432651","DOIUrl":null,"url":null,"abstract":"While deep learning-based methods for blind face restoration have achieved unprecedented success, they still suffer from two major limitations. First, most of them deteriorate when facing complex degradations out of their training data. Second, these methods require multiple constraints, e.g., fidelity, perceptual, and adversarial losses, which require laborious hyper-parameter tuning to stabilize and balance their influences. In this work, we propose a novel method named DifFace that is capable of coping with unseen and complex degradations more gracefully without complicated loss designs. The key of our method is to establish a posterior distribution from the observed low-quality (LQ) image to its high-quality (HQ) counterpart. In particular, we design a transition distribution from the LQ image to the intermediate state of a pre-trained diffusion model and then gradually transmit from this intermediate state to the HQ target by recursively applying a pre-trained diffusion model. The transition distribution only relies on a restoration backbone that is trained with L1 loss on some synthetic data, which favorably avoids the cumbersome training process in existing methods. Moreover, the transition distribution can contract the error of the restoration backbone and thus makes our method more robust to unknown degradations. Comprehensive experiments show that DifFace is superior to current state-of-the-art methods, especially in cases with severe degradations. Code and model are available at https://github.com/zsyOAOA/DifFace.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TPAMI.2024.3432651","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

While deep learning-based methods for blind face restoration have achieved unprecedented success, they still suffer from two major limitations. First, most of them deteriorate when facing complex degradations out of their training data. Second, these methods require multiple constraints, e.g., fidelity, perceptual, and adversarial losses, which require laborious hyper-parameter tuning to stabilize and balance their influences. In this work, we propose a novel method named DifFace that is capable of coping with unseen and complex degradations more gracefully without complicated loss designs. The key of our method is to establish a posterior distribution from the observed low-quality (LQ) image to its high-quality (HQ) counterpart. In particular, we design a transition distribution from the LQ image to the intermediate state of a pre-trained diffusion model and then gradually transmit from this intermediate state to the HQ target by recursively applying a pre-trained diffusion model. The transition distribution only relies on a restoration backbone that is trained with L₁ loss on some synthetic data, which favorably avoids the cumbersome training process in existing methods. Moreover, the transition distribution can contract the error of the restoration backbone and thus makes our method more robust to unknown degradations. Comprehensive experiments show that DifFace is superior to current state-of-the-art methods, especially in cases with severe degradations. Code and model are available at https://github.com/zsyOAOA/DifFace.

查看原文本刊更多论文

DifFace：利用漫反射误差收缩进行盲人面容修复

虽然基于深度学习的盲目人脸修复方法取得了前所未有的成功，但它们仍然存在两大局限性。首先，当训练数据出现复杂退化时，大多数方法的性能都会下降。其次，这些方法需要多重约束，如保真度、感知和对抗损失，这就需要费力地进行超参数调整，以稳定和平衡它们的影响。在这项工作中，我们提出了一种名为 "DifFace "的新方法，该方法无需复杂的损失设计，就能更从容地应对看不见的复杂退化。我们方法的关键在于建立一个从观测到的低质量（LQ）图像到其高质量（HQ）对应图像的后验分布。具体来说，我们设计了一个从低质量图像到预训练扩散模型中间状态的过渡分布，然后通过递归应用预训练扩散模型，从中间状态逐步传输到高质量目标。过渡分布只依赖于在一些合成数据上进行 L1 损失训练的恢复骨干，这就避免了现有方法中繁琐的训练过程。此外，过渡分布还能缩小恢复骨干的误差，从而使我们的方法对未知退化具有更强的鲁棒性。综合实验表明，DifFace 优于目前最先进的方法，尤其是在严重退化的情况下。代码和模型见 https://github.com/zsyOAOA/DifFace。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量

文献相关原料

公司名称	产品信息	采购帮参考价格