The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image Prior.

Proceedings. IEEE International Conference on Computer Vision Pub Date : 2023-10-01 Epub Date: 2024-01-15 DOI:10.1109/ICCV51070.2023.01140

Yilin Liu, Jiang Li, Yunkui Pang, Dong Nie, Pew-Thian Yap

{"title":"The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image Prior.","authors":"Yilin Liu, Jiang Li, Yunkui Pang, Dong Nie, Pew-Thian Yap","doi":"10.1109/ICCV51070.2023.01140","DOIUrl":null,"url":null,"abstract":"<p><p>Deep Image Prior (DIP) shows that some network architectures inherently tend towards generating smooth images while resisting noise, a phenomenon known as spectral bias. Image denoising is a natural application of this property. Although denoising with DIP mitigates the need for large training sets, two often intertwined practical challenges need to be overcome: architectural design and noise fitting. Existing methods either handcraft or search for suitable architectures from a vast design space, due to the limited understanding of how architectural choices affect the denoising outcome. In this study, we demonstrate from a frequency perspective that unlearnt upsampling is the main driving force behind the denoising phenomenon with DIP. This finding leads to straightforward strategies for identifying a suitable architecture for every image without laborious search. Extensive experiments show that the estimated architectures achieve superior denoising results than existing methods with up to 95% fewer parameters. Thanks to this under-parameterization, the resulting architectures are less prone to noise-fitting.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2023 ","pages":"12374-12383"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11078028/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE International Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV51070.2023.01140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/15 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Deep Image Prior (DIP) shows that some network architectures inherently tend towards generating smooth images while resisting noise, a phenomenon known as spectral bias. Image denoising is a natural application of this property. Although denoising with DIP mitigates the need for large training sets, two often intertwined practical challenges need to be overcome: architectural design and noise fitting. Existing methods either handcraft or search for suitable architectures from a vast design space, due to the limited understanding of how architectural choices affect the denoising outcome. In this study, we demonstrate from a frequency perspective that unlearnt upsampling is the main driving force behind the denoising phenomenon with DIP. This finding leads to straightforward strategies for identifying a suitable architecture for every image without laborious search. Extensive experiments show that the estimated architectures achieve superior denoising results than existing methods with up to 95% fewer parameters. Thanks to this under-parameterization, the resulting architectures are less prone to noise-fitting.

查看原文本刊更多论文

魔鬼就在升采样中：利用深度图像先验简化去噪的架构决策

深度图像优先（DIP）表明，某些网络架构天生倾向于生成平滑图像，同时抵御噪声，这种现象被称为光谱偏差。图像去噪就是这一特性的自然应用。虽然使用 DIP 去噪可以减少对大型训练集的需求，但仍需要克服两个经常交织在一起的实际挑战：架构设计和噪声拟合。由于对架构选择如何影响去噪结果的了解有限，现有方法要么是手工制作，要么是从广阔的设计空间中寻找合适的架构。在本研究中，我们从频率的角度证明，未学习的上采样是 DIP 去噪现象背后的主要驱动力。这一发现为我们提供了无需费力搜索即可为每幅图像确定合适架构的直接策略。广泛的实验表明，估算出的架构比现有方法的去噪效果更好，参数数量最多可减少 95%。得益于这种低参数化，所得到的架构不易受噪声拟合的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. IEEE International Conference on Computer Vision

自引率

0.00%

发文量