WaveMixSR-V2：以更高的效率增强超分辨率

arXiv - EE - Image and Video Processing Pub Date : 2024-09-16 DOI:arxiv-2409.10582

Pranav Jeevan, Neeraj Nixon, Amit Sethi

{"title":"WaveMixSR-V2：以更高的效率增强超分辨率","authors":"Pranav Jeevan, Neeraj Nixon, Amit Sethi","doi":"arxiv-2409.10582","DOIUrl":null,"url":null,"abstract":"Recent advancements in single image super-resolution have been predominantly\ndriven by token mixers and transformer architectures. WaveMixSR utilized the\nWaveMix architecture, employing a two-dimensional discrete wavelet transform\nfor spatial token mixing, achieving superior performance in super-resolution\ntasks with remarkable resource efficiency. In this work, we present an enhanced\nversion of the WaveMixSR architecture by (1) replacing the traditional\ntranspose convolution layer with a pixel shuffle operation and (2) implementing\na multistage design for higher resolution tasks ($4\\times$). Our experiments\ndemonstrate that our enhanced model -- WaveMixSR-V2 -- outperforms other\narchitectures in multiple super-resolution tasks, achieving state-of-the-art\nfor the BSD100 dataset, while also consuming fewer resources, exhibits higher\nparameter efficiency, lower latency and higher throughput. Our code is\navailable at https://github.com/pranavphoenix/WaveMixSR.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency\",\"authors\":\"Pranav Jeevan, Neeraj Nixon, Amit Sethi\",\"doi\":\"arxiv-2409.10582\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent advancements in single image super-resolution have been predominantly\\ndriven by token mixers and transformer architectures. WaveMixSR utilized the\\nWaveMix architecture, employing a two-dimensional discrete wavelet transform\\nfor spatial token mixing, achieving superior performance in super-resolution\\ntasks with remarkable resource efficiency. In this work, we present an enhanced\\nversion of the WaveMixSR architecture by (1) replacing the traditional\\ntranspose convolution layer with a pixel shuffle operation and (2) implementing\\na multistage design for higher resolution tasks ($4\\\\times$). Our experiments\\ndemonstrate that our enhanced model -- WaveMixSR-V2 -- outperforms other\\narchitectures in multiple super-resolution tasks, achieving state-of-the-art\\nfor the BSD100 dataset, while also consuming fewer resources, exhibits higher\\nparameter efficiency, lower latency and higher throughput. Our code is\\navailable at https://github.com/pranavphoenix/WaveMixSR.\",\"PeriodicalId\":501289,\"journal\":{\"name\":\"arXiv - EE - Image and Video Processing\",\"volume\":\"18 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Image and Video Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.10582\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

单图像超分辨率的最新进展主要是由令牌混合器和变换器架构推动的。WaveMixSR 利用 WaveMix 架构，采用二维离散小波变换进行空间令牌混合，在超分辨率任务中实现了卓越的性能和显著的资源效率。在这项工作中，我们提出了 WaveMixSR 架构的增强版本，具体做法是：（1）用像素洗牌操作取代传统的跨距卷积层；（2）针对更高分辨率任务（4 美元/次）实施多级设计。我们的实验证明，我们的增强型模型--WaveMixSR-V2--在多个超分辨率任务中的表现优于其他架构，在 BSD100 数据集上达到了最先进水平，同时还消耗更少的资源，表现出更高的参数效率、更低的延迟和更高的吞吐量。我们的代码见 https://github.com/pranavphoenix/WaveMixSR。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency

Recent advancements in single image super-resolution have been predominantly driven by token mixers and transformer architectures. WaveMixSR utilized the WaveMix architecture, employing a two-dimensional discrete wavelet transform for spatial token mixing, achieving superior performance in super-resolution tasks with remarkable resource efficiency. In this work, we present an enhanced version of the WaveMixSR architecture by (1) replacing the traditional transpose convolution layer with a pixel shuffle operation and (2) implementing a multistage design for higher resolution tasks ($4\times$). Our experiments demonstrate that our enhanced model -- WaveMixSR-V2 -- outperforms other architectures in multiple super-resolution tasks, achieving state-of-the-art for the BSD100 dataset, while also consuming fewer resources, exhibits higher parameter efficiency, lower latency and higher throughput. Our code is available at https://github.com/pranavphoenix/WaveMixSR.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - EE - Image and Video Processing

自引率

0.00%

发文量