MSTNet: a multi-stage progressive network with local–global transformer fusion for image restoration

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Complex & Intelligent Systems Pub Date : 2025-04-25 DOI:10.1007/s40747-025-01892-y

Ruyu Liu, Lin Wang, Jie He, Jiajia Wang, Jianhua Zhang, Xiufeng Liu, Chaochao Wang, Haoyu Zhang, Sheng Dai

{"title":"MSTNet: a multi-stage progressive network with local–global transformer fusion for image restoration","authors":"Ruyu Liu, Lin Wang, Jie He, Jiajia Wang, Jianhua Zhang, Xiufeng Liu, Chaochao Wang, Haoyu Zhang, Sheng Dai","doi":"10.1007/s40747-025-01892-y","DOIUrl":null,"url":null,"abstract":"<p>Image restoration is a challenging and complex problem involving recovering the original clear image from a degraded or noisy image. In the medical field, image restoration techniques can significantly improve the quality of endoscopic images, helping doctors make more accurate diagnoses and providing higher-quality data support for computer vision-assisted detection. Existing methods for image restoration mainly use convolutional neural networks (CNNs) or Transformer models, which have different advantages and limitations in capturing spatial and channel information of the image. This paper proposes a novel Multi-Stage progressive image restoration Network based on a blend of local–global Transformers, named MSTNet. Our network consists of three stages, each using a different type of Transformer module to obtain local and global information. The first two stages use window-based Transformer modules, which can effectively extract local spatial information within each window. The third stage uses channel-level Transformer modules to capture global channel information across the whole image. We also introduce a fusion module to combine the features from different Transformer branches and obtain a comprehensive and accurate feature representation. We conduct extensive experiments on various image restoration tasks, such as deblurring and denoising, evaluating our approach on both general image restoration datasets and our proposed colon dataset. The results demonstrate the effectiveness and superiority of our network over state-of-the-art methods.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"43 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01892-y","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Image restoration is a challenging and complex problem involving recovering the original clear image from a degraded or noisy image. In the medical field, image restoration techniques can significantly improve the quality of endoscopic images, helping doctors make more accurate diagnoses and providing higher-quality data support for computer vision-assisted detection. Existing methods for image restoration mainly use convolutional neural networks (CNNs) or Transformer models, which have different advantages and limitations in capturing spatial and channel information of the image. This paper proposes a novel Multi-Stage progressive image restoration Network based on a blend of local–global Transformers, named MSTNet. Our network consists of three stages, each using a different type of Transformer module to obtain local and global information. The first two stages use window-based Transformer modules, which can effectively extract local spatial information within each window. The third stage uses channel-level Transformer modules to capture global channel information across the whole image. We also introduce a fusion module to combine the features from different Transformer branches and obtain a comprehensive and accurate feature representation. We conduct extensive experiments on various image restoration tasks, such as deblurring and denoising, evaluating our approach on both general image restoration datasets and our proposed colon dataset. The results demonstrate the effectiveness and superiority of our network over state-of-the-art methods.

查看原文本刊更多论文

基于局部-全局变压器融合的多阶段渐进式网络图像恢复方法

图像恢复是一个具有挑战性和复杂性的问题，涉及从退化或噪声图像中恢复原始清晰图像。在医学领域，图像恢复技术可以显著提高内镜图像的质量，帮助医生做出更准确的诊断，为计算机视觉辅助检测提供更高质量的数据支持。现有的图像恢复方法主要采用卷积神经网络（convolutional neural networks, cnn）或Transformer模型，这两种方法在获取图像的空间信息和通道信息方面有不同的优势和局限性。本文提出了一种基于局部-全局混合变形的多阶段渐进图像恢复网络，称为MSTNet。我们的网络由三个阶段组成，每个阶段使用不同类型的Transformer模块来获取本地和全局信息。前两个阶段使用基于窗口的Transformer模块，它可以有效地提取每个窗口中的局部空间信息。第三阶段使用通道级Transformer模块来捕获整个图像的全局通道信息。我们还引入了一个融合模块，将来自不同Transformer分支的特征进行融合，从而获得全面准确的特征表示。我们对各种图像恢复任务进行了广泛的实验，例如去模糊和去噪，在一般图像恢复数据集和我们提出的冒号数据集上评估我们的方法。结果证明了我们的网络比最先进的方法的有效性和优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

9.60

自引率

10.30%

发文量

297

期刊介绍： Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.