图像超分辨率的扩散模型:现状和未来方向

IF 6.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Garas Gendy , Guanghui He , Nabil Sabor
{"title":"图像超分辨率的扩散模型:现状和未来方向","authors":"Garas Gendy ,&nbsp;Guanghui He ,&nbsp;Nabil Sabor","doi":"10.1016/j.neucom.2024.128911","DOIUrl":null,"url":null,"abstract":"<div><div>The single image super-resolution (SISR) task has received much attention due to the wide range of applications in many tasks. The progress in this SISR is mainly based on deep learning methods. In recent years, many methods have been developed based on using diffusion models to solve the SISR task. The performance of the diffusion-based model depends on many factors, such as the type of diffusion, noise schedule, and loss function. Based on this limitation, we developed a review paper about the state-of-the-art diffusion models, especially ones that were developed for SR. In this survey, the SR-based diffusion models are classified based on four metrics, namely the diffusion model type, the noise schedule, the loss function, and the used datasets. Then, we discuss the main categories in each metric and the developed diffusion models related to each category. Three main diffusion model types are considered, including denoising diffusion probabilistic models (DDPM), noise-conditioned score networks (NCSN), and stochastic differential equations models (SDEM), in addition to other diffusion models. In the end, we discuss the limitations of these diffusion models and show our expectations for future research in this domain.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"617 ","pages":"Article 128911"},"PeriodicalIF":6.5000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diffusion models for image super-resolution: State-of-the-art and future directions\",\"authors\":\"Garas Gendy ,&nbsp;Guanghui He ,&nbsp;Nabil Sabor\",\"doi\":\"10.1016/j.neucom.2024.128911\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The single image super-resolution (SISR) task has received much attention due to the wide range of applications in many tasks. The progress in this SISR is mainly based on deep learning methods. In recent years, many methods have been developed based on using diffusion models to solve the SISR task. The performance of the diffusion-based model depends on many factors, such as the type of diffusion, noise schedule, and loss function. Based on this limitation, we developed a review paper about the state-of-the-art diffusion models, especially ones that were developed for SR. In this survey, the SR-based diffusion models are classified based on four metrics, namely the diffusion model type, the noise schedule, the loss function, and the used datasets. Then, we discuss the main categories in each metric and the developed diffusion models related to each category. Three main diffusion model types are considered, including denoising diffusion probabilistic models (DDPM), noise-conditioned score networks (NCSN), and stochastic differential equations models (SDEM), in addition to other diffusion models. In the end, we discuss the limitations of these diffusion models and show our expectations for future research in this domain.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"617 \",\"pages\":\"Article 128911\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231224016825\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224016825","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

单幅图像超分辨率(SISR)任务由于在许多任务中的广泛应用而受到了广泛的关注。该SISR的进展主要基于深度学习方法。近年来,人们开发了许多基于扩散模型的方法来解决SISR任务。基于扩散的模型的性能取决于许多因素,如扩散类型、噪声调度和损失函数。基于这一限制,我们对目前最先进的扩散模型,特别是为sr开发的扩散模型进行了综述。在本调查中,基于sr的扩散模型基于四个指标进行分类,即扩散模型类型,噪声时间表,损失函数和使用的数据集。然后,我们讨论了每个度量的主要类别以及与每个类别相关的发展扩散模型。除了其他扩散模型外,还考虑了三种主要的扩散模型类型,包括去噪扩散概率模型(DDPM)、噪声条件评分网络(NCSN)和随机微分方程模型(SDEM)。最后,我们讨论了这些扩散模型的局限性,并对该领域的未来研究进行了展望。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Diffusion models for image super-resolution: State-of-the-art and future directions
The single image super-resolution (SISR) task has received much attention due to the wide range of applications in many tasks. The progress in this SISR is mainly based on deep learning methods. In recent years, many methods have been developed based on using diffusion models to solve the SISR task. The performance of the diffusion-based model depends on many factors, such as the type of diffusion, noise schedule, and loss function. Based on this limitation, we developed a review paper about the state-of-the-art diffusion models, especially ones that were developed for SR. In this survey, the SR-based diffusion models are classified based on four metrics, namely the diffusion model type, the noise schedule, the loss function, and the used datasets. Then, we discuss the main categories in each metric and the developed diffusion models related to each category. Three main diffusion model types are considered, including denoising diffusion probabilistic models (DDPM), noise-conditioned score networks (NCSN), and stochastic differential equations models (SDEM), in addition to other diffusion models. In the end, we discuss the limitations of these diffusion models and show our expectations for future research in this domain.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Neurocomputing
Neurocomputing 工程技术-计算机:人工智能
CiteScore
13.10
自引率
10.00%
发文量
1382
审稿时长
70 days
期刊介绍: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信