利用生成模型为真实世界场景提供散焦增强技术

Advanced Imaging and Information Processing Pub Date : 2023-12-13 DOI:10.1117/12.3007254

Yuhao Li, Wenkang Gong, Tianle Li, Jiaqing Dong, Xianlin Song

{"title":"利用生成模型为真实世界场景提供散焦增强技术","authors":"Yuhao Li, Wenkang Gong, Tianle Li, Jiaqing Dong, Xianlin Song","doi":"10.1117/12.3007254","DOIUrl":null,"url":null,"abstract":"In recent years, significant progress has been made in deep learning-based image deblurring. These approaches utilize deep neural networks to learn the map between blurry and clear images or jointly learn the blurry kernel and clear image. They have demonstrated effectiveness in enhancing image quality, preserving details, and handling various types and degrees of blur. The objective of this study is to develop a defocus enhancement technique for real-world scenarios using score-based generative models. Stochastic differential equations (SDE) are employed to gradually introduce noise, thereby smoothing the data distribution towards a known prior distribution. The Score-Matching Langevin Dynamics (SMLD) model estimates the score for each noise scale, while Diffusion Models (DDPM) train the target model for score computation. This process constructs a score-based model capable of reversing the SDE over time. A predictor-corrector framework corrects the evolution of the reverse-time SDE, and the prior distribution is transformed back to the data distribution by removing the noise. By leveraging score-based generative models, accurate score estimation and sample generation are achieved using neural networks and numerical SDE solvers. This technique effectively restores clarity and details in defocused images, thereby enhancing overall image quality.","PeriodicalId":505225,"journal":{"name":"Advanced Imaging and Information Processing","volume":"41 9","pages":"129420F - 129420F-5"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Defocus-enhanced technique for real-world scenarios using generative models\",\"authors\":\"Yuhao Li, Wenkang Gong, Tianle Li, Jiaqing Dong, Xianlin Song\",\"doi\":\"10.1117/12.3007254\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, significant progress has been made in deep learning-based image deblurring. These approaches utilize deep neural networks to learn the map between blurry and clear images or jointly learn the blurry kernel and clear image. They have demonstrated effectiveness in enhancing image quality, preserving details, and handling various types and degrees of blur. The objective of this study is to develop a defocus enhancement technique for real-world scenarios using score-based generative models. Stochastic differential equations (SDE) are employed to gradually introduce noise, thereby smoothing the data distribution towards a known prior distribution. The Score-Matching Langevin Dynamics (SMLD) model estimates the score for each noise scale, while Diffusion Models (DDPM) train the target model for score computation. This process constructs a score-based model capable of reversing the SDE over time. A predictor-corrector framework corrects the evolution of the reverse-time SDE, and the prior distribution is transformed back to the data distribution by removing the noise. By leveraging score-based generative models, accurate score estimation and sample generation are achieved using neural networks and numerical SDE solvers. This technique effectively restores clarity and details in defocused images, thereby enhancing overall image quality.\",\"PeriodicalId\":505225,\"journal\":{\"name\":\"Advanced Imaging and Information Processing\",\"volume\":\"41 9\",\"pages\":\"129420F - 129420F-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Imaging and Information Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.3007254\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Imaging and Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.3007254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

近年来，基于深度学习的图像去模糊技术取得了重大进展。这些方法利用深度神经网络来学习模糊图像和清晰图像之间的映射，或联合学习模糊内核和清晰图像。这些方法在提高图像质量、保留细节以及处理各种类型和程度的模糊方面表现出了有效性。本研究的目的是利用基于分数的生成模型，开发一种适用于真实世界场景的失焦增强技术。随机微分方程（SDE）被用来逐步引入噪声，从而使数据分布趋向于已知的先验分布。分数匹配朗文动力学（SMLD）模型可估算出每个噪声尺度的分数，而扩散模型（DDPM）则可训练目标模型进行分数计算。这一过程构建了一个基于分数的模型，能够随着时间的推移逆转 SDE。预测器-校正器框架会校正反向时间 SDE 的演化，先验分布会通过去除噪声转换回数据分布。通过利用基于分数的生成模型，利用神经网络和数值 SDE 求解器实现了精确的分数估计和样本生成。这项技术能有效恢复失焦图像的清晰度和细节，从而提高整体图像质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Defocus-enhanced technique for real-world scenarios using generative models

In recent years, significant progress has been made in deep learning-based image deblurring. These approaches utilize deep neural networks to learn the map between blurry and clear images or jointly learn the blurry kernel and clear image. They have demonstrated effectiveness in enhancing image quality, preserving details, and handling various types and degrees of blur. The objective of this study is to develop a defocus enhancement technique for real-world scenarios using score-based generative models. Stochastic differential equations (SDE) are employed to gradually introduce noise, thereby smoothing the data distribution towards a known prior distribution. The Score-Matching Langevin Dynamics (SMLD) model estimates the score for each noise scale, while Diffusion Models (DDPM) train the target model for score computation. This process constructs a score-based model capable of reversing the SDE over time. A predictor-corrector framework corrects the evolution of the reverse-time SDE, and the prior distribution is transformed back to the data distribution by removing the noise. By leveraging score-based generative models, accurate score estimation and sample generation are achieved using neural networks and numerical SDE solvers. This technique effectively restores clarity and details in defocused images, thereby enhancing overall image quality.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advanced Imaging and Information Processing

自引率

0.00%

发文量