Diff-Holo: A Residual Diffusion Model With Complex Transformer for Rapid Single-Frame Hologram Reconstruction

IF 4.8 2区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Transactions on Computational Imaging Pub Date : 2025-04-16 DOI:10.1109/TCI.2025.3561683

Ziqi Bai;Xianming Liu;Cheng Guo;Kui Jiang;Junjun Jiang;Xiangyang Ji

{"title":"Diff-Holo: A Residual Diffusion Model With Complex Transformer for Rapid Single-Frame Hologram Reconstruction","authors":"Ziqi Bai;Xianming Liu;Cheng Guo;Kui Jiang;Junjun Jiang;Xiangyang Ji","doi":"10.1109/TCI.2025.3561683","DOIUrl":null,"url":null,"abstract":"Deep learning approaches have gained significant traction in holographic imaging, with diffusion models—an emerging class of deep generative models—showing particular promise in hologram reconstruction. Unlike conventional neural networks that directly generate outputs, diffusion models gradually add noise to data and train neural networks to remove it, enabling them to learn implicit priors of the underlying data distribution. However, current diffusion-based hologram reconstruction methods often require hundreds or even thousands of iterations to achieve high-fidelity results, leading to processing times of several minutes or more—falling short of the fast imaging demands of holographic systems. To address this, we propose <italic>Diff-Holo</i>, a residual diffusion model integrated with a complex transformer, designed for rapid and high-quality single-frame hologram reconstruction. Specifically, we create a shorter and more efficient Markov chain by controlling the residuals between clean images and those degraded by twin-image artifacts. Additionally, we incorporate complex-valued priors into the network by using a complex window-based transformer as the backbone, enhancing the network's ability to process complex-valued data in the reverse reconstruction process. Experimental results demonstrate that Diff-Holo achieves high-quality single-frame reconstructions in as few as 15 sampling steps, reducing reconstruction time from minutes to under 2.2 seconds.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"689-703"},"PeriodicalIF":4.8000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Imaging","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10966195/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning approaches have gained significant traction in holographic imaging, with diffusion models—an emerging class of deep generative models—showing particular promise in hologram reconstruction. Unlike conventional neural networks that directly generate outputs, diffusion models gradually add noise to data and train neural networks to remove it, enabling them to learn implicit priors of the underlying data distribution. However, current diffusion-based hologram reconstruction methods often require hundreds or even thousands of iterations to achieve high-fidelity results, leading to processing times of several minutes or more—falling short of the fast imaging demands of holographic systems. To address this, we propose Diff-Holo, a residual diffusion model integrated with a complex transformer, designed for rapid and high-quality single-frame hologram reconstruction. Specifically, we create a shorter and more efficient Markov chain by controlling the residuals between clean images and those degraded by twin-image artifacts. Additionally, we incorporate complex-valued priors into the network by using a complex window-based transformer as the backbone, enhancing the network's ability to process complex-valued data in the reverse reconstruction process. Experimental results demonstrate that Diff-Holo achieves high-quality single-frame reconstructions in as few as 15 sampling steps, reducing reconstruction time from minutes to under 2.2 seconds.

查看原文本刊更多论文

Diff-Holo：一种用于快速单帧全息重建的复杂变压器残余扩散模型

深度学习方法在全息成像中获得了显著的吸引力，扩散模型——一种新兴的深度生成模型——在全息图重建中显示出特别的前景。与直接产生输出的传统神经网络不同，扩散模型逐渐向数据中添加噪声，并训练神经网络去除噪声，使其能够学习底层数据分布的隐式先验。然而，目前基于扩散的全息图重建方法通常需要数百甚至数千次迭代才能获得高保真度的结果，导致处理时间长达几分钟或更长，无法满足全息系统的快速成像需求。为了解决这个问题，我们提出了Diff-Holo，一种集成了复杂变压器的残余扩散模型，旨在实现快速和高质量的单帧全息图重建。具体来说，我们通过控制干净图像和被双图像伪影退化的图像之间的残差来创建更短、更有效的马尔可夫链。此外，我们使用基于复杂窗口的变压器作为主干，将复杂值先验纳入网络，增强了网络在反向重构过程中处理复杂值数据的能力。实验结果表明，Diff-Holo只需15个采样步骤即可实现高质量的单帧重建，将重建时间从几分钟缩短到2.2秒以下。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Computational Imaging Mathematics-Computational Mathematics

CiteScore

8.20

自引率

7.40%

发文量

期刊介绍： The IEEE Transactions on Computational Imaging will publish articles where computation plays an integral role in the image formation process. Papers will cover all areas of computational imaging ranging from fundamental theoretical methods to the latest innovative computational imaging system designs. Topics of interest will include advanced algorithms and mathematical techniques, model-based data inversion, methods for image and signal recovery from sparse and incomplete data, techniques for non-traditional sensing of image data, methods for dynamic information acquisition and extraction from imaging sensors, software and hardware for efficient computation in imaging systems, and highly novel imaging system design.