Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging

arXiv - EE - Image and Video Processing, Pub Date: 2024-09-11, DOI: arxiv-2409.07417

Yunzhen Wang, Haijin Zeng, Shaoguang Huang, Hongyu Chen, Hongyan Zhang



Abstract



Coded Aperture Snapshot Spectral Imaging (CASSI) is a crucial technique for capturing three-dimensional multispectral images (MSIs) through the complex inverse task of reconstructing these images from coded two-dimensional measurements. Current state-of-the-art methods, predominantly end-to-end, face limitations in reconstructing high-frequency details and often rely on constrained datasets like KAIST and CAVE, resulting in models with poor generalizability. In response to these challenges, this paper introduces a novel one-step Diffusion Probabilistic Model within a self-supervised adaptation framework for Snapshot Compressive Imaging (SCI). Our approach leverages a pretrained SCI reconstruction network to generate initial predictions from two-dimensional measurements. Subsequently, a one-step diffusion model produces high-frequency residuals to enhance these initial predictions. Additionally, acknowledging the high costs associated with collecting MSIs, we develop a self-supervised paradigm based on the Equivariant Imaging (EI) framework. Experimental results validate the superiority of our model compared to previous methods, showcasing its simplicity and adaptability to various end-to-end or unfolding techniques.
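The pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the simplified single-disperser forward model, the function names (`cassi_forward`, `refine`, `ei_losses`, `toy_reconstruct`), the circular-shift transform, and the toy reconstructor are all assumptions made for illustration. The two loss terms follow the general shape of Equivariant Imaging (a measurement-consistency term plus an equivariance term); the abstract does not state the exact losses, transforms, or networks used in the paper.

```python
import numpy as np


def cassi_forward(x, mask, step=1):
    """Simplified CASSI measurement (assumed model): mask every band,
    shift band k by k*step columns, and sum onto one 2D detector."""
    h, w, bands = x.shape
    y = np.zeros((h, w + step * (bands - 1)))
    for k in range(bands):
        y[:, k * step:k * step + w] += mask * x[:, :, k]
    return y


def refine(x0, residual):
    """Add a predicted high-frequency residual to the initial estimate.
    In the paper the residual comes from a one-step diffusion model;
    here it is simply an array passed in by the caller."""
    return x0 + residual


def toy_reconstruct(y, mask, bands, step=1):
    """Hypothetical stand-in for a pretrained SCI network: undo the
    per-band shift and reapply the mask. Only for demonstration."""
    h, w = mask.shape
    cube = [y[:, k * step:k * step + w] * mask for k in range(bands)]
    return np.stack(cube, axis=-1) / bands


def ei_losses(y, x_hat, mask, reconstruct, step=1, roll=1):
    """Two self-supervised terms in the style of Equivariant Imaging.
    `reconstruct` maps a 2D measurement to a 3D cube."""
    # Measurement consistency: the estimate should reproduce y.
    mc = np.mean((cassi_forward(x_hat, mask, step) - y) ** 2)
    # Equivariance: transform the estimate (assumed transform: circular
    # row shift), re-measure, reconstruct, and compare with the target.
    x_t = np.roll(x_hat, roll, axis=0)
    x_t_rec = reconstruct(cassi_forward(x_t, mask, step))
    eq = np.mean((x_t_rec - x_t) ** 2)
    return mc, eq
```

A typical call would build an initial estimate with `toy_reconstruct`, add a residual with `refine`, and pass the result to `ei_losses` together with the measurement `y`; no ground-truth multispectral cube is needed to evaluate either term, which is the point of the self-supervised setup.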
