{"title":"用于快照压缩成像的高效一步扩散细化技术","authors":"Yunzhen Wang, Haijin Zeng, Shaoguang Huang, Hongyu Chen, Hongyan Zhang","doi":"arxiv-2409.07417","DOIUrl":null,"url":null,"abstract":"Coded Aperture Snapshot Spectral Imaging (CASSI) is a crucial technique for\ncapturing three-dimensional multispectral images (MSIs) through the complex\ninverse task of reconstructing these images from coded two-dimensional\nmeasurements. Current state-of-the-art methods, predominantly end-to-end, face\nlimitations in reconstructing high-frequency details and often rely on\nconstrained datasets like KAIST and CAVE, resulting in models with poor\ngeneralizability. In response to these challenges, this paper introduces a\nnovel one-step Diffusion Probabilistic Model within a self-supervised\nadaptation framework for Snapshot Compressive Imaging (SCI). Our approach\nleverages a pretrained SCI reconstruction network to generate initial\npredictions from two-dimensional measurements. Subsequently, a one-step\ndiffusion model produces high-frequency residuals to enhance these initial\npredictions. Additionally, acknowledging the high costs associated with\ncollecting MSIs, we develop a self-supervised paradigm based on the Equivariant\nImaging (EI) framework. Experimental results validate the superiority of our\nmodel compared to previous methods, showcasing its simplicity and adaptability\nto various end-to-end or unfolding techniques.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging\",\"authors\":\"Yunzhen Wang, Haijin Zeng, Shaoguang Huang, Hongyu Chen, Hongyan Zhang\",\"doi\":\"arxiv-2409.07417\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Coded Aperture Snapshot Spectral Imaging (CASSI) is a crucial technique for\\ncapturing three-dimensional multispectral images (MSIs) through the complex\\ninverse task of reconstructing these images from coded two-dimensional\\nmeasurements. Current state-of-the-art methods, predominantly end-to-end, face\\nlimitations in reconstructing high-frequency details and often rely on\\nconstrained datasets like KAIST and CAVE, resulting in models with poor\\ngeneralizability. In response to these challenges, this paper introduces a\\nnovel one-step Diffusion Probabilistic Model within a self-supervised\\nadaptation framework for Snapshot Compressive Imaging (SCI). Our approach\\nleverages a pretrained SCI reconstruction network to generate initial\\npredictions from two-dimensional measurements. Subsequently, a one-step\\ndiffusion model produces high-frequency residuals to enhance these initial\\npredictions. Additionally, acknowledging the high costs associated with\\ncollecting MSIs, we develop a self-supervised paradigm based on the Equivariant\\nImaging (EI) framework. Experimental results validate the superiority of our\\nmodel compared to previous methods, showcasing its simplicity and adaptability\\nto various end-to-end or unfolding techniques.\",\"PeriodicalId\":501289,\"journal\":{\"name\":\"arXiv - EE - Image and Video Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Image and Video Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.07417\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging
Coded Aperture Snapshot Spectral Imaging (CASSI) is a crucial technique for
capturing three-dimensional multispectral images (MSIs) through the complex
inverse task of reconstructing these images from coded two-dimensional
measurements. Current state-of-the-art methods, predominantly end-to-end, face
limitations in reconstructing high-frequency details and often rely on
constrained datasets like KAIST and CAVE, resulting in models with poor
generalizability. In response to these challenges, this paper introduces a
novel one-step Diffusion Probabilistic Model within a self-supervised
adaptation framework for Snapshot Compressive Imaging (SCI). Our approach
leverages a pretrained SCI reconstruction network to generate initial
predictions from two-dimensional measurements. Subsequently, a one-step
diffusion model produces high-frequency residuals to enhance these initial
predictions. Additionally, acknowledging the high costs associated with
collecting MSIs, we develop a self-supervised paradigm based on the Equivariant
Imaging (EI) framework. Experimental results validate the superiority of our
model compared to previous methods, showcasing its simplicity and adaptability
to various end-to-end or unfolding techniques.