Alternating Direction Unfolding With a Cross Spectral Attention Prior for Dual-Camera Compressive Hyperspectral Imaging

IF 13.7

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2025-08-18 DOI:10.1109/TIP.2025.3597775

Yubo Dong;Dahua Gao;Danhua Liu;Yanli Liu;Guangming Shi

{"title":"Alternating Direction Unfolding With a Cross Spectral Attention Prior for Dual-Camera Compressive Hyperspectral Imaging","authors":"Yubo Dong;Dahua Gao;Danhua Liu;Yanli Liu;Guangming Shi","doi":"10.1109/TIP.2025.3597775","DOIUrl":null,"url":null,"abstract":"Coded Aperture Snapshot Spectral Imaging (CASSI) multiplexes 3D Hyperspectral Images (HSIs) into a 2D sensor to capture dynamic spectral scenes, which, however, sacrifices the spatial information. Dual-Camera Compressive Hyperspectral Imaging (DCCHI) enhances CASSI by incorporating a Panchromatic (PAN) camera to compensate for the loss of spatial information in CASSI. However, the dual-camera structure of DCCHI disrupts the diagonal property of the product of the sensing matrix and its transpose, making it difficult to efficiently and accurately solve the data subproblem in closed-form and thereby hindering the application of model-based methods and Deep Unfolding Networks (DUNs) that rely on such a closed-form solution. To address this issue, we propose an Alternating Direction DUN, named ADRNN, which decouples the imaging model of DCCHI into a CASSI subproblem and a PAN subproblem. The ADRNN alternately solves data terms analytically and a joint prior term in these subproblems. Additionally, we propose a Cross Spectral Transformer (XST) to exploit the joint prior. The XST utilizes cross spectral attention to exploit the correlation between the compressed HSI and the PAN image, and incorporates Grouped-Query Attention (GQA) to alleviate the burden of parameters and computational cost brought by impartially treating the compressed HSI and the PAN image. Furthermore, we built a real DCCHI system and captured large-scale indoor and outdoor scenes for future academic research. Extensive experiments on both simulation and real datasets demonstrate that the proposed method achieves state-of-the-art (SOTA) performance. The code and datasets have been open-sourced at: <uri>https://github.com/ShawnDong98/ADRNN-XST</uri>","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"5325-5340"},"PeriodicalIF":13.7000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11128981/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Coded Aperture Snapshot Spectral Imaging (CASSI) multiplexes 3D Hyperspectral Images (HSIs) into a 2D sensor to capture dynamic spectral scenes, which, however, sacrifices the spatial information. Dual-Camera Compressive Hyperspectral Imaging (DCCHI) enhances CASSI by incorporating a Panchromatic (PAN) camera to compensate for the loss of spatial information in CASSI. However, the dual-camera structure of DCCHI disrupts the diagonal property of the product of the sensing matrix and its transpose, making it difficult to efficiently and accurately solve the data subproblem in closed-form and thereby hindering the application of model-based methods and Deep Unfolding Networks (DUNs) that rely on such a closed-form solution. To address this issue, we propose an Alternating Direction DUN, named ADRNN, which decouples the imaging model of DCCHI into a CASSI subproblem and a PAN subproblem. The ADRNN alternately solves data terms analytically and a joint prior term in these subproblems. Additionally, we propose a Cross Spectral Transformer (XST) to exploit the joint prior. The XST utilizes cross spectral attention to exploit the correlation between the compressed HSI and the PAN image, and incorporates Grouped-Query Attention (GQA) to alleviate the burden of parameters and computational cost brought by impartially treating the compressed HSI and the PAN image. Furthermore, we built a real DCCHI system and captured large-scale indoor and outdoor scenes for future academic research. Extensive experiments on both simulation and real datasets demonstrate that the proposed method achieves state-of-the-art (SOTA) performance. The code and datasets have been open-sourced at: https://github.com/ShawnDong98/ADRNN-XST

查看原文本刊更多论文

基于交叉光谱注意先验的双相机压缩高光谱成像交替方向展开。

编码孔径快照光谱成像（CASSI）将3D高光谱图像（hsi）多路复用到二维传感器中，以捕获动态光谱场景，但牺牲了空间信息。双相机压缩高光谱成像（dcci）通过加入全色（PAN）相机来弥补CASSI中空间信息的损失，从而增强CASSI。然而，dcci的双摄像头结构破坏了传感矩阵及其转置乘积的对角性质，使得难以高效、准确地求解封闭形式的数据子问题，从而阻碍了基于模型的方法和依赖于这种封闭形式解的深度展开网络（DUNs）的应用。为了解决这个问题，我们提出了一个交替方向DUN，称为ADRNN，它将dcci的成像模型解耦为CASSI子问题和PAN子问题。ADRNN交替地解析解决这些子问题中的数据项和联合先验项。此外，我们提出了一种交叉频谱变压器（XST）来利用关节先验。XST利用交叉光谱注意来挖掘压缩后的HSI与PAN图像之间的相关性，并结合分组查询注意（GQA）来减轻对压缩后的HSI与PAN图像进行公正处理所带来的参数负担和计算成本。此外，我们建立了一个真实的dcci系统，并捕获了大规模的室内和室外场景，为未来的学术研究做准备。在仿真和真实数据集上的大量实验表明，该方法达到了最先进的SOTA性能。代码和数据集已经在https://github.com/ShawnDong98/ADRNN-XST上开源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量