{"title":"Distributed Stochastic Optimization of a Neural Representation Network for Time-Space Tomography Reconstruction","authors":"K. Aditya Mohan;Massimiliano Ferrucci;Chuck Divin;Garrett A. Stevenson;Hyojin Kim","doi":"10.1109/TCI.2025.3547265","DOIUrl":"https://doi.org/10.1109/TCI.2025.3547265","url":null,"abstract":"4D time-space reconstruction of dynamic events or deforming objects using X-ray computed tomography (CT) is an important inverse problem in non-destructive evaluation. Conventional back-projection based reconstruction methods assume that the object remains static for the duration of several tens or hundreds of X-ray projection measurement images (reconstruction of consecutive limited-angle CT scans). However, this is an unrealistic assumption for many in-situ experiments that causes spurious artifacts and inaccurate morphological reconstructions of the object. To solve this problem, we propose to perform a 4D time-space reconstruction using a distributed implicit neural representation (DINR) network that is trained using a novel distributed stochastic training algorithm. Our DINR network learns to reconstruct the object at its output by iterative optimization of its network parameters such that the measured projection images best match the output of the CT forward measurement model. We use a forward measurement model that is a function of the DINR outputs at a sparsely sampled set of continuous valued 4D object coordinates. Unlike previous neural representation architectures that forward and back propagate through dense voxel grids that sample the object's entire time-space coordinates, we only propagate through the DINR at a small subset of object coordinates in each iteration resulting in an order-of-magnitude reduction in memory and compute for training. DINR leverages distributed computation across several compute nodes and GPUs to produce high-fidelity 4D time-space reconstructions. We use both simulated parallel-beam and experimental cone-beam X-ray CT datasets to demonstrate the superior performance of our approach.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"362-376"},"PeriodicalIF":4.2,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DYRECT Computed Tomography: DYnamic Reconstruction of Events on a Continuous Timescale","authors":"Wannes Goethals;Tom Bultreys;Steffen Berg;Matthieu N. Boone;Jan Aelterman","doi":"10.1109/TCI.2025.3566241","DOIUrl":"https://doi.org/10.1109/TCI.2025.3566241","url":null,"abstract":"Time-resolved high-resolution X-ray Computed Tomography (4D <inline-formula><tex-math>$mu$</tex-math></inline-formula>CT) is an imaging technique that offers insight into the evolution of dynamic processes inside materials that are opaque to visible light. Conventional tomographic reconstruction techniques are based on constructing a sequence of 3D images from radiographic projections, recorded during time-frames that represent global sample states. This frame-based approach limits the temporal resolution compared to dynamic radiography experiments, and it leads to an inflation of the amount of data. This results in costly post-processing computations to quantify the dynamic behaviour from the sequence of time-frames, hereby often ignoring the temporal correlations of the sample structure. Our proposed 4D <inline-formula><tex-math>$mu$</tex-math></inline-formula>CT reconstruction technique, named DYRECT, estimates individual attenuation evolution profiles for each position in the sample with time resolution down to the single projection level. This leads to a novel memory-efficient event-based representation for samples that display sudden, irreversible transitions over time. As little as three image volumes suffice for a broad range of applications: the initial attenuations, the final attenuations and the <italic>local</i> transition times. This third volume represents spatially distributed events on a continuous timescale instead of the discrete global time-frames. We propose a method to iteratively reconstruct the transition times and the attenuation volumes. The dynamic reconstruction technique was validated on synthetic ground truth data and experimental data, and was found to effectively pinpoint the transition times in the synthetic dataset with a time resolution corresponding to less than a tenth of the amount of projections required to reconstruct traditional <inline-formula><tex-math>$mu$</tex-math></inline-formula>CT time-frames.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"638-649"},"PeriodicalIF":4.2,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144125456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimizing Coded Excitation for Model-Based Ultrasound Imaging With Unfocused Transmissions","authors":"Didem Dogan;Lixiang Zhu;Yuyang Hu;Johannes G. Bosch;Pieter Kruizinga;Geert Leus","doi":"10.1109/TCI.2025.3566224","DOIUrl":"https://doi.org/10.1109/TCI.2025.3566224","url":null,"abstract":"Ultrafast imaging, which uses unfocussed transmissions to form images, provides very high frame rates at the cost of low signal-to-noise ratio (SNR). This loss of SNR becomes especially apparent when imaging deeper structures. Ultrafast imaging is mostly used in combination with Doppler processing. Even if we apply tissue-separation filters, they lead to significant energy loss and decrease the SNR. Previous work showed that this loss in SNR and, hence, penetration depth can be partially regained using coded transmissions. However, these codes are mostly either standard or randomly generated and can be improved with a design rooted in an optimization scheme. To address this limitation, we design an optimized code tailored to ultrasound imaging with unfocused transmissions represented by a generalized encoding matrix in a linear signal model. We employ the minimization of the Cramér-Rao lower bound (CRB) over the unknown coding matrix as a way to optimize the code. Due to the high computational cost of the resulting optimization problems, we also introduce a trace-constraint optimization problem based on the Fisher information matrix (FIM). Simulation results show that the optimized code provides higher SNR in deep image regions than previously tested coding schemes such as the Barker code, albeit with a trade-off for decreased resolution. On the other hand, the application of least-squares QR (LSQR) mitigates this resolution degradation. Lastly, the optimized code was tested in simulations using a numerical model of a clinical transducer setting, demonstrating its potential for higher SNR in ultrafast Doppler imaging.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"609-624"},"PeriodicalIF":4.2,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143949189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Transfer Learning for Data Fusion for Electromagnetic and Ultrasound Breast Imaging","authors":"Valentin Noël;Thomas Rodet;Dominique Lesselier","doi":"10.1109/TCI.2025.3541934","DOIUrl":"https://doi.org/10.1109/TCI.2025.3541934","url":null,"abstract":"Aiming at improved breast imaging, this contribution explores several scenarios for segmenting and estimating the distribution of electromagnetic (EM) and/or ultrasonic (US) parameters within breast tissue. A two-fold approach is adopted, leveraging Transfer Learning (TL) through Bayesian Neural Networks (BNN); the first objective is to consistently enhance imaging results, and the second is to establish a novel framework for data fusion transfer learning. The methodological approach is tailored for Artificial, Convolutional, and Bayesian Neural Networks, showcasing its effectiveness through the analysis of electromagnetic (EM) and ultrasonic (US) datasets computed in reliable scenarios, with a focus on heterogeneously dense and extremely dense breasts. Furthermore, a novel transfer learning Bayesian data fusion framework incorporating multi-frequency data exploits the complementary nature of EM low-resolution and US high-resolution imaging. By enhancing the fusion of EM and US data, this framework leads to better-contrasted zones in the images and is shown to outperform the most common transfer learning approaches.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"546-555"},"PeriodicalIF":4.2,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143875175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards Robust and Generalizable Lensless Imaging With Modular Learned Reconstruction","authors":"Eric Bezzam;Yohann Perron;Martin Vetterli","doi":"10.1109/TCI.2025.3539448","DOIUrl":"https://doi.org/10.1109/TCI.2025.3539448","url":null,"abstract":"Lensless cameras disregard the conventional design that imaging should mimic the human eye. This is done by replacing the lens with a thin mask, and moving image formation to the digital post-processing. State-of-the-art lensless imaging techniques use learned approaches that combine physical modeling and neural networks. However, these approaches make simplifying modeling assumptions for ease of calibration and computation. Moreover, the generalizability of learned approaches to lensless measurements of new masks has not been studied. To this end, we utilize a modular learned reconstruction in which a key component is a pre-processor prior to image recovery. We theoretically demonstrate the pre-processor's necessity for standard image recovery techniques (Wiener filtering and iterative algorithms), and through extensive experiments show its effectiveness for multiple lensless imaging approaches and across datasets of different mask types (amplitude and phase). We also perform the first generalization benchmark across mask types to evaluate how well reconstructions trained with one system generalize to others. Our modular reconstruction enables us to use pre-trained components and transfer learning on new systems to cut down weeks of tedious measurements and training. As part of our work, we open-source four datasets, and software for measuring datasets and for training our modular reconstruction.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"213-227"},"PeriodicalIF":4.2,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143521502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards Single-Lens Controllable Depth-of-Field Imaging via Depth-Aware Point Spread Functions","authors":"Xiaolong Qian;Qi Jiang;Yao Gao;Shaohua Gao;Zhonghua Yi;Lei Sun;Kai Wei;Haifeng Li;Kailun Yang;Kaiwei Wang;Jian Bai","doi":"10.1109/TCI.2025.3544019","DOIUrl":"https://doi.org/10.1109/TCI.2025.3544019","url":null,"abstract":"Controllable Depth-of-Field (DoF) imaging commonly produces amazing visual effects based on heavy and expensive high-end lenses. However, confronted with the increasing demand for mobile scenarios, it is desirable to achieve a lightweight solution with Minimalist Optical Systems (MOS). This work centers around two major limitations of MOS, i.e., the severe optical aberrations and uncontrollable DoF, for achieving single-lens controllable DoF imaging via computational methods. A Depth-aware Controllable DoF Imaging (DCDI) framework is proposed equipped with All-in-Focus (AiF) aberration correction and monocular depth estimation, where the recovered image and corresponding depth map are utilized to produce imaging results under diverse DoFs of any high-end lens via patch-wise convolution. To address the depth-varying optical degradation, we introduce a Depth-aware Degradation-adaptive Training (DA<inline-formula> <tex-math>$^{2}$</tex-math></inline-formula>T) scheme. At the dataset level, a Depth-aware Aberration MOS (DAMOS) dataset is established based on the simulation of Point Spread Functions (PSFs) under different object distances. Additionally, we design two plug-and-play depth-aware mechanisms to embed depth information into the aberration image recovery for better tackling depth-aware degradation. Furthermore, we propose a storage-efficient Omni-Lens-Field model to represent the 4D PSF library of various lenses. With the predicted depth map, recovered image, and depth-aware PSF map inferred by Omni-Lens-Field, single-lens controllable DoF imaging is achieved. To the best of our knowledge, we are the first to explore the single-lens controllable DoF imaging solution. Comprehensive experimental results demonstrate that the proposed framework enhances the recovery performance, and attains impressive single-lens controllable DoF imaging results, providing a seminal baseline for this field.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"305-320"},"PeriodicalIF":4.2,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143611822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"NLCMR: Indoor Depth Recovery Model With Non-Local Cross-Modality Prior","authors":"Junkang Zhang;Zhengkai Qi;Faming Fang;Tingting Wang;Guixu Zhang","doi":"10.1109/TCI.2025.3545358","DOIUrl":"https://doi.org/10.1109/TCI.2025.3545358","url":null,"abstract":"Recovering a dense depth image from sparse inputs is inherently challenging. Image-guided depth completion has become a prevalent technique, leveraging sparse depth data alongside RGB images to produce detailed depth maps. Although deep learning-based methods have achieved notable success, many state-of-the-art networks operate as black boxes, lacking transparent mechanisms for depth recovery. To address this, we introduce a novel model-guided depth recovery method. Our approach is built on a maximum a posterior (MAP) framework and features an optimization model that incorporates a non-local cross-modality regularizer and a deep image prior. The cross-modality regularizer capitalizes on the inherent correlations between depth and RGB images, enhancing the extraction of shared information. Additionally, the deep image prior captures local characteristics between the depth and RGB domains effectively. To counter the challenge of high heterogeneity leading to degenerate operators, we have integrated an implicit data consistency term into our model. Our model is then realized as a network using the half-quadratic splitting algorithm. Extensive evaluations on the NYU-Depth V2 and SUN RGB-D datasets demonstrate that our method performs competitively with current deep learning techniques.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"265-276"},"PeriodicalIF":4.2,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143570693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploring Quasi-Global Solutions to Compound Lens Based Computational Imaging Systems","authors":"Yao Gao;Qi Jiang;Shaohua Gao;Lei Sun;Kailun Yang;Kaiwei Wang","doi":"10.1109/TCI.2025.3545357","DOIUrl":"https://doi.org/10.1109/TCI.2025.3545357","url":null,"abstract":"Recently, joint design approaches that simultaneously optimize optical systems and downstream algorithms through data-driven learning have demonstrated superior performance over traditional separate design approaches. However, current joint design approaches heavily rely on the manual identification of initial lenses, posing challenges and limitations, particularly for compound lens systems with multiple potential starting points. In this work, we present Quasi-Global Search Optics (QGSO) to automatically design compound lens based computational imaging systems through two parts: (i) Fused Optimization Method for Automatic Optical Design (OptiFusion), which searches for diverse initial optical systems under certain design specifications; and (ii) Efficient Physic-aware Joint Optimization (EPJO), which conducts parallel joint optimization of initial optical systems and image reconstruction networks with the consideration of physical constraints, culminating in the selection of the optimal solution in all search results. Extensive experimental results illustrate that QGSO serves as a transformative end-to-end lens design paradigm for superior global search ability, which automatically provides compound lens based computational imaging systems with higher imaging quality compared to existing paradigms.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"333-348"},"PeriodicalIF":4.2,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143654979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SLBL-PU: Shadow-Based Layer-By-Layer Phase Unwrapping for Efficient 3D Measurement","authors":"Ruiming Yu;Hongshan Yu;Haiqiang Xu;Wei Sun;Naveed Akhtar;Yaonan Wang","doi":"10.1109/TCI.2025.3544084","DOIUrl":"https://doi.org/10.1109/TCI.2025.3544084","url":null,"abstract":"Phase-shifting (PS) based structured light technology shows excellent 3D perception performance. However, it requires projecting a extensive array of patterns, imposing constraints on the measurement space, or embedding additional signals for phase unwrapping (PU), leading to motion artifacts and low robustness. To surmount these challenges, we propose a shadow-based, layer-by-layer phase unwrapping (SLBL-PU) method, which enables absolute phase recovery for deep objects without the need for any supplementary patterns. In the initial phase, attention is focused on a novel truncation feature within the local phase, facilitating the use of iterative PUs to derive the modulated phase. Inspired by shading theory, in the second phase, the absolute phase is restored based on the geometric relationship between the imaging system and the object shadows. Additionally, by incorporating a time-division multiplexing strategy, the efficiency of 3D reconstruction in dynamic scenes is further tripled. In experiments involving different depths, phase modulation, complex colored, and dynamic scenes, the proposed method demonstrated superior performance. Specifically, in static environments (0 mm/s), the proposed approach yields greater measurement accuracy (0.020 mm and 0.195 mm) than does the traditional spatial domain modulation (PS) method. In dynamic environments (15 mm/s), the proposed approach theoretically utilizes at least three patterns, with a defect rate lower than that of the nine-pattern, three-frequency PS method (8.58% and 14.68%).","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"452-467"},"PeriodicalIF":4.2,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143792842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhanced Single Pixel Imaging by Using Adaptive Jointly Optimized Conditional Diffusion","authors":"Jiawei Dong;Hong Zeng;Sen Dong;Weining Chen;Qianxi Li;Jianzhong Cao;Qiurong Yan;Hao Wang","doi":"10.1109/TCI.2025.3544087","DOIUrl":"https://doi.org/10.1109/TCI.2025.3544087","url":null,"abstract":"Single-pixel imaging can reconstruct the original image at a low measurement rate (MR), and the target can be measured and reconstructed in low-light environments by capturing the light intensity information using a single-photon detector. Optimizing reconstruction results at low MR has become a focal point of research aimed at enhancing measurement efficiency. The application of neural network has significantly improved reconstruction quality, but the performance still requires further enhancement. In this paper, a Diffusion Single Pixel Imaging Model (DSPIM) method is proposed. The conditional diffusion model is utilized in the training and reconstruction processes of single-pixel imaging and is jointly optimized with an autoencoder network. This approach simulates the measurement and preliminary reconstruction of images, which are incorporated into the diffusion process as conditions. The noises and features are learned through a designed loss function that consists of predicted noise loss and measurement accuracy loss, allowing the reconstruction to perform well at very low MR. Besides, an adaptive regularization coefficients adjustment method (ARCA) has been designed for more effective optimization. Finally, the learned weights are loaded into the single photon counting system as a measurement matrix, demonstrating that the blurriness caused by insufficient features at low MR is effectively addressed using our methods, resulting in clearer targets and well-distinguished features.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"289-304"},"PeriodicalIF":4.2,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143601908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}