{"title":"Multi-Scale Cascaded With Cross-Attention Network-Based Deformation Vector Field Estimation for Motion-Compensated 4D-CBCT Reconstruction","authors":"Peng Yuan;Fei Lyu;Zhiqiang Gao;Chunfeng Yang;Dianlin Hu;Jian Zhu;Zhan Wu;Tianling Lyu;Wei Zhao;Jianmin Dong;Yang Chen","doi":"10.1109/TCI.2025.3561660","DOIUrl":null,"url":null,"abstract":"Four-Dimensional Cone Beam Computed Tomography (4D-CBCT) imaging technology offers enhanced image quality and spatial resolution for intraoperative guidance, facilitating real-time tracking of tumor position changes during radiotherapy. However, this is still a task of great challenges due to insufficient projections at each respiratory phase after phase-sorting, and the image phases reconstructed directly from phase-sorted data are discrete and discontinuous. To generate high-quality 4D-CBCT deformation vector fields (DVFs), this paper leverages the preoperative static prior image to guide intraoperative dynamic sparse-view reconstruction images for reducing anatomical structure differences, ultimately achieving continuous and dynamic 4D-CBCT imaging. In this paper, we propose a Multi-scale Cascaded residual deformable vector field estimation framework based on Cross-attention in Motion-compensated 4D-CBCT reconstruction (MCCM), which combines Multi-Scale Cascaded residual registration network (MSC-Net), Cross-Attention Enhanced feature Fusion (CAEF) module and Structure-Enhanced Motion-Compensated (SEMC) module: 1) the MCCM employs a multi-scale cascaded residual network strategy, merging multi-receptive fields and multi-resolution feature maps for large-scale internal changes. 2) the CAEF is embedded into MSC-Net to facilitate effective communication and learning between features and promote the flow of information. 3) the SEMC is developed to reduce artifacts after intraoperative CBCT sparse-view reconstruction, restore global lung motion changes and local details, and enhance structural information through image fusion. The proposed method has been evaluated using simulated and clinical datasets and has shown promising results by comparative experiment. Our approach exhibits significant improvements across various evaluation metrics.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"717-731"},"PeriodicalIF":4.2000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Imaging","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10966206/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Four-Dimensional Cone Beam Computed Tomography (4D-CBCT) imaging technology offers enhanced image quality and spatial resolution for intraoperative guidance, facilitating real-time tracking of tumor position changes during radiotherapy. However, this is still a task of great challenges due to insufficient projections at each respiratory phase after phase-sorting, and the image phases reconstructed directly from phase-sorted data are discrete and discontinuous. To generate high-quality 4D-CBCT deformation vector fields (DVFs), this paper leverages the preoperative static prior image to guide intraoperative dynamic sparse-view reconstruction images for reducing anatomical structure differences, ultimately achieving continuous and dynamic 4D-CBCT imaging. In this paper, we propose a Multi-scale Cascaded residual deformable vector field estimation framework based on Cross-attention in Motion-compensated 4D-CBCT reconstruction (MCCM), which combines Multi-Scale Cascaded residual registration network (MSC-Net), Cross-Attention Enhanced feature Fusion (CAEF) module and Structure-Enhanced Motion-Compensated (SEMC) module: 1) the MCCM employs a multi-scale cascaded residual network strategy, merging multi-receptive fields and multi-resolution feature maps for large-scale internal changes. 2) the CAEF is embedded into MSC-Net to facilitate effective communication and learning between features and promote the flow of information. 3) the SEMC is developed to reduce artifacts after intraoperative CBCT sparse-view reconstruction, restore global lung motion changes and local details, and enhance structural information through image fusion. The proposed method has been evaluated using simulated and clinical datasets and has shown promising results by comparative experiment. Our approach exhibits significant improvements across various evaluation metrics.
期刊介绍:
The IEEE Transactions on Computational Imaging will publish articles where computation plays an integral role in the image formation process. Papers will cover all areas of computational imaging ranging from fundamental theoretical methods to the latest innovative computational imaging system designs. Topics of interest will include advanced algorithms and mathematical techniques, model-based data inversion, methods for image and signal recovery from sparse and incomplete data, techniques for non-traditional sensing of image data, methods for dynamic information acquisition and extraction from imaging sensors, software and hardware for efficient computation in imaging systems, and highly novel imaging system design.