{"title":"PAH2T-Former:用于协同提高FMT重建质量和效率的配对注意混合分层变压器","authors":"Peng Zhang;Xingyu Liu;Qianqian Xue;Yu Shang;Chen Liu;Ruhao Chen;Honglei Gao;Jiye Liang;Wenjian Wang;Guanglei Zhang","doi":"10.1109/TCI.2025.3559431","DOIUrl":null,"url":null,"abstract":"Fluorescence molecular tomography (FMT) is a sensitive optical imaging technique that can achieve three-dimensional (3D) tomographic images at the molecular and cellular levels. However, reconstructing the internal 3D distribution of fluorescent targets from surface two-dimensional (2D) fluorescence projection data remains a challenging task. In recent years, deep learning-based FMT reconstruction has received considerable attention, demonstrating superior performance compared to conventional methods, particularly combined with Transformers. Unlike convolutional architectures that emphasize local context, Transformers leverage self-attention mechanisms to excel at capturing long-range dependencies, thereby enhancing FMT reconstruction accuracy. Nevertheless, the quadratic computational complexity of self-attention poses a bottleneck, particularly pertinent in 3D FMT reconstructions. This paper aims to propose a novel Transformer-based FMT reconstruction algorithm that not only delivers high-quality reconstruction accuracy but also maintains excellent performance in efficiency and inference speed. The key design involves introducing a novel Spatial-Channel Paired Attention Module (SC-PAM), which employs a pair of interdependent branches based on spatial and channel attention, thus effectively learn discriminative features in both spatial and channel domains, meanwhile exhibiting linear complexity relative to the input projection size. Furthermore, to facilitate data transmission between the spatial and channel branches, we share the weights of the query and key mapping functions, which provides a complementary paired attention without elevating complexity. Extensive evaluations through numerical simulations and in vivo experiments were performed to validate effectiveness of the proposed model. The results show that our PAH2T-Former method achieves the highest Dice while reducing model parameters and complexity.","PeriodicalId":56022,"journal":{"name":"IEEE Transactions on Computational Imaging","volume":"11 ","pages":"536-545"},"PeriodicalIF":4.2000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PAH2T-Former: Paired-Attention Hybrid Hierarchical Transformer for Synergistically Enhanced FMT Reconstruction Quality and Efficiency\",\"authors\":\"Peng Zhang;Xingyu Liu;Qianqian Xue;Yu Shang;Chen Liu;Ruhao Chen;Honglei Gao;Jiye Liang;Wenjian Wang;Guanglei Zhang\",\"doi\":\"10.1109/TCI.2025.3559431\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fluorescence molecular tomography (FMT) is a sensitive optical imaging technique that can achieve three-dimensional (3D) tomographic images at the molecular and cellular levels. However, reconstructing the internal 3D distribution of fluorescent targets from surface two-dimensional (2D) fluorescence projection data remains a challenging task. In recent years, deep learning-based FMT reconstruction has received considerable attention, demonstrating superior performance compared to conventional methods, particularly combined with Transformers. Unlike convolutional architectures that emphasize local context, Transformers leverage self-attention mechanisms to excel at capturing long-range dependencies, thereby enhancing FMT reconstruction accuracy. Nevertheless, the quadratic computational complexity of self-attention poses a bottleneck, particularly pertinent in 3D FMT reconstructions. This paper aims to propose a novel Transformer-based FMT reconstruction algorithm that not only delivers high-quality reconstruction accuracy but also maintains excellent performance in efficiency and inference speed. The key design involves introducing a novel Spatial-Channel Paired Attention Module (SC-PAM), which employs a pair of interdependent branches based on spatial and channel attention, thus effectively learn discriminative features in both spatial and channel domains, meanwhile exhibiting linear complexity relative to the input projection size. Furthermore, to facilitate data transmission between the spatial and channel branches, we share the weights of the query and key mapping functions, which provides a complementary paired attention without elevating complexity. Extensive evaluations through numerical simulations and in vivo experiments were performed to validate effectiveness of the proposed model. The results show that our PAH2T-Former method achieves the highest Dice while reducing model parameters and complexity.\",\"PeriodicalId\":56022,\"journal\":{\"name\":\"IEEE Transactions on Computational Imaging\",\"volume\":\"11 \",\"pages\":\"536-545\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Computational Imaging\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10960347/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Imaging","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10960347/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
PAH2T-Former: Paired-Attention Hybrid Hierarchical Transformer for Synergistically Enhanced FMT Reconstruction Quality and Efficiency
Fluorescence molecular tomography (FMT) is a sensitive optical imaging technique that can achieve three-dimensional (3D) tomographic images at the molecular and cellular levels. However, reconstructing the internal 3D distribution of fluorescent targets from surface two-dimensional (2D) fluorescence projection data remains a challenging task. In recent years, deep learning-based FMT reconstruction has received considerable attention, demonstrating superior performance compared to conventional methods, particularly combined with Transformers. Unlike convolutional architectures that emphasize local context, Transformers leverage self-attention mechanisms to excel at capturing long-range dependencies, thereby enhancing FMT reconstruction accuracy. Nevertheless, the quadratic computational complexity of self-attention poses a bottleneck, particularly pertinent in 3D FMT reconstructions. This paper aims to propose a novel Transformer-based FMT reconstruction algorithm that not only delivers high-quality reconstruction accuracy but also maintains excellent performance in efficiency and inference speed. The key design involves introducing a novel Spatial-Channel Paired Attention Module (SC-PAM), which employs a pair of interdependent branches based on spatial and channel attention, thus effectively learn discriminative features in both spatial and channel domains, meanwhile exhibiting linear complexity relative to the input projection size. Furthermore, to facilitate data transmission between the spatial and channel branches, we share the weights of the query and key mapping functions, which provides a complementary paired attention without elevating complexity. Extensive evaluations through numerical simulations and in vivo experiments were performed to validate effectiveness of the proposed model. The results show that our PAH2T-Former method achieves the highest Dice while reducing model parameters and complexity.
期刊介绍:
The IEEE Transactions on Computational Imaging will publish articles where computation plays an integral role in the image formation process. Papers will cover all areas of computational imaging ranging from fundamental theoretical methods to the latest innovative computational imaging system designs. Topics of interest will include advanced algorithms and mathematical techniques, model-based data inversion, methods for image and signal recovery from sparse and incomplete data, techniques for non-traditional sensing of image data, methods for dynamic information acquisition and extraction from imaging sensors, software and hardware for efficient computation in imaging systems, and highly novel imaging system design.