CT-Denoimer: efficient contextual transformer network for low-dose CT denoising.

IF 3.3 3区医学 Q2 ENGINEERING, BIOMEDICAL

Physics in medicine and biology Pub Date : 2025-06-06 DOI:10.1088/1361-6560/addea6

Yuanke Zhang, Fan Xu, Rui Zhang, Yanfei Guo, Hanxiang Wang, Bingbing Wei, Fei Ma, Jing Meng, Jianlei Liu, Hongbing Lu, Yang Chen

{"title":"CT-Denoimer: efficient contextual transformer network for low-dose CT denoising.","authors":"Yuanke Zhang, Fan Xu, Rui Zhang, Yanfei Guo, Hanxiang Wang, Bingbing Wei, Fei Ma, Jing Meng, Jianlei Liu, Hongbing Lu, Yang Chen","doi":"10.1088/1361-6560/addea6","DOIUrl":null,"url":null,"abstract":"Objective. Low-dose computed tomography (LDCT) effectively reduces radiation exposure to patients, but introduces severe noise artifacts that affect diagnostic accuracy. Recently, Transformer-based network architectures have been widely applied to LDCT image denoising, generally achieving superior results compared to traditional convolutional methods. However, these methods are often hindered by high computational costs and struggles in capturing complex local contextual features, which negatively impact denoising performanceApproach. In this work, we propose CT-Denoimer, an efficient CT Denoising Transformer network that captures both global correlations and intricate, spatially varying local contextual details in CT images, enabling the generation of high-quality images. The core of our framework is a Transformer module that consists of two key components: the multi-Dconv head transposed attention (MDTA) and the mixed contextual feed-forward network (MCFN). The MDTA block captures global correlations in the image with linear computational complexity, while the MCFN block manages multi-scale local contextual information, both static and dynamic, through a series of Enhanced Contextual Transformer modules. In addition, we incorporate operation-wise attention layers to enable collaborative refinement in the proposed CT-Denoimer, enhancing its ability to more effectively handle complex and varying noise patterns in LDCT imagesMain results. Extensive experimental validation on both the AAPM-Mayo public dataset and a real-world clinical dataset demonstrated the state-of-the-art performance of the proposed CT-Denoimer. It achieved a peak signal-to-noise ratio of 33.681 dB, a structural similarity index measure of 0.921, an information fidelity criterion of 2.857 and a visual information fidelity of 0.349. Subjective assessment by radiologists gave an average score of 4.39, confirming its clinical applicability and clear advantages over existing methodsSignificance. This study presents an innovative CT denoising Transformer network that sets a new benchmark in LDCT image denoising, excelling in both noise reduction and fine structure preservation.","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/addea6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Objective. Low-dose computed tomography (LDCT) effectively reduces radiation exposure to patients, but introduces severe noise artifacts that affect diagnostic accuracy. Recently, Transformer-based network architectures have been widely applied to LDCT image denoising, generally achieving superior results compared to traditional convolutional methods. However, these methods are often hindered by high computational costs and struggles in capturing complex local contextual features, which negatively impact denoising performanceApproach. In this work, we propose CT-Denoimer, an efficient CT Denoising Transformer network that captures both global correlations and intricate, spatially varying local contextual details in CT images, enabling the generation of high-quality images. The core of our framework is a Transformer module that consists of two key components: the multi-Dconv head transposed attention (MDTA) and the mixed contextual feed-forward network (MCFN). The MDTA block captures global correlations in the image with linear computational complexity, while the MCFN block manages multi-scale local contextual information, both static and dynamic, through a series of Enhanced Contextual Transformer modules. In addition, we incorporate operation-wise attention layers to enable collaborative refinement in the proposed CT-Denoimer, enhancing its ability to more effectively handle complex and varying noise patterns in LDCT imagesMain results. Extensive experimental validation on both the AAPM-Mayo public dataset and a real-world clinical dataset demonstrated the state-of-the-art performance of the proposed CT-Denoimer. It achieved a peak signal-to-noise ratio of 33.681 dB, a structural similarity index measure of 0.921, an information fidelity criterion of 2.857 and a visual information fidelity of 0.349. Subjective assessment by radiologists gave an average score of 4.39, confirming its clinical applicability and clear advantages over existing methodsSignificance. This study presents an innovative CT denoising Transformer network that sets a new benchmark in LDCT image denoising, excelling in both noise reduction and fine structure preservation.

查看原文本刊更多论文

CT去噪：用于低剂量CT去噪的高效情境变压器网络。

目的：低剂量计算机断层扫描（LDCT）有效地减少了患者的辐射暴露，但引入了严重的噪声伪影，影响了诊断的准确性。近年来，基于变压器的网络结构在LDCT图像去噪中得到了广泛的应用，总体效果优于传统的卷积方法。然而，这些方法往往受到高计算成本和捕获复杂的局部上下文特征的困难的阻碍，这对去噪性能产生了负面影响。方法：在这项工作中，我们提出了CT- denoimer，这是一种高效的CT去噪变压器网络，可以捕获CT图像中的全局相关性和复杂的、空间变化的局部上下文细节，从而生成高质量的图像。我们的框架的核心是一个Transformer模块，它由两个关键组件组成：Multi-Dconv头转位注意（MDTA）和混合上下文前馈网络（MCFN）。MDTA块通过线性计算复杂性捕获图像中的全局相关性，而MCFN块通过一系列增强型上下文转换器（eCoT）模块管理静态和动态的多尺度局部上下文信息。此外，我们结合了操作智能注意层（OWALs），使所提出的ct -去噪体能够进行协作改进，增强其更有效地处理LDCT图像中复杂和变化的噪声模式的能力。主要结果：在AAPM-Mayo公共数据集和现实世界的临床数据集上进行了广泛的实验验证，证明了所提出的CT-Denoimer的最先进性能。峰值信噪比（PSNR）为33.681 dB，结构相似度指数（SSIM）为0.921，信息保真度标准（IFC）为2.857，视觉信息保真度（VIF）为0.349。放射科医师的主观评价平均得分为4.39分，证实了其临床适用性，与现有方法相比优势明显。意义：本研究提出了一种创新的CT去噪变压器网络，在LDCT图像去噪方面具有良好的降噪和精细结构保存性能，为LDCT图像去噪树立了新的标杆。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Physics in medicine and biology 医学-工程：生物医学

CiteScore

6.50

自引率

14.30%

发文量

409

审稿时长

2 months

期刊介绍： The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry