Muhammad Nadeem Cheema, Lei Zhang, Anam Nazir, Yiran Li, John A Detre, Ze Wang
Title: Transformer-based arterial spin labeling perfusion MRI denoising
DOI: 10.1007/s00371-025-04061-x (https://doi.org/10.1007/s00371-025-04061-x)
Journal: Visual Computer (Q2, Computer Science, Software Engineering; IF 2.9)
Published: 2025-07-03 (Journal Article)
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12366763/pdf/
Citations: 0
Abstract
Arterial Spin Labeling (ASL) perfusion MRI is the only non-invasive technique for quantifying regional cerebral blood flow (CBF), an important physiological variable. ASL MRI has a relatively low signal-to-noise ratio (SNR), making it challenging to achieve high-quality CBF images from limited data. Recent convolutional neural network (CNN)-based methods have shown promising ASL CBF denoising results, but a common problem of these methods is the loss of output image texture and image intensity variability. To address this problem, we propose a Hybrid U-Net and Swin Transformer (HUST) ASL CBF denoising method. Transformers explicitly encode the spatial positions of input data and can learn features with long-range dependencies; these features can substantially mitigate image blurring and preserve individual data variability. We used the U-Net as the network backbone because of its demonstrated capability for capturing local and global features, and replaced the original CNN layers with transformers. The Swin Transformer was used to reduce the number of parameters required by a regular transformer for image denoising; the reduction is achieved through a hierarchical structure together with a shifted-window-based attention mechanism. The proposed method was trained and tested on 2D and 3D ASL CBF images, and HUST substantially improved CBF image visualization and preserved image textures. The 2D data were acquired from 277 normal healthy subjects aged 23 to 47 (110 males, 167 females). The 3D data (110 subjects) were pooled from a local database and were acquired using our background-suppressed 3D stack-of-spirals fast-spin-echo pseudo-continuous ASL sequence [27-30]. HUST makes it possible to substantially reduce the data acquisition time without compromising CBF quantification quality.
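The shifted-window attention mentioned above is what keeps the transformer affordable: self-attention is computed within small non-overlapping windows, with a cyclic half-window shift between successive blocks so that neighboring windows exchange information, so the quadratic attention cost scales with the window size rather than the full image size. A minimal NumPy sketch of the partitioning step (not the authors' implementation; the helper names `window_partition` and `shifted` are illustrative):

```python
import numpy as np

def window_partition(x, ws):
    """Split an (H, W, C) feature map into (num_windows, ws*ws, C) token groups.

    Attention is then computed inside each window: cost per window is
    O((ws*ws)**2) instead of O((H*W)**2) for global attention.
    """
    H, W, C = x.shape
    x = x.reshape(H // ws, ws, W // ws, ws, C)
    return x.transpose(0, 2, 1, 3, 4).reshape(-1, ws * ws, C)

def shifted(x, ws):
    # Cyclically shift the map by half a window before partitioning,
    # so successive blocks see windows that straddle the previous boundaries.
    return window_partition(np.roll(x, shift=(-(ws // 2), -(ws // 2)), axis=(0, 1)), ws)

# Toy feature map: an 8x8 grid of 16-channel tokens, window size 4
feat = np.random.rand(8, 8, 16)
regular = window_partition(feat, 4)  # 4 windows of 16 tokens each
shift = shifted(feat, 4)             # same shapes, offset grouping
```

Within each window, standard multi-head self-attention is applied over the `ws*ws` tokens; the hierarchical structure then merges windows across stages, mirroring the U-Net's coarsening path.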
HUST outperforms three state-of-the-art methods on both 2D and 3D ASL perfusion MRI data, achieving a higher mean PSNR (45.15 for 3D, 33.67 for 2D) and SSIM (0.99 for 3D, 0.96 for 2D), indicating superior image quality and closer resemblance to the reference image.
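PSNR and SSIM are the standard fidelity metrics behind these numbers. A minimal NumPy sketch of both (the SSIM here is a single global window for illustration; reported SSIM values are typically computed over local windows, e.g. as in scikit-image's `structural_similarity`):

```python
import numpy as np

def psnr(x, y, data_range=1.0):
    """Peak signal-to-noise ratio in dB: 10*log10(MAX^2 / MSE)."""
    mse = np.mean((x - y) ** 2)
    return 10.0 * np.log10(data_range ** 2 / mse)

def ssim_global(x, y, data_range=1.0):
    """Global (single-window) SSIM with the standard stability constants."""
    C1 = (0.01 * data_range) ** 2
    C2 = (0.03 * data_range) ** 2
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + C1) * (2 * cov + C2)) / \
           ((mx ** 2 + my ** 2 + C1) * (vx + vy + C2))
```

For example, a uniform error of 0.1 on a [0, 1]-scaled image gives an MSE of 0.01 and hence a PSNR of 20 dB; the reported 45.15 dB (3D) corresponds to a far smaller residual against the reference CBF map.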
Journal introduction:
The Visual Computer publishes articles on all research fields of capturing, recognizing, modelling, analysing and generating shapes and images. It includes image understanding, machine learning for graphics and 3D fabrication.
3D Reconstruction
Computer Animation
Computational Fabrication
Computational Geometry
Computational Photography
Computer Vision for Computer Graphics
Data Compression for Graphics
Geometric Modelling
Geometric Processing
HCI and Computer Graphics
Human Modelling
Image Analysis
Image Based Rendering
Image Processing
Machine Learning for Graphics
Medical Imaging
Pattern Recognition
Physically Based Modelling
Illumination and Rendering Methods
Robotics and Vision
Saliency Methods
Scientific Visualization
Shape and Surface Modelling
Shape Analysis and Image Retrieval
Shape Matching
Sketch-based Modelling
Solid Modelling
Stylized Rendering
Textures
Virtual and Augmented Reality
Visual Analytics
Volume Rendering
All papers are subject to thorough review and, if accepted, will be revised accordingly.
Original contributions describing advances in theory in the above-mentioned fields, as well as practical results and original applications, are invited.