MRI motion correction via efficient residual-guided denoising diffusion probabilistic models

Mojtaba Safari, Shansong Wang, Qiang Li, Zach Eidex, Richard L J Qiu, Chih-Wei Chang, Hui Mao, Xiaofeng Yang
Abstract
Purpose: Motion artifacts in magnetic resonance imaging (MRI) significantly degrade image quality and hinder quantitative downstream applications. Conventional methods to mitigate these artifacts, including repeated acquisitions or motion tracking, impose substantial financial and workflow burdens. This study introduces Res-MoCoDiff, an efficient denoising diffusion probabilistic model specifically designed for MRI motion artifact correction.
Methods: Res-MoCoDiff exploits a novel residual error shifting mechanism during the forward diffusion process to incorporate information from motion-corrupted images. This mechanism allows the model to simulate the evolution of noise with a probability distribution closely matching that of the corrupted data, enabling a reverse diffusion process that requires only four steps. The model employs a U-net backbone, with conventional attention layers replaced by Swin Transformer blocks, to enhance robustness across various image resolutions. Furthermore, the training process integrates a combined ℓ1 + ℓ2 loss function, which promotes image sharpness and reduces pixel-level errors. Res-MoCoDiff was evaluated on both an in-silico dataset generated using a realistic motion simulation framework and an in-vivo MR-ART dataset. Comparative analyses were conducted against established methods, including CycleGAN, Pix2pix, and a diffusion model with a vision transformer backbone (MT-DDPM), using quantitative metrics such as peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and normalized mean squared error (NMSE).
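To make the residual-shifting idea concrete, the following is a minimal sketch, not the authors' implementation, of a residual-shifting forward-diffusion step and a combined ℓ1 + ℓ2 training objective. The schedule value eta_t, the noise scale kappa, and the loss weights are illustrative assumptions; the paper's exact formulation may differ.

```python
import torch
import torch.nn.functional as F

def residual_shift_forward(x0, y, eta_t, kappa=0.2):
    """One forward-diffusion step of a residual-shifting process (sketch).

    x0    : clean (motion-free) image batch
    y     : motion-corrupted image batch
    eta_t : scalar in [0, 1]; fraction of the residual (y - x0) shifted in at step t
    kappa : noise scale (illustrative value, not from the paper)
    """
    residual = y - x0                     # motion-induced residual error
    noise = torch.randn_like(x0)          # standard Gaussian noise
    # Shift the clean image toward the corrupted one and add scaled noise, so the
    # terminal state resembles the corrupted image rather than pure Gaussian noise.
    return x0 + eta_t * residual + kappa * (eta_t ** 0.5) * noise

def combined_l1_l2_loss(pred, target, w1=1.0, w2=1.0):
    """Combined l1 + l2 objective: l1 favors sharpness, l2 penalizes pixel errors."""
    return w1 * F.l1_loss(pred, target) + w2 * F.mse_loss(pred, target)
```

Because the forward process is anchored to the corrupted image rather than to pure noise, the reverse process has far less distance to traverse, which is what allows the paper's four-step sampling.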
Results: The proposed method demonstrated superior performance in removing motion artifacts across minor, moderate, and heavy distortion levels. Res-MoCoDiff consistently achieved the highest SSIM and the lowest NMSE values, with a PSNR of up to 41.91 ± 2.94 dB for minor distortions. Notably, the average sampling time was reduced to 0.37 seconds per batch of two image slices, compared with 101.74 seconds for conventional approaches.
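For reference, the reported metrics follow their standard definitions; a small sketch of PSNR and NMSE is given below, with SSIM delegated to scikit-image. The data_range handling is an assumption about intensity scaling, not taken from the paper's evaluation code.

```python
import numpy as np
from skimage.metrics import structural_similarity as ssim

def psnr(ref, img, data_range=None):
    """Peak signal-to-noise ratio in dB between a reference and a test image."""
    data_range = (ref.max() - ref.min()) if data_range is None else data_range
    mse = np.mean((ref - img) ** 2)
    return 10.0 * np.log10(data_range ** 2 / mse)

def nmse(ref, img):
    """Normalized mean squared error: ||ref - img||^2 / ||ref||^2."""
    return np.sum((ref - img) ** 2) / np.sum(ref ** 2)

# Example usage with a motion-free reference slice and a corrected slice
# (2-D float arrays on the same intensity scale):
# print(psnr(ref, corrected), nmse(ref, corrected),
#       ssim(ref, corrected, data_range=float(ref.max() - ref.min())))
```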
Conclusion: Res-MoCoDiff offers a robust and efficient solution for correcting MRI motion artifacts, preserving fine structural details while significantly reducing computational overhead. Its rapid processing speed and high restoration fidelity underscore its potential for seamless integration into clinical workflows, ultimately enhancing diagnostic and treatment accuracy and patient care.