Generation model meets Swin transformer for unsupervised low-dose CT reconstruction

Yu Li, Xueqin Sun, SuKai Wang, Yingwei Qin, Jinxiao Pan, Ping Chen
{"title":"Generation model meets Swin transformer for unsupervised low-dose CT reconstruction","authors":"Yu Li, Xueqin Sun, SuKai Wang, Yingwei Qin, Jinxiao Pan, Ping Chen","doi":"10.1088/2632-2153/ad370e","DOIUrl":null,"url":null,"abstract":"\n Computed Tomography (CT) has evolved into an indispensable tool for clinical diagnosis. Reducing radiation dose crucially minimizes adverse effects but may introduce noise and artifacts in reconstructed images, affecting diagnostic processes for physicians. Scholars have tackled deep learning training instability by exploring diffusion models. Given the scarcity of clinical data, we propose the Unsupervised Image Domain Score Generation model (UISG) for low-dose CT reconstruction. During training, normal-dose CT images are utilized as network inputs to train a score-based generative model that captures the prior distribution of CT images. In the iterative reconstruction, the initial CT image is obtained using a filtered back-projection algorithm. Subsequently, diffusion-based prior, high-frequency convolutional sparse coding prior, and data-consistency steps are employed to obtain the high-quality reconstructed image. Given the global characteristics of noise, the score network of the diffusion model utilizes a swin transformer structure to enhance the model's ability to capture long-range dependencies. Furthermore, convolutional sparse coding is applied exclusively to the high-frequency components of the image, to prevent over-smoothing or the loss of crucial anatomical details during the denoising process. Quantitative and qualitative results indicate that UISG outperforms competing methods in terms of denoising and generalization performance.","PeriodicalId":503691,"journal":{"name":"Machine Learning: Science and Technology","volume":" 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Learning: Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2632-2153/ad370e","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Computed Tomography (CT) has evolved into an indispensable tool for clinical diagnosis. Reducing radiation dose crucially minimizes adverse effects but may introduce noise and artifacts in reconstructed images, affecting diagnostic processes for physicians. Scholars have tackled deep learning training instability by exploring diffusion models. Given the scarcity of clinical data, we propose the Unsupervised Image Domain Score Generation model (UISG) for low-dose CT reconstruction. During training, normal-dose CT images are utilized as network inputs to train a score-based generative model that captures the prior distribution of CT images. In the iterative reconstruction, the initial CT image is obtained using a filtered back-projection algorithm. Subsequently, diffusion-based prior, high-frequency convolutional sparse coding prior, and data-consistency steps are employed to obtain the high-quality reconstructed image. Given the global characteristics of noise, the score network of the diffusion model utilizes a swin transformer structure to enhance the model's ability to capture long-range dependencies. Furthermore, convolutional sparse coding is applied exclusively to the high-frequency components of the image, to prevent over-smoothing or the loss of crucial anatomical details during the denoising process. Quantitative and qualitative results indicate that UISG outperforms competing methods in terms of denoising and generalization performance.
符合斯温变换器的生成模型,用于无监督低剂量 CT 重建
计算机断层扫描(CT)已发展成为临床诊断不可或缺的工具。降低辐射剂量可以最大限度地减少不良影响,但可能会在重建图像中引入噪声和伪影,影响医生的诊断过程。学者们通过探索扩散模型解决了深度学习训练不稳定的问题。鉴于临床数据的匮乏,我们提出了用于低剂量 CT 重建的无监督图像域分数生成模型(UISG)。在训练过程中,正常剂量的 CT 图像被用作网络输入,以训练基于分数的生成模型,该模型能捕捉 CT 图像的先验分布。在迭代重建过程中,使用滤波反投影算法获得初始 CT 图像。随后,采用基于扩散的先验、高频卷积稀疏编码先验和数据一致性步骤来获得高质量的重建图像。考虑到噪声的全局特性,扩散模型的得分网络采用了swin变换器结构,以增强模型捕捉长程依赖性的能力。此外,卷积稀疏编码只应用于图像的高频成分,以防止在去噪过程中过度平滑或丢失关键的解剖细节。定量和定性结果表明,就去噪和泛化性能而言,UISG 优于其他竞争方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信