{"title":"Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models","authors":"Sho Ozaki, Shizuo Kaji, Toshikazu Imae, Kanabu Nawa, Hideomi Yamashita, Keiichi Nakagawa","doi":"arxiv-2408.03156","DOIUrl":null,"url":null,"abstract":"Image generative AI has garnered significant attention in recent years. In\nparticular, the diffusion model, a core component of recent generative AI,\nproduces high-quality images with rich diversity. In this study, we propose a\nnovel CT reconstruction method by combining the denoising diffusion\nprobabilistic model with iterative CT reconstruction. In sharp contrast to\nprevious studies, we optimize the fidelity loss of CT reconstruction with\nrespect to the latent variable of the diffusion model, instead of the image and\nmodel parameters. To suppress anatomical structure changes produced by the\ndiffusion model, we shallow the diffusion and reverse processes, and fix a set\nof added noises in the reverse process to make it deterministic during\ninference. We demonstrate the effectiveness of the proposed method through\nsparse view CT reconstruction of 1/10 view projection data. Despite the\nsimplicity of the implementation, the proposed method shows the capability of\nreconstructing high-quality images while preserving the patient's anatomical\nstructure, and outperforms existing methods including iterative reconstruction,\niterative reconstruction with total variation, and the diffusion model alone in\nterms of quantitative indices such as SSIM and PSNR. We also explore further\nsparse view CT using 1/20 view projection data with the same trained diffusion\nmodel. As the number of iterations increases, image quality improvement\ncomparable to that of 1/10 sparse view CT reconstruction is achieved. In\nprinciple, the proposed method can be widely applied not only to CT but also to\nother imaging modalities such as MRI, PET, and SPECT.","PeriodicalId":501378,"journal":{"name":"arXiv - PHYS - Medical Physics","volume":"91 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Medical Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.03156","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Image generative AI has garnered significant attention in recent years. In
particular, the diffusion model, a core component of recent generative AI,
produces high-quality images with rich diversity. In this study, we propose a
novel CT reconstruction method by combining the denoising diffusion
probabilistic model with iterative CT reconstruction. In sharp contrast to
previous studies, we optimize the fidelity loss of CT reconstruction with
respect to the latent variable of the diffusion model, instead of the image and
model parameters. To suppress anatomical structure changes produced by the
diffusion model, we shallow the diffusion and reverse processes, and fix a set
of added noises in the reverse process to make it deterministic during
inference. We demonstrate the effectiveness of the proposed method through
sparse view CT reconstruction of 1/10 view projection data. Despite the
simplicity of the implementation, the proposed method shows the capability of
reconstructing high-quality images while preserving the patient's anatomical
structure, and outperforms existing methods including iterative reconstruction,
iterative reconstruction with total variation, and the diffusion model alone in
terms of quantitative indices such as SSIM and PSNR. We also explore further
sparse view CT using 1/20 view projection data with the same trained diffusion
model. As the number of iterations increases, image quality improvement
comparable to that of 1/10 sparse view CT reconstruction is achieved. In
principle, the proposed method can be widely applied not only to CT but also to
other imaging modalities such as MRI, PET, and SPECT.