{"title":"Semi-Supervised MVCT Enhancement Using Diffusion Model Refined With KVCT Priors","authors":"Mengxun Zheng;Long Tang;Peiwen Liang;Shuang Jin;Xiaotong Xu;Zhe Su;Hua Zhang","doi":"10.1109/TRPMS.2025.3529582","DOIUrl":null,"url":null,"abstract":"Megavoltage computed tomography (MVCT) on the tomotherapy system has been widely used as a tomographic imaging modality for image-guided radiotherapy. However, the quality of MVCT images is often compromised by poor tissue contrast and significant noise. Conventional networks designed to enhance CT quality typically require the clean ground-truth images, which are not feasible for MVCT. In this study, we introduce a semi-supervised framework named Semi-Diff, which leverages the denoising diffusion probabilistic model and the prior information sourced from kilovoltage computed tomography (KVCT) to address challenges in MVCT enhancement. Specifically, employing a discriminative prior learning method, we first learn a mapping function to estimate MVCT noise and perform state matching. With this state matching dictionary, we then represent the MVCT image as a sample from an intermediate posterior distribution within the diffusion Markov chain, which enables the reverse conditional sampling process of the diffusion model to start directly from the noisy MVCT images. To fully explore the prior information from the plan KVCT images of the same patients, we introduce a novel diffusion base network called RefNet, whose dynamic feature aggregation module can extract and align the relevant features from reference KVCT image to enhance image restoration performance. Quantitative evaluations using simulated digital phantom data show that the proposed Semi-Diff model achieves the average FSIM score of 0.954, PSNR score of 33.22 dB, and RMSE value of 0.023, demonstrating improvements of approximately 2.16% in FSIM, 0.59% in PSNR, and a reduction of 3.58% in RMSE compared to the best-performing baseline method. Results from physical phantom and patient data further validate the model’s superior performance in noise suppression and structural preservation.","PeriodicalId":46807,"journal":{"name":"IEEE Transactions on Radiation and Plasma Medical Sciences","volume":"9 5","pages":"667-679"},"PeriodicalIF":4.6000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Radiation and Plasma Medical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10840351/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Megavoltage computed tomography (MVCT) on the tomotherapy system has been widely used as a tomographic imaging modality for image-guided radiotherapy. However, the quality of MVCT images is often compromised by poor tissue contrast and significant noise. Conventional networks designed to enhance CT quality typically require the clean ground-truth images, which are not feasible for MVCT. In this study, we introduce a semi-supervised framework named Semi-Diff, which leverages the denoising diffusion probabilistic model and the prior information sourced from kilovoltage computed tomography (KVCT) to address challenges in MVCT enhancement. Specifically, employing a discriminative prior learning method, we first learn a mapping function to estimate MVCT noise and perform state matching. With this state matching dictionary, we then represent the MVCT image as a sample from an intermediate posterior distribution within the diffusion Markov chain, which enables the reverse conditional sampling process of the diffusion model to start directly from the noisy MVCT images. To fully explore the prior information from the plan KVCT images of the same patients, we introduce a novel diffusion base network called RefNet, whose dynamic feature aggregation module can extract and align the relevant features from reference KVCT image to enhance image restoration performance. Quantitative evaluations using simulated digital phantom data show that the proposed Semi-Diff model achieves the average FSIM score of 0.954, PSNR score of 33.22 dB, and RMSE value of 0.023, demonstrating improvements of approximately 2.16% in FSIM, 0.59% in PSNR, and a reduction of 3.58% in RMSE compared to the best-performing baseline method. Results from physical phantom and patient data further validate the model’s superior performance in noise suppression and structural preservation.