利用KVCT先验改进扩散模型的半监督MVCT增强

IF 3.5 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

IEEE Transactions on Radiation and Plasma Medical Sciences Pub Date : 2025-01-15 DOI:10.1109/TRPMS.2025.3529582

Mengxun Zheng;Long Tang;Peiwen Liang;Shuang Jin;Xiaotong Xu;Zhe Su;Hua Zhang

{"title":"利用KVCT先验改进扩散模型的半监督MVCT增强","authors":"Mengxun Zheng;Long Tang;Peiwen Liang;Shuang Jin;Xiaotong Xu;Zhe Su;Hua Zhang","doi":"10.1109/TRPMS.2025.3529582","DOIUrl":null,"url":null,"abstract":"Megavoltage computed tomography (MVCT) on the tomotherapy system has been widely used as a tomographic imaging modality for image-guided radiotherapy. However, the quality of MVCT images is often compromised by poor tissue contrast and significant noise. Conventional networks designed to enhance CT quality typically require the clean ground-truth images, which are not feasible for MVCT. In this study, we introduce a semi-supervised framework named Semi-Diff, which leverages the denoising diffusion probabilistic model and the prior information sourced from kilovoltage computed tomography (KVCT) to address challenges in MVCT enhancement. Specifically, employing a discriminative prior learning method, we first learn a mapping function to estimate MVCT noise and perform state matching. With this state matching dictionary, we then represent the MVCT image as a sample from an intermediate posterior distribution within the diffusion Markov chain, which enables the reverse conditional sampling process of the diffusion model to start directly from the noisy MVCT images. To fully explore the prior information from the plan KVCT images of the same patients, we introduce a novel diffusion base network called RefNet, whose dynamic feature aggregation module can extract and align the relevant features from reference KVCT image to enhance image restoration performance. Quantitative evaluations using simulated digital phantom data show that the proposed Semi-Diff model achieves the average FSIM score of 0.954, PSNR score of 33.22 dB, and RMSE value of 0.023, demonstrating improvements of approximately 2.16% in FSIM, 0.59% in PSNR, and a reduction of 3.58% in RMSE compared to the best-performing baseline method. Results from physical phantom and patient data further validate the model’s superior performance in noise suppression and structural preservation.","PeriodicalId":46807,"journal":{"name":"IEEE Transactions on Radiation and Plasma Medical Sciences","volume":"9 5","pages":"667-679"},"PeriodicalIF":3.5000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semi-Supervised MVCT Enhancement Using Diffusion Model Refined With KVCT Priors\",\"authors\":\"Mengxun Zheng;Long Tang;Peiwen Liang;Shuang Jin;Xiaotong Xu;Zhe Su;Hua Zhang\",\"doi\":\"10.1109/TRPMS.2025.3529582\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Megavoltage computed tomography (MVCT) on the tomotherapy system has been widely used as a tomographic imaging modality for image-guided radiotherapy. However, the quality of MVCT images is often compromised by poor tissue contrast and significant noise. Conventional networks designed to enhance CT quality typically require the clean ground-truth images, which are not feasible for MVCT. In this study, we introduce a semi-supervised framework named Semi-Diff, which leverages the denoising diffusion probabilistic model and the prior information sourced from kilovoltage computed tomography (KVCT) to address challenges in MVCT enhancement. Specifically, employing a discriminative prior learning method, we first learn a mapping function to estimate MVCT noise and perform state matching. With this state matching dictionary, we then represent the MVCT image as a sample from an intermediate posterior distribution within the diffusion Markov chain, which enables the reverse conditional sampling process of the diffusion model to start directly from the noisy MVCT images. To fully explore the prior information from the plan KVCT images of the same patients, we introduce a novel diffusion base network called RefNet, whose dynamic feature aggregation module can extract and align the relevant features from reference KVCT image to enhance image restoration performance. Quantitative evaluations using simulated digital phantom data show that the proposed Semi-Diff model achieves the average FSIM score of 0.954, PSNR score of 33.22 dB, and RMSE value of 0.023, demonstrating improvements of approximately 2.16% in FSIM, 0.59% in PSNR, and a reduction of 3.58% in RMSE compared to the best-performing baseline method. Results from physical phantom and patient data further validate the model’s superior performance in noise suppression and structural preservation.\",\"PeriodicalId\":46807,\"journal\":{\"name\":\"IEEE Transactions on Radiation and Plasma Medical Sciences\",\"volume\":\"9 5\",\"pages\":\"667-679\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Radiation and Plasma Medical Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10840351/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Radiation and Plasma Medical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10840351/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

巨压计算机断层扫描（MVCT）在断层治疗系统上作为一种断层成像方式已被广泛应用于图像引导放射治疗。然而，MVCT图像的质量往往受到组织对比度差和显著噪声的影响。用于提高CT质量的传统网络通常需要干净的真地图像，这对于MVCT来说是不可行的。在这项研究中，我们引入了一个名为Semi-Diff的半监督框架，它利用去噪扩散概率模型和来自千伏计算机断层扫描（KVCT）的先验信息来解决MVCT增强中的挑战。具体而言，我们采用判别先验学习方法，首先学习映射函数来估计MVCT噪声并进行状态匹配。利用该状态匹配字典，我们将MVCT图像表示为扩散马尔可夫链内中间后验分布的样本，这使得扩散模型的反向条件采样过程可以直接从带噪声的MVCT图像开始。为了充分挖掘相同患者的计划KVCT图像中的先验信息，我们引入了一种新的扩散基网络RefNet，其动态特征聚合模块可以从参考KVCT图像中提取和对齐相关特征，以提高图像恢复性能。利用模拟数字幻影数据进行的定量评估表明，所提出的半差分模型的FSIM平均得分为0.954，PSNR得分为33.22 dB， RMSE值为0.023，与最佳基准方法相比，FSIM提高了约2.16%，PSNR提高了0.59%，RMSE降低了3.58%。物理模型和患者数据的结果进一步验证了该模型在噪声抑制和结构保存方面的优越性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Semi-Supervised MVCT Enhancement Using Diffusion Model Refined With KVCT Priors

Megavoltage computed tomography (MVCT) on the tomotherapy system has been widely used as a tomographic imaging modality for image-guided radiotherapy. However, the quality of MVCT images is often compromised by poor tissue contrast and significant noise. Conventional networks designed to enhance CT quality typically require the clean ground-truth images, which are not feasible for MVCT. In this study, we introduce a semi-supervised framework named Semi-Diff, which leverages the denoising diffusion probabilistic model and the prior information sourced from kilovoltage computed tomography (KVCT) to address challenges in MVCT enhancement. Specifically, employing a discriminative prior learning method, we first learn a mapping function to estimate MVCT noise and perform state matching. With this state matching dictionary, we then represent the MVCT image as a sample from an intermediate posterior distribution within the diffusion Markov chain, which enables the reverse conditional sampling process of the diffusion model to start directly from the noisy MVCT images. To fully explore the prior information from the plan KVCT images of the same patients, we introduce a novel diffusion base network called RefNet, whose dynamic feature aggregation module can extract and align the relevant features from reference KVCT image to enhance image restoration performance. Quantitative evaluations using simulated digital phantom data show that the proposed Semi-Diff model achieves the average FSIM score of 0.954, PSNR score of 33.22 dB, and RMSE value of 0.023, demonstrating improvements of approximately 2.16% in FSIM, 0.59% in PSNR, and a reduction of 3.58% in RMSE compared to the best-performing baseline method. Results from physical phantom and patient data further validate the model’s superior performance in noise suppression and structural preservation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Radiation and Plasma Medical Sciences RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

8.00

自引率

18.20%

发文量

109