基于有限角度测量的扩散和低秩先验的视觉语言模型辅助光谱CT重建。

IF 3.4 3区医学 Q2 ENGINEERING, BIOMEDICAL

Physics in medicine and biology Pub Date : 2025-09-19 DOI:10.1088/1361-6560/ae0974

Yizhong Wang, Ningning Liang, Junru Ren, Xinrui Zhang, Ye Shen, Ailong Cai, Zhizhong Zheng, Lei Li, Bin Yan

{"title":"基于有限角度测量的扩散和低秩先验的视觉语言模型辅助光谱CT重建。","authors":"Yizhong Wang, Ningning Liang, Junru Ren, Xinrui Zhang, Ye Shen, Ailong Cai, Zhizhong Zheng, Lei Li, Bin Yan","doi":"10.1088/1361-6560/ae0974","DOIUrl":null,"url":null,"abstract":"Objective: Spectral computed tomography (CT) is a critical tool in clinical practice, offering capabilities in multi-energy spectrum imaging and material identification. The limited-angle (LA) scanning strategy has attracted attention for its advantages in fast data acquisition and reduced radiation exposure, aligning with the as low as reasonably achievable principle. However, most deep learning-based methods require separate models for each LA setting, which limits their flexibility in adapting to new conditions. In this study, we developed a novel Visual-Language model-assisted Spectral CT Reconstruction (VLSR) method to address LA artifacts and enable multi-setting adaptation within a single model.Approach: The VLSR method integrates the image-text perception ability of visual-language models and the image generation potential of diffusion models. Prompt engineering is introduced to better represent LA artifact characteristics, further improving artifact accuracy. Additionally, a collaborative sampling framework combining data consistency, low-rank regularization, and image-domain diffusion models is developed to produce high-quality and consistent spectral CT reconstructions.Main results: The performance of VLSR is superior to other comparison methods. Under the scanning angles of 90° and 60° for simulated data, the VLSR method improves peak signal noise ratio by at least 0.41 dB and 1.13 dB compared with other methods.Significance: VLSR method can reconstruct high-quality spectral CT images under diverse LA configurations, allowing faster and more flexible scans with dose reductions.","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Visual language model-assisted spectral CT reconstruction by diffusion and low-rank priors from limited-angle measurements.\",\"authors\":\"Yizhong Wang, Ningning Liang, Junru Ren, Xinrui Zhang, Ye Shen, Ailong Cai, Zhizhong Zheng, Lei Li, Bin Yan\",\"doi\":\"10.1088/1361-6560/ae0974\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective: Spectral computed tomography (CT) is a critical tool in clinical practice, offering capabilities in multi-energy spectrum imaging and material identification. The limited-angle (LA) scanning strategy has attracted attention for its advantages in fast data acquisition and reduced radiation exposure, aligning with the as low as reasonably achievable principle. However, most deep learning-based methods require separate models for each LA setting, which limits their flexibility in adapting to new conditions. In this study, we developed a novel Visual-Language model-assisted Spectral CT Reconstruction (VLSR) method to address LA artifacts and enable multi-setting adaptation within a single model.Approach: The VLSR method integrates the image-text perception ability of visual-language models and the image generation potential of diffusion models. Prompt engineering is introduced to better represent LA artifact characteristics, further improving artifact accuracy. Additionally, a collaborative sampling framework combining data consistency, low-rank regularization, and image-domain diffusion models is developed to produce high-quality and consistent spectral CT reconstructions.Main results: The performance of VLSR is superior to other comparison methods. Under the scanning angles of 90° and 60° for simulated data, the VLSR method improves peak signal noise ratio by at least 0.41 dB and 1.13 dB compared with other methods.Significance: VLSR method can reconstruct high-quality spectral CT images under diverse LA configurations, allowing faster and more flexible scans with dose reductions.\",\"PeriodicalId\":20185,\"journal\":{\"name\":\"Physics in medicine and biology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physics in medicine and biology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6560/ae0974\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/ae0974","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

目的：光谱计算机断层扫描（CT）是临床实践中的重要工具，提供了多能谱成像和材料识别的能力。有限角度（LA）扫描策略以其快速获取数据和减少辐射暴露的优点而受到关注，符合尽可能低的合理可达原则。然而，大多数基于深度学习的方法需要为每个LA设置单独的模型，这限制了它们适应新条件的灵活性。在这项研究中，我们开发了一种新的视觉语言模型辅助光谱CT重建（VLSR）方法来处理LA伪影，并在单个模型中实现多设置适应。方法：VLSR方法综合了视觉语言模型的图像-文本感知能力和扩散模型的图像生成潜力。引入提示工程，更好地表达LA工件特性，进一步提高工件精度。此外，开发了一种结合数据一致性、低秩正则化和图像域扩散模型的协作采样框架，以产生高质量和一致的光谱CT重建。主要结果：VLSR的性能优于其他比较方法。在模拟数据的90°和60°扫描角度下，与其他方法相比，VLSR方法的峰值信噪比分别提高了0.41 dB和1.13 dB。意义：VLSR方法可以在不同的LA配置下重建高质量的光谱CT图像，使扫描速度更快、更灵活、剂量更小。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Visual language model-assisted spectral CT reconstruction by diffusion and low-rank priors from limited-angle measurements.

Objective: Spectral computed tomography (CT) is a critical tool in clinical practice, offering capabilities in multi-energy spectrum imaging and material identification. The limited-angle (LA) scanning strategy has attracted attention for its advantages in fast data acquisition and reduced radiation exposure, aligning with the as low as reasonably achievable principle. However, most deep learning-based methods require separate models for each LA setting, which limits their flexibility in adapting to new conditions. In this study, we developed a novel Visual-Language model-assisted Spectral CT Reconstruction (VLSR) method to address LA artifacts and enable multi-setting adaptation within a single model.

Approach: The VLSR method integrates the image-text perception ability of visual-language models and the image generation potential of diffusion models. Prompt engineering is introduced to better represent LA artifact characteristics, further improving artifact accuracy. Additionally, a collaborative sampling framework combining data consistency, low-rank regularization, and image-domain diffusion models is developed to produce high-quality and consistent spectral CT reconstructions.

Main results: The performance of VLSR is superior to other comparison methods. Under the scanning angles of 90° and 60° for simulated data, the VLSR method improves peak signal noise ratio by at least 0.41 dB and 1.13 dB compared with other methods.

Significance: VLSR method can reconstruct high-quality spectral CT images under diverse LA configurations, allowing faster and more flexible scans with dose reductions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Physics in medicine and biology 医学-工程：生物医学

CiteScore

6.50

自引率

14.30%

发文量

409

审稿时长

2 months

期刊介绍： The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry