{"title":"用稀疏编码和数据自适应、自监督深度学习去噪儿童心脏光子计数CT数据","authors":"Darin P. Clark, Joseph Y. Cao, Cristian T. Badea","doi":"10.1002/mp.17918","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>The judicious use of CT in pediatric cardiac applications is warranted because young patients face the need for repeated imaging and increased lifetime cancer risk after ionizing radiation exposure. The quality of pediatric cardiac CT scans is variable because of limited protocols optimizations for pediatric patients, the common presence of metallic implants following treatment, and disparities in denoising algorithm performance between adult and pediatric scans. Two recent technological developments promise to improve the average quality of pediatric CT scans at fixed or reduced dose: clinical photon-counting CT (PCCT) and deep learning (DL) algorithms for CT image denoising. Given advancements to accommodate variable image quality, these technologies will deliver improved spatial resolution, noise performance, and contrast resolution for pediatric cardiac CT imaging.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>To advance self-supervised DL denoising methods to accommodate variable image quality in pediatric cardiac CT data.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Starting with the popular Vision Transformer (ViT) DL architecture, two targeted architectural changes were made: (1) the multi-layer perceptrons (MLPs) were modified to allow cross-token recombination of encoded image data following attention computations (parallels patch-wise weighting and averaging in non-local means [NLM]), and (2) the network head was replaced with the equivalent of an overcomplete dictionary to perform dictionary sparse coding (SC). This modified, 3D ViT (mViT) was then trained in a dynamic fashion: the balance between data fidelity and representation sparsity was adjusted during training such that the average fidelity error remained consistent with localized estimates of image noise. To demonstrate the newly proposed method, the mViT was trained with pediatric cardiac photon-counting x-ray CT data with variable levels of image noise (NAEOTOM Alpha PCCT scanner; retrospective data from 20 patients scanned at Duke University; ages: 1–18 years; iterative reconstruction noise level in the left ventricle: 20–55 HU). Data from one patient with the highest levels of noise was reserved for validation. Testing data included Alpha data from three additional Duke patients (2 < 1 year old) and a murine cardiac PCCT data set acquired on a preclinical system.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The validation denoising results demonstrate that SC with the mViT preserves anatomic structures relevant to the diagnosis and treatment of congenital heart defects (coronary artery origins; valve leaflets; left ventricle boundaries) while achieving similar intensity bias and lower intensity variance values than competing denoising methods (bilateral filtration [BF], NLM, dictionary SC, block matching 4D, orthogonal matching pursuit, Noise2Void). Applying the trained mViT network to preclinical PCCT demonstrated robust generalization performance to high levels of image noise (∼230 HU) and differing image contrast; however, applying the network to clinical PCCT data in younger patients (< 1 year old) demonstrated some smoothing of image details in data already heavily denoised during reconstruction.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>This work demonstrates robust, self-supervised denoising of pediatric cardiac PCCT data through data adaptation during network training based on local noise estimates. The trained network generalizes to data sets with high levels of noise and differing image contrast relative to the training data, suggesting that self-supervised fine tuning may allow the trained network to address related CT denoising problems.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 7","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Denoising pediatric cardiac photon-counting CT data with sparse coding and data-adaptive, self-supervised deep learning\",\"authors\":\"Darin P. Clark, Joseph Y. Cao, Cristian T. Badea\",\"doi\":\"10.1002/mp.17918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>The judicious use of CT in pediatric cardiac applications is warranted because young patients face the need for repeated imaging and increased lifetime cancer risk after ionizing radiation exposure. The quality of pediatric cardiac CT scans is variable because of limited protocols optimizations for pediatric patients, the common presence of metallic implants following treatment, and disparities in denoising algorithm performance between adult and pediatric scans. Two recent technological developments promise to improve the average quality of pediatric CT scans at fixed or reduced dose: clinical photon-counting CT (PCCT) and deep learning (DL) algorithms for CT image denoising. Given advancements to accommodate variable image quality, these technologies will deliver improved spatial resolution, noise performance, and contrast resolution for pediatric cardiac CT imaging.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Purpose</h3>\\n \\n <p>To advance self-supervised DL denoising methods to accommodate variable image quality in pediatric cardiac CT data.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>Starting with the popular Vision Transformer (ViT) DL architecture, two targeted architectural changes were made: (1) the multi-layer perceptrons (MLPs) were modified to allow cross-token recombination of encoded image data following attention computations (parallels patch-wise weighting and averaging in non-local means [NLM]), and (2) the network head was replaced with the equivalent of an overcomplete dictionary to perform dictionary sparse coding (SC). This modified, 3D ViT (mViT) was then trained in a dynamic fashion: the balance between data fidelity and representation sparsity was adjusted during training such that the average fidelity error remained consistent with localized estimates of image noise. To demonstrate the newly proposed method, the mViT was trained with pediatric cardiac photon-counting x-ray CT data with variable levels of image noise (NAEOTOM Alpha PCCT scanner; retrospective data from 20 patients scanned at Duke University; ages: 1–18 years; iterative reconstruction noise level in the left ventricle: 20–55 HU). Data from one patient with the highest levels of noise was reserved for validation. Testing data included Alpha data from three additional Duke patients (2 < 1 year old) and a murine cardiac PCCT data set acquired on a preclinical system.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>The validation denoising results demonstrate that SC with the mViT preserves anatomic structures relevant to the diagnosis and treatment of congenital heart defects (coronary artery origins; valve leaflets; left ventricle boundaries) while achieving similar intensity bias and lower intensity variance values than competing denoising methods (bilateral filtration [BF], NLM, dictionary SC, block matching 4D, orthogonal matching pursuit, Noise2Void). Applying the trained mViT network to preclinical PCCT demonstrated robust generalization performance to high levels of image noise (∼230 HU) and differing image contrast; however, applying the network to clinical PCCT data in younger patients (< 1 year old) demonstrated some smoothing of image details in data already heavily denoised during reconstruction.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions</h3>\\n \\n <p>This work demonstrates robust, self-supervised denoising of pediatric cardiac PCCT data through data adaptation during network training based on local noise estimates. The trained network generalizes to data sets with high levels of noise and differing image contrast relative to the training data, suggesting that self-supervised fine tuning may allow the trained network to address related CT denoising problems.</p>\\n </section>\\n </div>\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"52 7\",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/mp.17918\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mp.17918","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Denoising pediatric cardiac photon-counting CT data with sparse coding and data-adaptive, self-supervised deep learning
Background
The judicious use of CT in pediatric cardiac applications is warranted because young patients face the need for repeated imaging and increased lifetime cancer risk after ionizing radiation exposure. The quality of pediatric cardiac CT scans is variable because of limited protocols optimizations for pediatric patients, the common presence of metallic implants following treatment, and disparities in denoising algorithm performance between adult and pediatric scans. Two recent technological developments promise to improve the average quality of pediatric CT scans at fixed or reduced dose: clinical photon-counting CT (PCCT) and deep learning (DL) algorithms for CT image denoising. Given advancements to accommodate variable image quality, these technologies will deliver improved spatial resolution, noise performance, and contrast resolution for pediatric cardiac CT imaging.
Purpose
To advance self-supervised DL denoising methods to accommodate variable image quality in pediatric cardiac CT data.
Methods
Starting with the popular Vision Transformer (ViT) DL architecture, two targeted architectural changes were made: (1) the multi-layer perceptrons (MLPs) were modified to allow cross-token recombination of encoded image data following attention computations (parallels patch-wise weighting and averaging in non-local means [NLM]), and (2) the network head was replaced with the equivalent of an overcomplete dictionary to perform dictionary sparse coding (SC). This modified, 3D ViT (mViT) was then trained in a dynamic fashion: the balance between data fidelity and representation sparsity was adjusted during training such that the average fidelity error remained consistent with localized estimates of image noise. To demonstrate the newly proposed method, the mViT was trained with pediatric cardiac photon-counting x-ray CT data with variable levels of image noise (NAEOTOM Alpha PCCT scanner; retrospective data from 20 patients scanned at Duke University; ages: 1–18 years; iterative reconstruction noise level in the left ventricle: 20–55 HU). Data from one patient with the highest levels of noise was reserved for validation. Testing data included Alpha data from three additional Duke patients (2 < 1 year old) and a murine cardiac PCCT data set acquired on a preclinical system.
Results
The validation denoising results demonstrate that SC with the mViT preserves anatomic structures relevant to the diagnosis and treatment of congenital heart defects (coronary artery origins; valve leaflets; left ventricle boundaries) while achieving similar intensity bias and lower intensity variance values than competing denoising methods (bilateral filtration [BF], NLM, dictionary SC, block matching 4D, orthogonal matching pursuit, Noise2Void). Applying the trained mViT network to preclinical PCCT demonstrated robust generalization performance to high levels of image noise (∼230 HU) and differing image contrast; however, applying the network to clinical PCCT data in younger patients (< 1 year old) demonstrated some smoothing of image details in data already heavily denoised during reconstruction.
Conclusions
This work demonstrates robust, self-supervised denoising of pediatric cardiac PCCT data through data adaptation during network training based on local noise estimates. The trained network generalizes to data sets with high levels of noise and differing image contrast relative to the training data, suggesting that self-supervised fine tuning may allow the trained network to address related CT denoising problems.
期刊介绍:
Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments
Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.