Thomas Braure, Delphine Lazaro, David Hateau, Vincent Brandon, Kévin Ginsburger
{"title":"稀疏视图计算机断层扫描图像重建的条件反射生成潜在优化。","authors":"Thomas Braure, Delphine Lazaro, David Hateau, Vincent Brandon, Kévin Ginsburger","doi":"10.1117/1.JMI.12.2.024004","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The issue of delivered doses during computed tomography (CT) scans encouraged sparser sets of X-ray projection, severely degrading reconstructions from conventional methods. Although most deep learning approaches benefit from large supervised datasets, they cannot generalize to new acquisition protocols (geometry, source/detector specifications). To address this issue, we developed a method working without training data and independently of experimental setups. In addition, our model may be initialized on small unsupervised datasets to enhance reconstructions.</p><p><strong>Approach: </strong>We propose a conditioned generative latent optimization (cGLO) in which a decoder reconstructs multiple slices simultaneously with a shared objective. It is tested on full-dose sparse-view CT for varying projection sets: (a) without training data against Deep Image Prior (DIP) and (b) with training datasets of multiple sizes against state-of-the-art score-based generative models (SGMs). Peak signal-to-noise ratio (PSNR) and structural SIMilarity (SSIM) metrics are used to quantify reconstruction quality.</p><p><strong>Results: </strong>cGLO demonstrates better SSIM than SGMs (between <math><mrow><mo>+</mo> <mn>0.034</mn></mrow> </math> and <math><mrow><mo>+</mo> <mn>0.139</mn></mrow> </math> ) and has an increasing advantage for smaller datasets reaching a <math><mrow><mo>+</mo> <mn>6.06</mn> <mtext> </mtext> <mi>dB</mi></mrow> </math> PSNR gain. Our strategy also outperforms DIP with at least a <math><mrow><mo>+</mo> <mn>1.52</mn> <mtext> </mtext> <mi>dB</mi></mrow> </math> PSNR advantage and peaks at <math><mrow><mo>+</mo> <mn>3.15</mn> <mtext> </mtext> <mi>dB</mi></mrow> </math> with fewer angles. Moreover, cGLO does not create artifacts or structural deformations contrary to DIP and SGMs.</p><p><strong>Conclusions: </strong>We propose a parsimonious and robust reconstruction technique offering similar to better performances when compared with state-of-the-art methods regarding full-dose sparse-view CT. Our strategy could be readily applied to any imaging reconstruction task without any assumption about the acquisition protocol or the quantity of available data.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 2","pages":"024004"},"PeriodicalIF":1.7000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11961077/pdf/","citationCount":"0","resultStr":"{\"title\":\"Conditioning generative latent optimization for sparse-view computed tomography image reconstruction.\",\"authors\":\"Thomas Braure, Delphine Lazaro, David Hateau, Vincent Brandon, Kévin Ginsburger\",\"doi\":\"10.1117/1.JMI.12.2.024004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>The issue of delivered doses during computed tomography (CT) scans encouraged sparser sets of X-ray projection, severely degrading reconstructions from conventional methods. Although most deep learning approaches benefit from large supervised datasets, they cannot generalize to new acquisition protocols (geometry, source/detector specifications). To address this issue, we developed a method working without training data and independently of experimental setups. In addition, our model may be initialized on small unsupervised datasets to enhance reconstructions.</p><p><strong>Approach: </strong>We propose a conditioned generative latent optimization (cGLO) in which a decoder reconstructs multiple slices simultaneously with a shared objective. It is tested on full-dose sparse-view CT for varying projection sets: (a) without training data against Deep Image Prior (DIP) and (b) with training datasets of multiple sizes against state-of-the-art score-based generative models (SGMs). Peak signal-to-noise ratio (PSNR) and structural SIMilarity (SSIM) metrics are used to quantify reconstruction quality.</p><p><strong>Results: </strong>cGLO demonstrates better SSIM than SGMs (between <math><mrow><mo>+</mo> <mn>0.034</mn></mrow> </math> and <math><mrow><mo>+</mo> <mn>0.139</mn></mrow> </math> ) and has an increasing advantage for smaller datasets reaching a <math><mrow><mo>+</mo> <mn>6.06</mn> <mtext> </mtext> <mi>dB</mi></mrow> </math> PSNR gain. Our strategy also outperforms DIP with at least a <math><mrow><mo>+</mo> <mn>1.52</mn> <mtext> </mtext> <mi>dB</mi></mrow> </math> PSNR advantage and peaks at <math><mrow><mo>+</mo> <mn>3.15</mn> <mtext> </mtext> <mi>dB</mi></mrow> </math> with fewer angles. Moreover, cGLO does not create artifacts or structural deformations contrary to DIP and SGMs.</p><p><strong>Conclusions: </strong>We propose a parsimonious and robust reconstruction technique offering similar to better performances when compared with state-of-the-art methods regarding full-dose sparse-view CT. Our strategy could be readily applied to any imaging reconstruction task without any assumption about the acquisition protocol or the quantity of available data.</p>\",\"PeriodicalId\":47707,\"journal\":{\"name\":\"Journal of Medical Imaging\",\"volume\":\"12 2\",\"pages\":\"024004\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11961077/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Medical Imaging\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1117/1.JMI.12.2.024004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/1 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.12.2.024004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/1 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Conditioning generative latent optimization for sparse-view computed tomography image reconstruction.
Purpose: The issue of delivered doses during computed tomography (CT) scans encouraged sparser sets of X-ray projection, severely degrading reconstructions from conventional methods. Although most deep learning approaches benefit from large supervised datasets, they cannot generalize to new acquisition protocols (geometry, source/detector specifications). To address this issue, we developed a method working without training data and independently of experimental setups. In addition, our model may be initialized on small unsupervised datasets to enhance reconstructions.
Approach: We propose a conditioned generative latent optimization (cGLO) in which a decoder reconstructs multiple slices simultaneously with a shared objective. It is tested on full-dose sparse-view CT for varying projection sets: (a) without training data against Deep Image Prior (DIP) and (b) with training datasets of multiple sizes against state-of-the-art score-based generative models (SGMs). Peak signal-to-noise ratio (PSNR) and structural SIMilarity (SSIM) metrics are used to quantify reconstruction quality.
Results: cGLO demonstrates better SSIM than SGMs (between and ) and has an increasing advantage for smaller datasets reaching a PSNR gain. Our strategy also outperforms DIP with at least a PSNR advantage and peaks at with fewer angles. Moreover, cGLO does not create artifacts or structural deformations contrary to DIP and SGMs.
Conclusions: We propose a parsimonious and robust reconstruction technique offering similar to better performances when compared with state-of-the-art methods regarding full-dose sparse-view CT. Our strategy could be readily applied to any imaging reconstruction task without any assumption about the acquisition protocol or the quantity of available data.
期刊介绍:
JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.