Twaha Kabika, Cai Hongsen, Zhu Hongling, Dong Jingxian, Zhang Siyuan, Mingyue Ding, Deng Xianbo, Hou Wenguang, Wang Yan
{"title":"改进基于神经辐射场的医学图像合成中的姿态精度和几何形状。","authors":"Twaha Kabika, Cai Hongsen, Zhu Hongling, Dong Jingxian, Zhang Siyuan, Mingyue Ding, Deng Xianbo, Hou Wenguang, Wang Yan","doi":"10.1002/mp.17832","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Neural radiance field (NeRF) models have garnered significant attention for their impressive ability to synthesize high-quality novel scene views from posed 2D images. Recently, the MedNeRF algorithm was developed to render complete computed tomography (CT) projections from a single or a few x-ray images further. Despite this advancement, MedNeRF struggles with accurate pose reconstruction, crucial for radiologists during image analysis, leading to blurry geometry in the generated outputs.</p><p><strong>Purpose: </strong>Motivated by these challenges, our research aims to address MedNeRF's limitations in pose accuracy and image clarity. Specifically, we seek to improve the pose accuracy of reconstructed images and enhance the generated output's anatomical detail and quality.</p><p><strong>Methods: </strong>We propose a novel pose-aware discriminator that estimates pose differences between generated and real patches, ensuring accurate poses and deeper anatomical structures in generated images. We enhance volumetric rendering from single-view x-rays by introducing a customized distortion adaptive loss function and present the HTDataset, a new dataset pair that better mimics machine-generated x-rays, offering clearer anatomical depictions with reduced noise.</p><p><strong>Results: </strong>Our method successfully renders images with correct poses and high fidelity, outperforming existing state-of-the-art methods. The results demonstrate superior performance in both qualitative and quantitative metrics.</p><p><strong>Conclusions: </strong>The proposed approach addresses the pose reconstruction challenge in MedNeRF, enhances the anatomical detail, and reduces noise in generated images. The use of HTDataset and the innovative discriminator structure lead to significant improvements in the accuracy and quality of the rendered images, setting a new benchmark in the field.</p>","PeriodicalId":94136,"journal":{"name":"Medical physics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving pose accuracy and geometry in neural radiance field-based medical image synthesis.\",\"authors\":\"Twaha Kabika, Cai Hongsen, Zhu Hongling, Dong Jingxian, Zhang Siyuan, Mingyue Ding, Deng Xianbo, Hou Wenguang, Wang Yan\",\"doi\":\"10.1002/mp.17832\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Neural radiance field (NeRF) models have garnered significant attention for their impressive ability to synthesize high-quality novel scene views from posed 2D images. Recently, the MedNeRF algorithm was developed to render complete computed tomography (CT) projections from a single or a few x-ray images further. Despite this advancement, MedNeRF struggles with accurate pose reconstruction, crucial for radiologists during image analysis, leading to blurry geometry in the generated outputs.</p><p><strong>Purpose: </strong>Motivated by these challenges, our research aims to address MedNeRF's limitations in pose accuracy and image clarity. Specifically, we seek to improve the pose accuracy of reconstructed images and enhance the generated output's anatomical detail and quality.</p><p><strong>Methods: </strong>We propose a novel pose-aware discriminator that estimates pose differences between generated and real patches, ensuring accurate poses and deeper anatomical structures in generated images. We enhance volumetric rendering from single-view x-rays by introducing a customized distortion adaptive loss function and present the HTDataset, a new dataset pair that better mimics machine-generated x-rays, offering clearer anatomical depictions with reduced noise.</p><p><strong>Results: </strong>Our method successfully renders images with correct poses and high fidelity, outperforming existing state-of-the-art methods. The results demonstrate superior performance in both qualitative and quantitative metrics.</p><p><strong>Conclusions: </strong>The proposed approach addresses the pose reconstruction challenge in MedNeRF, enhances the anatomical detail, and reduces noise in generated images. The use of HTDataset and the innovative discriminator structure lead to significant improvements in the accuracy and quality of the rendered images, setting a new benchmark in the field.</p>\",\"PeriodicalId\":94136,\"journal\":{\"name\":\"Medical physics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/mp.17832\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/mp.17832","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving pose accuracy and geometry in neural radiance field-based medical image synthesis.
Background: Neural radiance field (NeRF) models have garnered significant attention for their impressive ability to synthesize high-quality novel scene views from posed 2D images. Recently, the MedNeRF algorithm was developed to render complete computed tomography (CT) projections from a single or a few x-ray images further. Despite this advancement, MedNeRF struggles with accurate pose reconstruction, crucial for radiologists during image analysis, leading to blurry geometry in the generated outputs.
Purpose: Motivated by these challenges, our research aims to address MedNeRF's limitations in pose accuracy and image clarity. Specifically, we seek to improve the pose accuracy of reconstructed images and enhance the generated output's anatomical detail and quality.
Methods: We propose a novel pose-aware discriminator that estimates pose differences between generated and real patches, ensuring accurate poses and deeper anatomical structures in generated images. We enhance volumetric rendering from single-view x-rays by introducing a customized distortion adaptive loss function and present the HTDataset, a new dataset pair that better mimics machine-generated x-rays, offering clearer anatomical depictions with reduced noise.
Results: Our method successfully renders images with correct poses and high fidelity, outperforming existing state-of-the-art methods. The results demonstrate superior performance in both qualitative and quantitative metrics.
Conclusions: The proposed approach addresses the pose reconstruction challenge in MedNeRF, enhances the anatomical detail, and reduces noise in generated images. The use of HTDataset and the innovative discriminator structure lead to significant improvements in the accuracy and quality of the rendered images, setting a new benchmark in the field.