{"title":"锥束x射线透射成像的学习透视畸变校正","authors":"Yixing Huang;Andreas Maier;Fuxin Fan;Björn Kreher;Xiaolin Huang;Rainer Fietkau;Hongbin Han;Florian Putz;Christoph Bert","doi":"10.1109/TRPMS.2025.3551501","DOIUrl":null,"url":null,"abstract":"In cone-beam X-ray transmission imaging, perspective distortion (PD) causes difficulty in direct, accurate geometric assessments of anatomical structures. Since PD correction from a single view is highly ill-posed due to missing stereo/3-D information, the efficacy of different view combinations is investigated in this work. Our theoretical analysis reveals that the 0°&180° complementary view setting provides a practical way to identify perspectively deformed structures by assessing the deviation between the two views. In addition, it provides bounding information and reduces uncertainty for learning PD. Beyond view combinations, the impact of learning PD in different spatial domains, specifically Cartesian and polar coordinates, is explored. Two representative networks Pix2pixGAN and TransU-Net for correcting PD are investigated. Experiments on numerical bead phantom data and head CT data demonstrate the advantage of complementary views over other view combinations (a 0° single view, 0°&90° orthogonal views, and 0°&5° small angular views). Results further show that both Pix2pixGAN and TransU-Net achieve better performance in polar space than Cartesian space. The efficacy of the proposed framework on real cone-beam computed tomography (CBCT) projection data and its potential to handle bulky metal implants and surgical screws indicate the promising aspects of future real applications.","PeriodicalId":46807,"journal":{"name":"IEEE Transactions on Radiation and Plasma Medical Sciences","volume":"9 7","pages":"927-938"},"PeriodicalIF":3.5000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning Perspective Distortion Correction in Cone-Beam X-Ray Transmission Imaging\",\"authors\":\"Yixing Huang;Andreas Maier;Fuxin Fan;Björn Kreher;Xiaolin Huang;Rainer Fietkau;Hongbin Han;Florian Putz;Christoph Bert\",\"doi\":\"10.1109/TRPMS.2025.3551501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In cone-beam X-ray transmission imaging, perspective distortion (PD) causes difficulty in direct, accurate geometric assessments of anatomical structures. Since PD correction from a single view is highly ill-posed due to missing stereo/3-D information, the efficacy of different view combinations is investigated in this work. Our theoretical analysis reveals that the 0°&180° complementary view setting provides a practical way to identify perspectively deformed structures by assessing the deviation between the two views. In addition, it provides bounding information and reduces uncertainty for learning PD. Beyond view combinations, the impact of learning PD in different spatial domains, specifically Cartesian and polar coordinates, is explored. Two representative networks Pix2pixGAN and TransU-Net for correcting PD are investigated. Experiments on numerical bead phantom data and head CT data demonstrate the advantage of complementary views over other view combinations (a 0° single view, 0°&90° orthogonal views, and 0°&5° small angular views). Results further show that both Pix2pixGAN and TransU-Net achieve better performance in polar space than Cartesian space. The efficacy of the proposed framework on real cone-beam computed tomography (CBCT) projection data and its potential to handle bulky metal implants and surgical screws indicate the promising aspects of future real applications.\",\"PeriodicalId\":46807,\"journal\":{\"name\":\"IEEE Transactions on Radiation and Plasma Medical Sciences\",\"volume\":\"9 7\",\"pages\":\"927-938\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Radiation and Plasma Medical Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10930580/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Radiation and Plasma Medical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10930580/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Learning Perspective Distortion Correction in Cone-Beam X-Ray Transmission Imaging
In cone-beam X-ray transmission imaging, perspective distortion (PD) causes difficulty in direct, accurate geometric assessments of anatomical structures. Since PD correction from a single view is highly ill-posed due to missing stereo/3-D information, the efficacy of different view combinations is investigated in this work. Our theoretical analysis reveals that the 0°&180° complementary view setting provides a practical way to identify perspectively deformed structures by assessing the deviation between the two views. In addition, it provides bounding information and reduces uncertainty for learning PD. Beyond view combinations, the impact of learning PD in different spatial domains, specifically Cartesian and polar coordinates, is explored. Two representative networks Pix2pixGAN and TransU-Net for correcting PD are investigated. Experiments on numerical bead phantom data and head CT data demonstrate the advantage of complementary views over other view combinations (a 0° single view, 0°&90° orthogonal views, and 0°&5° small angular views). Results further show that both Pix2pixGAN and TransU-Net achieve better performance in polar space than Cartesian space. The efficacy of the proposed framework on real cone-beam computed tomography (CBCT) projection data and its potential to handle bulky metal implants and surgical screws indicate the promising aspects of future real applications.