{"title":"Facilitating Radiograph Interpretation: Refined Generative Models for Precise Bone Suppression in Chest X-rays.","authors":"Samar Ibrahim, Sahar Selim, Mustafa Elattar","doi":"10.1007/s10278-025-01461-2","DOIUrl":null,"url":null,"abstract":"<p><p>Chest X-ray (CXR) is crucial for diagnosing lung diseases, especially lung nodules. Recent studies indicate that bones, such as ribs and clavicles, obscure 82 to 95% of undiagnosed lung cancers. The development of computer-aided detection (CAD) systems with automated bone suppression is vital to improve detection rates and support early clinical decision-making. Current bone suppression methods face challenges: they often depend on manual subtraction of bone-only images from CXRs, leading to inefficiency and poor generalization; there is significant information loss in data compression within deep convolutional end-to-end architectures; and a balance between model efficiency and accuracy has not been sufficiently achieved in existing research. We introduce a novel end-to-end architecture, the mask-guided model, to address these challenges. Leveraging the Pix2Pix framework, our model enhances computational efficiency by reducing parameter count by 92.5%. It features a rib mask-guided module with a mask encoder and cross-attention mechanism, which provides spatial constraints, reduces information loss during encoder compression, and preserves non-relevant areas. An ablation study evaluates the impact of various factors. The model undergoes initial training on digitally reconstructed radiographs (DRRs) derived from CT projections for bone suppression and is fine-tuned on the JSRT dataset to accelerate convergence. The mask-guided model surpasses previous state-of-the-art methods, showing superior bone suppression performance in terms of structural similarity index (SSIM), peak signal-to-noise ratio (PSNR), and processing speed. It achieves an SSIM of 0.99 ± 0.002 and a PSNR of 36.14 ± 1.13 on the JSRT dataset. This study underscores the proposed model's effectiveness compared to existing methods, showcasing its capability to reduce model size and increase accuracy. This makes it well-suited for deployment in affordable, low-power hardware devices across various clinical settings.</p>","PeriodicalId":516858,"journal":{"name":"Journal of imaging informatics in medicine","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of imaging informatics in medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10278-025-01461-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Chest X-ray (CXR) is crucial for diagnosing lung diseases, especially lung nodules. Recent studies indicate that bones, such as ribs and clavicles, obscure 82 to 95% of undiagnosed lung cancers. The development of computer-aided detection (CAD) systems with automated bone suppression is vital to improve detection rates and support early clinical decision-making. Current bone suppression methods face challenges: they often depend on manual subtraction of bone-only images from CXRs, leading to inefficiency and poor generalization; there is significant information loss in data compression within deep convolutional end-to-end architectures; and a balance between model efficiency and accuracy has not been sufficiently achieved in existing research. We introduce a novel end-to-end architecture, the mask-guided model, to address these challenges. Leveraging the Pix2Pix framework, our model enhances computational efficiency by reducing parameter count by 92.5%. It features a rib mask-guided module with a mask encoder and cross-attention mechanism, which provides spatial constraints, reduces information loss during encoder compression, and preserves non-relevant areas. An ablation study evaluates the impact of various factors. The model undergoes initial training on digitally reconstructed radiographs (DRRs) derived from CT projections for bone suppression and is fine-tuned on the JSRT dataset to accelerate convergence. The mask-guided model surpasses previous state-of-the-art methods, showing superior bone suppression performance in terms of structural similarity index (SSIM), peak signal-to-noise ratio (PSNR), and processing speed. It achieves an SSIM of 0.99 ± 0.002 and a PSNR of 36.14 ± 1.13 on the JSRT dataset. This study underscores the proposed model's effectiveness compared to existing methods, showcasing its capability to reduce model size and increase accuracy. This makes it well-suited for deployment in affordable, low-power hardware devices across various clinical settings.