{"title":"基于变压器的RT-DETR框架用于胸部x射线疾病的准确检测","authors":"Rakesh Mutukuru , Akula Rajesh , Vasanthi Ponduri , Javeed Ahammed , Lakshmi Prasanna Kothala","doi":"10.1016/j.irbm.2025.100912","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>Chest X-ray (CXR) analysis is a vital tool for early disease detection, enabling timely diagnosis and treatment. However, achieving accurate CXR disease detection remains challenging due to low image contrast, overlapping anatomical structures, and variations in imaging conditions. This study aims to develop a robust and efficient disease detection framework using the RT-DETR model to address these limitations and improve diagnostic accuracy.</div></div><div><h3>Material and Methods</h3><div>The proposed framework integrates transformer-based components within the RT-DETR architecture for enhanced feature representation. The backbone incorporates HGBlock and HGStem modules to capture multi-scale spatial representations through hierarchical gradient flow. In the neck network, the Attention-Intensified Feature Interaction (AIFI) module and the Reparameterized Efficient Path Aggregation (REpc3) module refine feature fusion and strengthen contextual understanding. The detection head employs the RT-DETR decoder with an efficient query-based mechanism to improve localization and classification precision. Statistical validation was conducted using the Wilcoxon Test, Paired T-Test, and Kruskal-Wallis Test to ensure the reliability of performance outcomes.</div></div><div><h3>Results</h3><div>The proposed RT-DETR framework achieved a precision of 55.7%, outperforming YOLOv7x's 47.7% by 8%. The recall was comparable, with our model achieving 43.0% versus YOLOv7x's 43.1%. Importantly, the mean Average Precision (mAP) of our model reached 45.3%, representing a 3.7% improvement over YOLOv7x's 41.6%. These results confirm the model's superior performance and its statistical significance as validated by the applied tests.</div></div><div><h3>Conclusion</h3><div>The RT-DETR-based disease detection framework demonstrates improved accuracy and robustness in CXR analysis compared to state-of-the-art models. Its integration of advanced transformer-based modules enhances feature representation and detection precision. The results highlight its potential as a reliable and efficient tool for automated chest X-ray disease diagnosis, offering strong applicability in real-world clinical settings.</div></div>","PeriodicalId":14605,"journal":{"name":"Irbm","volume":"46 6","pages":"Article 100912"},"PeriodicalIF":4.2000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transformer-Based RT-DETR Framework for Accurate Chest X-Ray Disease Detection\",\"authors\":\"Rakesh Mutukuru , Akula Rajesh , Vasanthi Ponduri , Javeed Ahammed , Lakshmi Prasanna Kothala\",\"doi\":\"10.1016/j.irbm.2025.100912\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objectives</h3><div>Chest X-ray (CXR) analysis is a vital tool for early disease detection, enabling timely diagnosis and treatment. However, achieving accurate CXR disease detection remains challenging due to low image contrast, overlapping anatomical structures, and variations in imaging conditions. This study aims to develop a robust and efficient disease detection framework using the RT-DETR model to address these limitations and improve diagnostic accuracy.</div></div><div><h3>Material and Methods</h3><div>The proposed framework integrates transformer-based components within the RT-DETR architecture for enhanced feature representation. The backbone incorporates HGBlock and HGStem modules to capture multi-scale spatial representations through hierarchical gradient flow. In the neck network, the Attention-Intensified Feature Interaction (AIFI) module and the Reparameterized Efficient Path Aggregation (REpc3) module refine feature fusion and strengthen contextual understanding. The detection head employs the RT-DETR decoder with an efficient query-based mechanism to improve localization and classification precision. Statistical validation was conducted using the Wilcoxon Test, Paired T-Test, and Kruskal-Wallis Test to ensure the reliability of performance outcomes.</div></div><div><h3>Results</h3><div>The proposed RT-DETR framework achieved a precision of 55.7%, outperforming YOLOv7x's 47.7% by 8%. The recall was comparable, with our model achieving 43.0% versus YOLOv7x's 43.1%. Importantly, the mean Average Precision (mAP) of our model reached 45.3%, representing a 3.7% improvement over YOLOv7x's 41.6%. These results confirm the model's superior performance and its statistical significance as validated by the applied tests.</div></div><div><h3>Conclusion</h3><div>The RT-DETR-based disease detection framework demonstrates improved accuracy and robustness in CXR analysis compared to state-of-the-art models. Its integration of advanced transformer-based modules enhances feature representation and detection precision. The results highlight its potential as a reliable and efficient tool for automated chest X-ray disease diagnosis, offering strong applicability in real-world clinical settings.</div></div>\",\"PeriodicalId\":14605,\"journal\":{\"name\":\"Irbm\",\"volume\":\"46 6\",\"pages\":\"Article 100912\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Irbm\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1959031825000375\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Irbm","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1959031825000375","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
Transformer-Based RT-DETR Framework for Accurate Chest X-Ray Disease Detection
Objectives
Chest X-ray (CXR) analysis is a vital tool for early disease detection, enabling timely diagnosis and treatment. However, achieving accurate CXR disease detection remains challenging due to low image contrast, overlapping anatomical structures, and variations in imaging conditions. This study aims to develop a robust and efficient disease detection framework using the RT-DETR model to address these limitations and improve diagnostic accuracy.
Material and Methods
The proposed framework integrates transformer-based components within the RT-DETR architecture for enhanced feature representation. The backbone incorporates HGBlock and HGStem modules to capture multi-scale spatial representations through hierarchical gradient flow. In the neck network, the Attention-Intensified Feature Interaction (AIFI) module and the Reparameterized Efficient Path Aggregation (REpc3) module refine feature fusion and strengthen contextual understanding. The detection head employs the RT-DETR decoder with an efficient query-based mechanism to improve localization and classification precision. Statistical validation was conducted using the Wilcoxon Test, Paired T-Test, and Kruskal-Wallis Test to ensure the reliability of performance outcomes.
Results
The proposed RT-DETR framework achieved a precision of 55.7%, outperforming YOLOv7x's 47.7% by 8%. The recall was comparable, with our model achieving 43.0% versus YOLOv7x's 43.1%. Importantly, the mean Average Precision (mAP) of our model reached 45.3%, representing a 3.7% improvement over YOLOv7x's 41.6%. These results confirm the model's superior performance and its statistical significance as validated by the applied tests.
Conclusion
The RT-DETR-based disease detection framework demonstrates improved accuracy and robustness in CXR analysis compared to state-of-the-art models. Its integration of advanced transformer-based modules enhances feature representation and detection precision. The results highlight its potential as a reliable and efficient tool for automated chest X-ray disease diagnosis, offering strong applicability in real-world clinical settings.
期刊介绍:
IRBM is the journal of the AGBM (Alliance for engineering in Biology an Medicine / Alliance pour le génie biologique et médical) and the SFGBM (BioMedical Engineering French Society / Société française de génie biologique médical) and the AFIB (French Association of Biomedical Engineers / Association française des ingénieurs biomédicaux).
As a vehicle of information and knowledge in the field of biomedical technologies, IRBM is devoted to fundamental as well as clinical research. Biomedical engineering and use of new technologies are the cornerstones of IRBM, providing authors and users with the latest information. Its six issues per year propose reviews (state-of-the-art and current knowledge), original articles directed at fundamental research and articles focusing on biomedical engineering. All articles are submitted to peer reviewers acting as guarantors for IRBM''s scientific and medical content. The field covered by IRBM includes all the discipline of Biomedical engineering. Thereby, the type of papers published include those that cover the technological and methodological development in:
-Physiological and Biological Signal processing (EEG, MEG, ECG…)-
Medical Image processing-
Biomechanics-
Biomaterials-
Medical Physics-
Biophysics-
Physiological and Biological Sensors-
Information technologies in healthcare-
Disability research-
Computational physiology-
…