Saiprasad Potharaju , Swapnali N. Tambe , Kishore Dasari , N. Srikanth , Rampay Venkatarao , Sagar Tambe
{"title":"Enhanced X-ray image classification for pneumonia detection using deep learning based CBAM and SE mechanisms","authors":"Saiprasad Potharaju , Swapnali N. Tambe , Kishore Dasari , N. Srikanth , Rampay Venkatarao , Sagar Tambe","doi":"10.1016/j.ibmed.2025.100299","DOIUrl":null,"url":null,"abstract":"<div><h3>Problem considered</h3><div>Pneumonia, a global health concern, remains a significant cause of morbidity and mortality, particularly in children under five and the elderly. Diagnostic challenges are pronounced in resource-limited settings, where expertise in radiological interpretation is scarce. X-ray imaging, a common diagnostic tool, often fails to provide accurate results without expert analysis. This gap in timely and precise diagnosis leads to delayed treatments and worsening patient outcomes. The emergence of antibiotic-resistant strains further emphasizes the urgency for innovative diagnostic solutions.</div></div><div><h3>Methods</h3><div>This research integrates advanced attention mechanisms into convolutional neural networks (CNNs) to enhance pneumonia detection from X-ray images. Utilizing a dataset of 5816 X-rays, preprocessing steps included normalization and data augmentation to improve robustness. The baseline CNN model was augmented with Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation (SE) networks, which prioritize critical image regions and recalibrate feature channels. Comparative evaluations were conducted using ResNet50 combined with CBAM.</div></div><div><h3>Results</h3><div>The CBAM-enhanced CNN achieved 98.6 % accuracy, improving upon the baseline CNN's 92.08 %, with a sensitivity of 98.3 % and specificity of 97.9 %. The SE-integrated CNN followed with 96.25 % accuracy, demonstrating superior feature recalibration. ResNet50 with CBAM attained 93.32 % accuracy. Compared to standard CNN models, these models exhibited reduced overfitting, improved generalization, and enhanced feature extraction. The proposed approach ensures a higher precision rate in detecting pneumonia from X-ray images. The model is designed for real-world clinical applications, particularly in low-resource healthcare settings. A lightweight, user-friendly web application was developed to assist radiologists and general practitioners in automated pneumonia detection, reducing reliance on expert interpretation.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100299"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521225001036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Problem considered
Pneumonia, a global health concern, remains a significant cause of morbidity and mortality, particularly in children under five and the elderly. Diagnostic challenges are pronounced in resource-limited settings, where expertise in radiological interpretation is scarce. X-ray imaging, a common diagnostic tool, often fails to provide accurate results without expert analysis. This gap in timely and precise diagnosis leads to delayed treatments and worsening patient outcomes. The emergence of antibiotic-resistant strains further emphasizes the urgency for innovative diagnostic solutions.
Methods
This research integrates advanced attention mechanisms into convolutional neural networks (CNNs) to enhance pneumonia detection from X-ray images. Utilizing a dataset of 5816 X-rays, preprocessing steps included normalization and data augmentation to improve robustness. The baseline CNN model was augmented with Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation (SE) networks, which prioritize critical image regions and recalibrate feature channels. Comparative evaluations were conducted using ResNet50 combined with CBAM.
Results
The CBAM-enhanced CNN achieved 98.6 % accuracy, improving upon the baseline CNN's 92.08 %, with a sensitivity of 98.3 % and specificity of 97.9 %. The SE-integrated CNN followed with 96.25 % accuracy, demonstrating superior feature recalibration. ResNet50 with CBAM attained 93.32 % accuracy. Compared to standard CNN models, these models exhibited reduced overfitting, improved generalization, and enhanced feature extraction. The proposed approach ensures a higher precision rate in detecting pneumonia from X-ray images. The model is designed for real-world clinical applications, particularly in low-resource healthcare settings. A lightweight, user-friendly web application was developed to assist radiologists and general practitioners in automated pneumonia detection, reducing reliance on expert interpretation.