Rashid Ali, Fiaz Gul Khan, Zia Ur Rehman, Daehan Kwak, Farman Ali
{"title":"Enhanced Diabetic Retinopathy Detection: An Explainable Semi-Supervised Approach Using Contrastive Learning.","authors":"Rashid Ali, Fiaz Gul Khan, Zia Ur Rehman, Daehan Kwak, Farman Ali","doi":"10.1109/JBHI.2025.3551696","DOIUrl":null,"url":null,"abstract":"<p><p>Diabetic retinopathy (DR) is a leading cause of blindness and represents a critical challenge to global vision health. Early detection is essential to preventing irreversible eye damage. Automated medical image analysis plays a pivotal role in enabling timely diagnosis. However, the development of robust diagnostic models is challenged by the scarcity of labeled data and the prevalence of imbalanced and unlabeled datasets. Semi-supervised learning offers a potential solution by leveraging unlabeled data to enhance model performance. However, it is often limited by challenges such as unreliable pseudo-labeling, the exclusion of low-confidence data, and biases introduced by imbalanced datasets. To address these limitations, we propose a novel semi-supervised learning framework for DR detection that combines similarity and contrastive learning. Our approach utilizes class prototypes and an ensemble of classifiers to generate reliable pseudo-labels for unlabeled data. Unlike traditional methods that discard unreliable samples, our framework integrates them into the training process using contrastive learning. This allows us to extract valuable features and improve overall performance. Furthermore, we enhance the model's transparency and interpretability by incorporating the explainable AI technique GradCAM, which provides insights into the model's predictions for specific images. We evaluated the proposed method on the publicly available Kaggle DR dataset for diabetic retinopathy classification. Experimental results demonstrate that our approach achieves improved performance compared to existing semi-supervised learning methods. It also effectively leverages unreliable samples, highlighting its potential to advance DR diagnosis.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3551696","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Diabetic retinopathy (DR) is a leading cause of blindness and represents a critical challenge to global vision health. Early detection is essential to preventing irreversible eye damage. Automated medical image analysis plays a pivotal role in enabling timely diagnosis. However, the development of robust diagnostic models is challenged by the scarcity of labeled data and the prevalence of imbalanced and unlabeled datasets. Semi-supervised learning offers a potential solution by leveraging unlabeled data to enhance model performance. However, it is often limited by challenges such as unreliable pseudo-labeling, the exclusion of low-confidence data, and biases introduced by imbalanced datasets. To address these limitations, we propose a novel semi-supervised learning framework for DR detection that combines similarity and contrastive learning. Our approach utilizes class prototypes and an ensemble of classifiers to generate reliable pseudo-labels for unlabeled data. Unlike traditional methods that discard unreliable samples, our framework integrates them into the training process using contrastive learning. This allows us to extract valuable features and improve overall performance. Furthermore, we enhance the model's transparency and interpretability by incorporating the explainable AI technique GradCAM, which provides insights into the model's predictions for specific images. We evaluated the proposed method on the publicly available Kaggle DR dataset for diabetic retinopathy classification. Experimental results demonstrate that our approach achieves improved performance compared to existing semi-supervised learning methods. It also effectively leverages unreliable samples, highlighting its potential to advance DR diagnosis.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.