{"title":"GAN-Enhanced Hybrid Deep Learning with Explainable AI for Automated Cataract Diagnosis.","authors":"Shashank Mouli Satapathy, Mitali Gopinath Paul, Anusha Garg, Suhani Bhatnagar","doi":"10.1007/s10916-025-02249-1","DOIUrl":null,"url":null,"abstract":"<p><p>Cataracts, among the most prevalent eye disorders, result in diminished vision due to cloudiness in the eye's natural lens. Timely diagnosis is crucial for preventing irreversible damage. While effective, existing automated systems encounter difficulties like limited dataset variety, lack of interpretability, and suboptimal generalization in real-world scenarios. This study presents a novel deep learning-based method that incorporates Generative AI (GenAI) and Explainable AI (XAI) to enhance cataract detection. The proposed methodology leverages a fine-tuned InceptionResNetV2 with additional layers, trained on a hybrid dataset enriched by merging six open-source datasets, along with synthetic images generated via Generative Adversarial Networks (GANs). Class weights address data imbalance, while stratified K-Fold cross-validation ensures robust evaluation. Our system offers graphical interpretation through Gradient-weighted Class Activation Mapping (Grad-CAM) heatmaps, supporting clinical transparency and reliability. The model evaluation reports a mean K-Fold accuracy of 97.58% with a standard deviation of 0.0040, and a 95% confidence interval (CI) of (0.9702, 0.9814). On the external dataset, the model achieved an overall accuracy of 97%, an AUC of 0.9944, and for the cataract class, a precision of 96%, recall (sensitivity) of 94%, F1-score of 95%. Our method, by incorporating synthetic images and explainable AI, ensures enhanced data diversity, addresses class imbalance, reduced dependency on large annotated datasets, and offers greater interpretability that facilitates expert validation and builds stronger clinical trust, making it superior to existing cataract detection systems.</p>","PeriodicalId":16338,"journal":{"name":"Journal of Medical Systems","volume":"49 1","pages":"123"},"PeriodicalIF":5.7000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10916-025-02249-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Cataracts, among the most prevalent eye disorders, result in diminished vision due to cloudiness in the eye's natural lens. Timely diagnosis is crucial for preventing irreversible damage. While effective, existing automated systems encounter difficulties like limited dataset variety, lack of interpretability, and suboptimal generalization in real-world scenarios. This study presents a novel deep learning-based method that incorporates Generative AI (GenAI) and Explainable AI (XAI) to enhance cataract detection. The proposed methodology leverages a fine-tuned InceptionResNetV2 with additional layers, trained on a hybrid dataset enriched by merging six open-source datasets, along with synthetic images generated via Generative Adversarial Networks (GANs). Class weights address data imbalance, while stratified K-Fold cross-validation ensures robust evaluation. Our system offers graphical interpretation through Gradient-weighted Class Activation Mapping (Grad-CAM) heatmaps, supporting clinical transparency and reliability. The model evaluation reports a mean K-Fold accuracy of 97.58% with a standard deviation of 0.0040, and a 95% confidence interval (CI) of (0.9702, 0.9814). On the external dataset, the model achieved an overall accuracy of 97%, an AUC of 0.9944, and for the cataract class, a precision of 96%, recall (sensitivity) of 94%, F1-score of 95%. Our method, by incorporating synthetic images and explainable AI, ensures enhanced data diversity, addresses class imbalance, reduced dependency on large annotated datasets, and offers greater interpretability that facilitates expert validation and builds stronger clinical trust, making it superior to existing cataract detection systems.
期刊介绍:
Journal of Medical Systems provides a forum for the presentation and discussion of the increasingly extensive applications of new systems techniques and methods in hospital clinic and physician''s office administration; pathology radiology and pharmaceutical delivery systems; medical records storage and retrieval; and ancillary patient-support systems. The journal publishes informative articles essays and studies across the entire scale of medical systems from large hospital programs to novel small-scale medical services. Education is an integral part of this amalgamation of sciences and selected articles are published in this area. Since existing medical systems are constantly being modified to fit particular circumstances and to solve specific problems the journal includes a special section devoted to status reports on current installations.