GAN-Enhanced Hybrid Deep Learning with Explainable AI for Automated Cataract Diagnosis.

IF 5.7 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

Journal of Medical Systems Pub Date : 2025-10-02 DOI:10.1007/s10916-025-02249-1

Shashank Mouli Satapathy, Mitali Gopinath Paul, Anusha Garg, Suhani Bhatnagar

{"title":"GAN-Enhanced Hybrid Deep Learning with Explainable AI for Automated Cataract Diagnosis.","authors":"Shashank Mouli Satapathy, Mitali Gopinath Paul, Anusha Garg, Suhani Bhatnagar","doi":"10.1007/s10916-025-02249-1","DOIUrl":null,"url":null,"abstract":"<p><p>Cataracts, among the most prevalent eye disorders, result in diminished vision due to cloudiness in the eye's natural lens. Timely diagnosis is crucial for preventing irreversible damage. While effective, existing automated systems encounter difficulties like limited dataset variety, lack of interpretability, and suboptimal generalization in real-world scenarios. This study presents a novel deep learning-based method that incorporates Generative AI (GenAI) and Explainable AI (XAI) to enhance cataract detection. The proposed methodology leverages a fine-tuned InceptionResNetV2 with additional layers, trained on a hybrid dataset enriched by merging six open-source datasets, along with synthetic images generated via Generative Adversarial Networks (GANs). Class weights address data imbalance, while stratified K-Fold cross-validation ensures robust evaluation. Our system offers graphical interpretation through Gradient-weighted Class Activation Mapping (Grad-CAM) heatmaps, supporting clinical transparency and reliability. The model evaluation reports a mean K-Fold accuracy of 97.58% with a standard deviation of 0.0040, and a 95% confidence interval (CI) of (0.9702, 0.9814). On the external dataset, the model achieved an overall accuracy of 97%, an AUC of 0.9944, and for the cataract class, a precision of 96%, recall (sensitivity) of 94%, F1-score of 95%. Our method, by incorporating synthetic images and explainable AI, ensures enhanced data diversity, addresses class imbalance, reduced dependency on large annotated datasets, and offers greater interpretability that facilitates expert validation and builds stronger clinical trust, making it superior to existing cataract detection systems.</p>","PeriodicalId":16338,"journal":{"name":"Journal of Medical Systems","volume":"49 1","pages":"123"},"PeriodicalIF":5.7000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10916-025-02249-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Cataracts, among the most prevalent eye disorders, result in diminished vision due to cloudiness in the eye's natural lens. Timely diagnosis is crucial for preventing irreversible damage. While effective, existing automated systems encounter difficulties like limited dataset variety, lack of interpretability, and suboptimal generalization in real-world scenarios. This study presents a novel deep learning-based method that incorporates Generative AI (GenAI) and Explainable AI (XAI) to enhance cataract detection. The proposed methodology leverages a fine-tuned InceptionResNetV2 with additional layers, trained on a hybrid dataset enriched by merging six open-source datasets, along with synthetic images generated via Generative Adversarial Networks (GANs). Class weights address data imbalance, while stratified K-Fold cross-validation ensures robust evaluation. Our system offers graphical interpretation through Gradient-weighted Class Activation Mapping (Grad-CAM) heatmaps, supporting clinical transparency and reliability. The model evaluation reports a mean K-Fold accuracy of 97.58% with a standard deviation of 0.0040, and a 95% confidence interval (CI) of (0.9702, 0.9814). On the external dataset, the model achieved an overall accuracy of 97%, an AUC of 0.9944, and for the cataract class, a precision of 96%, recall (sensitivity) of 94%, F1-score of 95%. Our method, by incorporating synthetic images and explainable AI, ensures enhanced data diversity, addresses class imbalance, reduced dependency on large annotated datasets, and offers greater interpretability that facilitates expert validation and builds stronger clinical trust, making it superior to existing cataract detection systems.

查看原文本刊更多论文

gan增强混合深度学习与可解释的人工智能用于自动白内障诊断。

白内障是最常见的眼部疾病之一，由于眼睛的自然晶状体浑浊而导致视力下降。及时诊断对于预防不可逆转的损害至关重要。虽然有效，但现有的自动化系统遇到了诸如数据集种类有限、缺乏可解释性以及在现实场景中的次优泛化等困难。本研究提出了一种基于深度学习的新方法，该方法结合了生成人工智能（GenAI）和可解释人工智能（XAI）来增强白内障检测。所提出的方法利用经过微调的带有额外层的InceptionResNetV2，在合并了六个开源数据集的混合数据集上进行训练，以及通过生成对抗网络（gan）生成的合成图像。类权重解决了数据不平衡，而分层K-Fold交叉验证确保了稳健的评估。我们的系统通过梯度加权类激活图（Grad-CAM）热图提供图形解释，支持临床透明度和可靠性。模型评价的K-Fold平均准确率为97.58%，标准差为0.0040,95%置信区间（CI）为（0.9702,0.9814）。在外部数据集上，该模型的总体准确率为97%，AUC为0.9944，对于白内障类别，准确率为96%，召回率（灵敏度）为94%，f1评分为95%。我们的方法结合了合成图像和可解释的人工智能，确保了增强的数据多样性，解决了类别失衡，减少了对大型注释数据集的依赖，并提供了更大的可解释性，促进了专家验证，建立了更强的临床信任，使其优于现有的白内障检测系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Systems 医学-卫生保健

CiteScore

11.60

自引率

1.90%

发文量

审稿时长

4.8 months

期刊介绍： Journal of Medical Systems provides a forum for the presentation and discussion of the increasingly extensive applications of new systems techniques and methods in hospital clinic and physician''s office administration; pathology radiology and pharmaceutical delivery systems; medical records storage and retrieval; and ancillary patient-support systems. The journal publishes informative articles essays and studies across the entire scale of medical systems from large hospital programs to novel small-scale medical services. Education is an integral part of this amalgamation of sciences and selected articles are published in this area. Since existing medical systems are constantly being modified to fit particular circumstances and to solve specific problems the journal includes a special section devoted to status reports on current installations.