An optimized multi-scale dilated attention layer for keratoconus disease classification.

IF 1.4 4区 医学 Q3 OPHTHALMOLOGY
K Balaji, N Gobalakrishnan
{"title":"An optimized multi-scale dilated attention layer for keratoconus disease classification.","authors":"K Balaji, N Gobalakrishnan","doi":"10.1007/s10792-025-03688-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Keratoconus (KCN) is a progressive and non-inflammatory corneal disorder characterized by thinning and conical deformation of the cornea, resulting in visual impairment. Early and accurate detection is crucial to prevent disease progression. Conventional diagnostic methods are time-consuming and depend on expert evaluation. This study introduces an advanced deep learning (DL) model aimed at automating KCN detection using corneal topography images.</p><p><strong>Materials and methods: </strong>The proposed model, Optimized MSDALNet, integrates a Multi-Scale Dilated Attention Layer (MSDAL) to capture local and global corneal features at varying spatial resolutions. Training is optimized using Arctic Puffin Optimization (APO), a metaheuristic algorithm inspired by puffin foraging behavior. The model includes Explainable AI (XAI) capabilities using Grad-CAM for visual interpretability. Experiments were conducted using a public KCN dataset with over 1,100 labeled corneal topography images categorized into Normal, Suspect, and KCN classes. Standard pre-processing, data augmentation, and performance evaluation metrics (accuracy, precision, recall, specificity, FNR, MCC, AUC) were applied.</p><p><strong>Results: </strong>The Optimized MSDALNet achieved superior classification performance with an accuracy of 99.5%, precision of 99.4%, and specificity of 98.4%. The proposed model outperformed existing methods such as CNN, ViT, and Swin Transformer in terms of accuracy, computational cost (1.2 GFLOPs), and inference speed (8.4 ms/image). Grad-CAM visualization confirmed the model's focus on clinically relevant corneal regions. An ablation study demonstrated the impact of each component in the proposed framework.</p><p><strong>Conclusion: </strong>The Optimized MSDALNet combined with APO delivers an effective and interpretable solution for KCN detection. The model excels in feature extraction, computational efficiency, and clinical transparency. Limitations include dataset size and lack of multimodal inputs. Future work will focus on incorporating diverse datasets and additional patient data to enhance generalizability and diagnostic robustness.</p>","PeriodicalId":14473,"journal":{"name":"International Ophthalmology","volume":"45 1","pages":"318"},"PeriodicalIF":1.4000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Ophthalmology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10792-025-03688-y","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Keratoconus (KCN) is a progressive and non-inflammatory corneal disorder characterized by thinning and conical deformation of the cornea, resulting in visual impairment. Early and accurate detection is crucial to prevent disease progression. Conventional diagnostic methods are time-consuming and depend on expert evaluation. This study introduces an advanced deep learning (DL) model aimed at automating KCN detection using corneal topography images.

Materials and methods: The proposed model, Optimized MSDALNet, integrates a Multi-Scale Dilated Attention Layer (MSDAL) to capture local and global corneal features at varying spatial resolutions. Training is optimized using Arctic Puffin Optimization (APO), a metaheuristic algorithm inspired by puffin foraging behavior. The model includes Explainable AI (XAI) capabilities using Grad-CAM for visual interpretability. Experiments were conducted using a public KCN dataset with over 1,100 labeled corneal topography images categorized into Normal, Suspect, and KCN classes. Standard pre-processing, data augmentation, and performance evaluation metrics (accuracy, precision, recall, specificity, FNR, MCC, AUC) were applied.

Results: The Optimized MSDALNet achieved superior classification performance with an accuracy of 99.5%, precision of 99.4%, and specificity of 98.4%. The proposed model outperformed existing methods such as CNN, ViT, and Swin Transformer in terms of accuracy, computational cost (1.2 GFLOPs), and inference speed (8.4 ms/image). Grad-CAM visualization confirmed the model's focus on clinically relevant corneal regions. An ablation study demonstrated the impact of each component in the proposed framework.

Conclusion: The Optimized MSDALNet combined with APO delivers an effective and interpretable solution for KCN detection. The model excels in feature extraction, computational efficiency, and clinical transparency. Limitations include dataset size and lack of multimodal inputs. Future work will focus on incorporating diverse datasets and additional patient data to enhance generalizability and diagnostic robustness.

圆锥角膜疾病分类的优化多尺度放大注意层。
圆锥角膜(KCN)是一种进行性非炎症性角膜疾病,以角膜变薄和锥形变形为特征,导致视力障碍。早期和准确的检测对于预防疾病进展至关重要。传统的诊断方法耗时且依赖于专家评估。本研究引入了一种先进的深度学习(DL)模型,旨在利用角膜地形图自动检测KCN。材料和方法:优化的MSDALNet模型集成了一个多尺度扩展注意层(MSDAL),以捕获不同空间分辨率下的局部和全局角膜特征。训练采用北极海雀优化算法(Arctic Puffin Optimization, APO),这是一种受海雀觅食行为启发的元启发式算法。该模型包括可解释的AI (XAI)功能,使用Grad-CAM实现视觉可解释性。实验使用公共KCN数据集进行,该数据集包含1100多张标记的角膜地形图,分为正常、可疑和KCN类。采用标准的预处理、数据增强和性能评估指标(准确度、精密度、召回率、特异性、FNR、MCC、AUC)。结果:优化后的MSDALNet分类准确率为99.5%,精密度为99.4%,特异性为98.4%。该模型在精度、计算成本(1.2 GFLOPs)和推理速度(8.4 ms/image)方面优于CNN、ViT和Swin Transformer等现有方法。Grad-CAM可视化证实了该模型对临床相关角膜区域的关注。一项消融研究证明了提议框架中每个组成部分的影响。结论:优化后的MSDALNet结合APO为KCN检测提供了一种有效且可解释的解决方案。该模型在特征提取、计算效率和临床透明度方面表现出色。限制包括数据集大小和缺乏多模态输入。未来的工作将集中在整合不同的数据集和额外的患者数据,以提高普遍性和诊断的稳健性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.20
自引率
0.00%
发文量
451
期刊介绍: International Ophthalmology provides the clinician with articles on all the relevant subspecialties of ophthalmology, with a broad international scope. The emphasis is on presentation of the latest clinical research in the field. In addition, the journal includes regular sections devoted to new developments in technologies, products, and techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信