B. Chitradevi , P. Mathiyalagan , A. Ramachandran , R. Dhanapal , K. Sheikdavood , S. Gnanamurugan
{"title":"Conv-ViT: An improved discrete convolution-based vision transformer for diabetic retinopathy detection","authors":"B. Chitradevi , P. Mathiyalagan , A. Ramachandran , R. Dhanapal , K. Sheikdavood , S. Gnanamurugan","doi":"10.1016/j.fraope.2025.100477","DOIUrl":null,"url":null,"abstract":"<div><div>Timely detection of Diabetic Retinopathy (DR), a major cause of irreversible blindness, is important to avert vision impairment. Present computer-aided diagnostic methods often suffer from poor segmentation, image noise, and a lack of generalization across datasets. This study proposes Conv-ViT, a hybrid model that integrates convolutional networks to overcome the disadvantages of the aforementioned models. Probability-based particle swarm optimization (PBPSO) was applied to achieve accurate segmentation, median filtering was applied to remove noise, and local binary pattern (LBP) was used to extract texture features. An innovative Electric Fish Optimization Arithmetic Algorithm (EFAOA), which enhances the compromise between exploration and exploitation during hyperparameter fine-tuning, was introduced to bolster model efficiency. Evaluation on the MESSIDOR dataset showed remarkable results, with an accuracy of 99.58 %, 98.86 %, 98.87 %, and an F1-score of 98.85 %, respectively. These results indicate the generalizability of the model beyond that of current state-of-the-art algorithms. The Conv-ViT framework represents a robust and scalable solution for the early detection of DR and holds great promise for use in automated cloud-based diagnostic systems.</div></div>","PeriodicalId":100554,"journal":{"name":"Franklin Open","volume":"14 ","pages":"Article 100477"},"PeriodicalIF":0.0000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Franklin Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2773186325002622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/12/25 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Timely detection of Diabetic Retinopathy (DR), a major cause of irreversible blindness, is important to avert vision impairment. Present computer-aided diagnostic methods often suffer from poor segmentation, image noise, and a lack of generalization across datasets. This study proposes Conv-ViT, a hybrid model that integrates convolutional networks to overcome the disadvantages of the aforementioned models. Probability-based particle swarm optimization (PBPSO) was applied to achieve accurate segmentation, median filtering was applied to remove noise, and local binary pattern (LBP) was used to extract texture features. An innovative Electric Fish Optimization Arithmetic Algorithm (EFAOA), which enhances the compromise between exploration and exploitation during hyperparameter fine-tuning, was introduced to bolster model efficiency. Evaluation on the MESSIDOR dataset showed remarkable results, with an accuracy of 99.58 %, 98.86 %, 98.87 %, and an F1-score of 98.85 %, respectively. These results indicate the generalizability of the model beyond that of current state-of-the-art algorithms. The Conv-ViT framework represents a robust and scalable solution for the early detection of DR and holds great promise for use in automated cloud-based diagnostic systems.