ViT-HHO: Optimized vision transformer for diabetic retinopathy detection using Harris Hawk optimization
Vishal Awasthi, Namita Awasthi, Hemant Kumar, Shubhendra Singh, Prabal Pratap Singh, Poonam Dixit, Rashi Agarwal
MethodsX, volume 13, Article 103018 (published 2024-10-23)
DOI: 10.1016/j.mex.2024.103018
https://www.sciencedirect.com/science/article/pii/S2215016124004692
Diabetic retinopathy (DR) is a significant cause of vision impairment worldwide, so timely and precise detection is essential to prevent severe consequences. This study presents an optimized Vision Transformer (ViT) model that uses Harris Hawk Optimization (HHO) to improve automated DR detection. The ViT architecture uses self-attention to capture both local and global features in retinal images, while HHO tunes key hyperparameters to maximize model performance. On the APTOS-2019 dataset, the proposed ViT-HHO model achieved 99.83 % accuracy, 99.78 % sensitivity, 99.85 % specificity, and 99.80 % AUC-ROC, surpassing traditional CNNs and alternative optimization techniques. It also generalized well to the IDRiD dataset, with an accuracy of 99.11 % and an AUC-ROC of 99.12 %. These results indicate that ViT-HHO has the potential to enhance clinical DR detection with high precision and reliability.
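To make the idea of HHO-driven hyperparameter tuning concrete, the following is a minimal sketch, not the authors' implementation. It assumes a simplified HHO variant (exploration, soft besiege, and hard besiege only; the Lévy-flight "rapid dive" phases of the full algorithm are omitted, and the best solution is updated greedily). The objective `toy_validation_loss`, the two hyperparameters (log10 learning rate and dropout), and their bounds are hypothetical stand-ins for actually training a ViT and measuring validation loss.

```python
import numpy as np


def toy_validation_loss(params):
    """Hypothetical stand-in for 'train a ViT and return its validation loss'."""
    log_lr, dropout = params
    return (log_lr + 3.5) ** 2 + 4.0 * (dropout - 0.1) ** 2


def hho_minimize(objective, lower, upper, n_hawks=10, n_iter=50, seed=0):
    """Simplified Harris Hawk Optimization (no Levy-flight dive phases)."""
    rng = np.random.default_rng(seed)
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)

    # Each hawk is one candidate hyperparameter vector inside the bounds.
    hawks = rng.uniform(lower, upper, size=(n_hawks, lower.size))
    fitness = np.array([objective(h) for h in hawks])
    best_idx = int(np.argmin(fitness))
    rabbit, rabbit_fit = hawks[best_idx].copy(), float(fitness[best_idx])
    history = [rabbit_fit]  # best loss seen so far, one entry per iteration

    for t in range(n_iter):
        mean_pos = hawks.mean(axis=0)
        for i in range(n_hawks):
            # Escaping energy shrinks over time, shifting search to exploitation.
            energy = 2.0 * rng.uniform(-1.0, 1.0) * (1.0 - t / n_iter)
            if abs(energy) >= 1.0:
                # Exploration: perch relative to a random hawk or to the group mean.
                if rng.random() >= 0.5:
                    rand_hawk = hawks[rng.integers(n_hawks)]
                    new = rand_hawk - rng.random() * np.abs(
                        rand_hawk - 2.0 * rng.random() * hawks[i]
                    )
                else:
                    new = (rabbit - mean_pos) - rng.random() * (
                        lower + rng.random() * (upper - lower)
                    )
            elif abs(energy) >= 0.5:
                # Soft besiege: encircle the best solution with a random jump.
                jump = 2.0 * (1.0 - rng.random())
                new = (rabbit - hawks[i]) - energy * np.abs(jump * rabbit - hawks[i])
            else:
                # Hard besiege: move tightly toward the best solution.
                new = rabbit - energy * np.abs(rabbit - hawks[i])

            hawks[i] = np.clip(new, lower, upper)  # keep candidates in bounds
            fit = objective(hawks[i])
            if fit < rabbit_fit:
                rabbit, rabbit_fit = hawks[i].copy(), float(fit)
        history.append(rabbit_fit)

    return rabbit, rabbit_fit, history


# Search log10(learning rate) in [-5, -2] and dropout in [0, 0.5].
best, loss, hist = hho_minimize(toy_validation_loss, [-5.0, 0.0], [-2.0, 0.5])
print("best hyperparameters:", best, "loss:", loss)
```

In a real setting each call to the objective would involve a full training run, so the population size and iteration count would be kept small and the search space chosen with care.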
• An optimized Vision Transformer (ViT) model was developed using HHO for improved detection of diabetic retinopathy (DR).
• The model was validated on the APTOS-2019 and IDRiD datasets, demonstrating superior accuracy and AUC-ROC metrics.
• The model's generalization and robustness were demonstrated through comprehensive performance evaluations.
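For readers unfamiliar with the reported metrics, the sketch below shows how accuracy, sensitivity, specificity, and AUC-ROC can be computed. It assumes a binary setting (1 = DR, 0 = no DR), which the text above does not specify; the labels, scores, and the 0.5 decision threshold are made-up illustration values, not data from the study.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity (recall on positives), and specificity (recall on negatives)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


def auc_roc(y_true, scores):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up labels and model scores for illustration only.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.1, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

print(binary_metrics(y_true, y_pred))
print("AUC-ROC:", auc_roc(y_true, scores))
```

Sensitivity and specificity depend on the chosen threshold, whereas AUC-ROC summarizes ranking quality across all thresholds, which is why the two are usually reported together.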