{"title":"基于新型分类头的图像级多标签视网膜疾病分类","authors":"Orhan Sivaz, Murat Aykut","doi":"10.1016/j.compeleceng.2025.110410","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic recognition of retinal diseases is an important step that serves to halt the disease’s progression. Considering that people can suffer from more than one disease at the same time and the need to know which eye(s) is diseased, this study focused on detecting retinal diseases from multi-label fundus images separately. In the proposed model, the image is first passed through the data augmentation step and then given to the Swin Transformer V2 backbone, which focuses on capturing the global context. The powerful features that were obtained are given to the newly developed Shunted Cross-Attention (SCA) classification head, which strengthens classification ability by preventing information loss and detecting features at different scales. The proposed model incorporates the Adaptive Sharpness-Aware Minimization (ASAM) optimizer to improve convergence ability and the Scalable Neighbor Discriminative Loss (SNDL) to effectively capture inter-label dependencies on multi-label datasets. The performance evaluations have been conducted on the publicly available Ocular Disease Intelligent Recognition dataset. Considering the final score, which is the average of Kappa, F1, and Area Under Curve scores, 87.60% and 85.11% are achieved for off-site and on-site test scenarios, respectively, which is the best in the literature. When each metric is evaluated separately, it is seen at the top in almost all of them. To further emphasize the proposed SCA classification head’s robustness, it is compared with different popular classification heads and tested with different backbones and datasets, and superior results are obtained for all scenarios.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"124 ","pages":"Article 110410"},"PeriodicalIF":4.0000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image-level multi-label retinal disease classification with a novel classification head\",\"authors\":\"Orhan Sivaz, Murat Aykut\",\"doi\":\"10.1016/j.compeleceng.2025.110410\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Automatic recognition of retinal diseases is an important step that serves to halt the disease’s progression. Considering that people can suffer from more than one disease at the same time and the need to know which eye(s) is diseased, this study focused on detecting retinal diseases from multi-label fundus images separately. In the proposed model, the image is first passed through the data augmentation step and then given to the Swin Transformer V2 backbone, which focuses on capturing the global context. The powerful features that were obtained are given to the newly developed Shunted Cross-Attention (SCA) classification head, which strengthens classification ability by preventing information loss and detecting features at different scales. The proposed model incorporates the Adaptive Sharpness-Aware Minimization (ASAM) optimizer to improve convergence ability and the Scalable Neighbor Discriminative Loss (SNDL) to effectively capture inter-label dependencies on multi-label datasets. The performance evaluations have been conducted on the publicly available Ocular Disease Intelligent Recognition dataset. Considering the final score, which is the average of Kappa, F1, and Area Under Curve scores, 87.60% and 85.11% are achieved for off-site and on-site test scenarios, respectively, which is the best in the literature. When each metric is evaluated separately, it is seen at the top in almost all of them. To further emphasize the proposed SCA classification head’s robustness, it is compared with different popular classification heads and tested with different backbones and datasets, and superior results are obtained for all scenarios.</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"124 \",\"pages\":\"Article 110410\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625003532\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625003532","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Image-level multi-label retinal disease classification with a novel classification head
Automatic recognition of retinal diseases is an important step that serves to halt the disease’s progression. Considering that people can suffer from more than one disease at the same time and the need to know which eye(s) is diseased, this study focused on detecting retinal diseases from multi-label fundus images separately. In the proposed model, the image is first passed through the data augmentation step and then given to the Swin Transformer V2 backbone, which focuses on capturing the global context. The powerful features that were obtained are given to the newly developed Shunted Cross-Attention (SCA) classification head, which strengthens classification ability by preventing information loss and detecting features at different scales. The proposed model incorporates the Adaptive Sharpness-Aware Minimization (ASAM) optimizer to improve convergence ability and the Scalable Neighbor Discriminative Loss (SNDL) to effectively capture inter-label dependencies on multi-label datasets. The performance evaluations have been conducted on the publicly available Ocular Disease Intelligent Recognition dataset. Considering the final score, which is the average of Kappa, F1, and Area Under Curve scores, 87.60% and 85.11% are achieved for off-site and on-site test scenarios, respectively, which is the best in the literature. When each metric is evaluated separately, it is seen at the top in almost all of them. To further emphasize the proposed SCA classification head’s robustness, it is compared with different popular classification heads and tested with different backbones and datasets, and superior results are obtained for all scenarios.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.