基于新型分类头的图像级多标签视网膜疾病分类

IF 4 3区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Orhan Sivaz, Murat Aykut
{"title":"基于新型分类头的图像级多标签视网膜疾病分类","authors":"Orhan Sivaz,&nbsp;Murat Aykut","doi":"10.1016/j.compeleceng.2025.110410","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic recognition of retinal diseases is an important step that serves to halt the disease’s progression. Considering that people can suffer from more than one disease at the same time and the need to know which eye(s) is diseased, this study focused on detecting retinal diseases from multi-label fundus images separately. In the proposed model, the image is first passed through the data augmentation step and then given to the Swin Transformer V2 backbone, which focuses on capturing the global context. The powerful features that were obtained are given to the newly developed Shunted Cross-Attention (SCA) classification head, which strengthens classification ability by preventing information loss and detecting features at different scales. The proposed model incorporates the Adaptive Sharpness-Aware Minimization (ASAM) optimizer to improve convergence ability and the Scalable Neighbor Discriminative Loss (SNDL) to effectively capture inter-label dependencies on multi-label datasets. The performance evaluations have been conducted on the publicly available Ocular Disease Intelligent Recognition dataset. Considering the final score, which is the average of Kappa, F1, and Area Under Curve scores, 87.60% and 85.11% are achieved for off-site and on-site test scenarios, respectively, which is the best in the literature. When each metric is evaluated separately, it is seen at the top in almost all of them. To further emphasize the proposed SCA classification head’s robustness, it is compared with different popular classification heads and tested with different backbones and datasets, and superior results are obtained for all scenarios.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"124 ","pages":"Article 110410"},"PeriodicalIF":4.0000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image-level multi-label retinal disease classification with a novel classification head\",\"authors\":\"Orhan Sivaz,&nbsp;Murat Aykut\",\"doi\":\"10.1016/j.compeleceng.2025.110410\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Automatic recognition of retinal diseases is an important step that serves to halt the disease’s progression. Considering that people can suffer from more than one disease at the same time and the need to know which eye(s) is diseased, this study focused on detecting retinal diseases from multi-label fundus images separately. In the proposed model, the image is first passed through the data augmentation step and then given to the Swin Transformer V2 backbone, which focuses on capturing the global context. The powerful features that were obtained are given to the newly developed Shunted Cross-Attention (SCA) classification head, which strengthens classification ability by preventing information loss and detecting features at different scales. The proposed model incorporates the Adaptive Sharpness-Aware Minimization (ASAM) optimizer to improve convergence ability and the Scalable Neighbor Discriminative Loss (SNDL) to effectively capture inter-label dependencies on multi-label datasets. The performance evaluations have been conducted on the publicly available Ocular Disease Intelligent Recognition dataset. Considering the final score, which is the average of Kappa, F1, and Area Under Curve scores, 87.60% and 85.11% are achieved for off-site and on-site test scenarios, respectively, which is the best in the literature. When each metric is evaluated separately, it is seen at the top in almost all of them. To further emphasize the proposed SCA classification head’s robustness, it is compared with different popular classification heads and tested with different backbones and datasets, and superior results are obtained for all scenarios.</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"124 \",\"pages\":\"Article 110410\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625003532\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625003532","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

视网膜疾病的自动识别是阻止疾病进展的重要一步。考虑到人们可能同时患有多种疾病,并且需要知道哪只眼睛患病,本研究侧重于从多标签眼底图像中分别检测视网膜疾病。在提出的模型中,图像首先通过数据增强步骤,然后给出Swin Transformer V2主干,该主干的重点是捕获全局上下文。将获得的强大特征赋予新开发的分流交叉注意(SCA)分类头,通过防止信息丢失和检测不同尺度的特征来增强分类能力。该模型结合了自适应锐度感知最小化(ASAM)优化器来提高收敛能力,以及可扩展邻居判别损失(SNDL)来有效捕获多标签数据集上的标签间依赖关系。性能评估是在公开可用的眼病智能识别数据集上进行的。从Kappa、F1和曲线下面积得分的平均值来看,非现场和现场测试场景的最终得分分别达到87.60%和85.11%,是文献中最好的。当每个指标被单独评估时,它几乎在所有指标中都位于顶部。为了进一步强调所提出的SCA分类头的鲁棒性,将其与不同的流行分类头进行了比较,并在不同的主干和数据集上进行了测试,在所有场景下都获得了优异的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Image-level multi-label retinal disease classification with a novel classification head
Automatic recognition of retinal diseases is an important step that serves to halt the disease’s progression. Considering that people can suffer from more than one disease at the same time and the need to know which eye(s) is diseased, this study focused on detecting retinal diseases from multi-label fundus images separately. In the proposed model, the image is first passed through the data augmentation step and then given to the Swin Transformer V2 backbone, which focuses on capturing the global context. The powerful features that were obtained are given to the newly developed Shunted Cross-Attention (SCA) classification head, which strengthens classification ability by preventing information loss and detecting features at different scales. The proposed model incorporates the Adaptive Sharpness-Aware Minimization (ASAM) optimizer to improve convergence ability and the Scalable Neighbor Discriminative Loss (SNDL) to effectively capture inter-label dependencies on multi-label datasets. The performance evaluations have been conducted on the publicly available Ocular Disease Intelligent Recognition dataset. Considering the final score, which is the average of Kappa, F1, and Area Under Curve scores, 87.60% and 85.11% are achieved for off-site and on-site test scenarios, respectively, which is the best in the literature. When each metric is evaluated separately, it is seen at the top in almost all of them. To further emphasize the proposed SCA classification head’s robustness, it is compared with different popular classification heads and tested with different backbones and datasets, and superior results are obtained for all scenarios.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers & Electrical Engineering
Computers & Electrical Engineering 工程技术-工程:电子与电气
CiteScore
9.20
自引率
7.00%
发文量
661
审稿时长
47 days
期刊介绍: The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信