WaveAttention-ResNet:基于深度学习的多种视网膜疾病辅助诊断智能诊断模型。

IF 2.3
Frontiers in radiology Pub Date : 2025-07-29 eCollection Date: 2025-01-01 DOI:10.3389/fradi.2025.1608052
Biao Guo, Daqing Wang, Ruiqi Zhang, Jia Hou, Wenchao Liu, YongFei Wu, Xudong Yang, Lijuan Zhang
{"title":"WaveAttention-ResNet:基于深度学习的多种视网膜疾病辅助诊断智能诊断模型。","authors":"Biao Guo, Daqing Wang, Ruiqi Zhang, Jia Hou, Wenchao Liu, YongFei Wu, Xudong Yang, Lijuan Zhang","doi":"10.3389/fradi.2025.1608052","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study constructs a deep learning-based combined algorithm named WaveAttention ResNet (WARN) to investigate the classification accuracy for seven common retinal diseases and the feasibility of AI-assisted diagnosis in this field.</p><p><strong>Methods: </strong>First, a deep learning-based classification network is constructed. The network is built upon ResNet18, integrated with the Convolutional Block Attention Module (CBAM) and wavelet convolution modules, forming the WARN method for retinal disease classification. Second, the public OCTDL dataset is used to train WARN, which contains classification data for seven retinal disease types: age-related macular degeneration (AMD), diabetic macular edema (DME), epiretinal membrane (ERM), normal (NO), retinal artery occlusion (RAO), retinal vein occlusion (RVO), and vitreomacular interface disease (VID). During this process, ablation experiments and significance tests are conducted on WARN, and comprehensive analyses of various indicators for WARN, ResNet-18, ResNet-50, Swin Transformer v2, EfficientNet, and Vision Transformer (ViT) are performed in retinal disease classification tasks. Finally, data provided by Shanxi Eye Hospital are used for testing, and classification results are analyzed.</p><p><strong>Results: </strong>WARN demonstrates excellent performance on the public OCTDL dataset. Ablation experiments and significance tests confirm the effectiveness of WARN, achieving an accuracy of 90.68%, F1-score of 91.29%, AUC of 97.50%, precision of 93.31%, and recall of 90.68% with relatively short training time. In the dataset from Shanxi Eye Hospital, WARN also performs well, with a recall of 90.85%, precision of 79.94%, and accuracy of 89.18%.</p><p><strong>Conclusion: </strong>This study fully confirms that the constructed WARN is efficient and feasible for classifying seven common retinal diseases. It further highlights the enormous potential and broad application prospects of AI technology in the field of auxiliary medical diagnosis.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"5 ","pages":"1608052"},"PeriodicalIF":2.3000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12339474/pdf/","citationCount":"0","resultStr":"{\"title\":\"WaveAttention-ResNet: a deep learning-based intelligent diagnostic model for the auxiliary diagnosis of multiple retinal diseases.\",\"authors\":\"Biao Guo, Daqing Wang, Ruiqi Zhang, Jia Hou, Wenchao Liu, YongFei Wu, Xudong Yang, Lijuan Zhang\",\"doi\":\"10.3389/fradi.2025.1608052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>This study constructs a deep learning-based combined algorithm named WaveAttention ResNet (WARN) to investigate the classification accuracy for seven common retinal diseases and the feasibility of AI-assisted diagnosis in this field.</p><p><strong>Methods: </strong>First, a deep learning-based classification network is constructed. The network is built upon ResNet18, integrated with the Convolutional Block Attention Module (CBAM) and wavelet convolution modules, forming the WARN method for retinal disease classification. Second, the public OCTDL dataset is used to train WARN, which contains classification data for seven retinal disease types: age-related macular degeneration (AMD), diabetic macular edema (DME), epiretinal membrane (ERM), normal (NO), retinal artery occlusion (RAO), retinal vein occlusion (RVO), and vitreomacular interface disease (VID). During this process, ablation experiments and significance tests are conducted on WARN, and comprehensive analyses of various indicators for WARN, ResNet-18, ResNet-50, Swin Transformer v2, EfficientNet, and Vision Transformer (ViT) are performed in retinal disease classification tasks. Finally, data provided by Shanxi Eye Hospital are used for testing, and classification results are analyzed.</p><p><strong>Results: </strong>WARN demonstrates excellent performance on the public OCTDL dataset. Ablation experiments and significance tests confirm the effectiveness of WARN, achieving an accuracy of 90.68%, F1-score of 91.29%, AUC of 97.50%, precision of 93.31%, and recall of 90.68% with relatively short training time. In the dataset from Shanxi Eye Hospital, WARN also performs well, with a recall of 90.85%, precision of 79.94%, and accuracy of 89.18%.</p><p><strong>Conclusion: </strong>This study fully confirms that the constructed WARN is efficient and feasible for classifying seven common retinal diseases. It further highlights the enormous potential and broad application prospects of AI technology in the field of auxiliary medical diagnosis.</p>\",\"PeriodicalId\":73101,\"journal\":{\"name\":\"Frontiers in radiology\",\"volume\":\"5 \",\"pages\":\"1608052\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12339474/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in radiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fradi.2025.1608052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in radiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fradi.2025.1608052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:构建基于深度学习的WaveAttention ResNet (WARN)组合算法,探讨7种常见视网膜疾病的分类准确率及人工智能辅助诊断在该领域的可行性。方法:首先,构建基于深度学习的分类网络。该网络以ResNet18为基础,结合卷积块注意模块(Convolutional Block Attention Module, CBAM)和小波卷积模块,形成视网膜疾病分类的WARN方法。其次,使用公开的OCTDL数据集训练WARN,该数据集包含7种视网膜疾病类型的分类数据:年龄相关性黄斑变性(AMD)、糖尿病性黄斑水肿(DME)、视网膜前膜(ERM)、正常(NO)、视网膜动脉闭塞(RAO)、视网膜静脉闭塞(RVO)和玻璃体黄斑界面病(VID)。在此过程中,对WARN进行消融实验和显著性检验,综合分析WARN、ResNet-18、ResNet-50、Swin Transformer v2、EfficientNet、Vision Transformer (ViT)在视网膜疾病分类任务中的各项指标。最后利用山西省眼科医院提供的数据进行检验,并对分类结果进行分析。结果:WARN在公共OCTDL数据集上展示了出色的性能。消融实验和显著性检验证实了WARN的有效性,训练时间相对较短,准确率为90.68%,f1得分为91.29%,AUC为97.50%,准确率为93.31%,召回率为90.68%。在山西眼科医院的数据集中,WARN也表现良好,召回率为90.85%,准确率为79.94%,准确率为89.18%。结论:本研究充分证实了构建的WARN对7种常见视网膜疾病的分类是有效可行的。进一步凸显了人工智能技术在医疗辅助诊断领域的巨大潜力和广阔应用前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
WaveAttention-ResNet: a deep learning-based intelligent diagnostic model for the auxiliary diagnosis of multiple retinal diseases.

Objective: This study constructs a deep learning-based combined algorithm named WaveAttention ResNet (WARN) to investigate the classification accuracy for seven common retinal diseases and the feasibility of AI-assisted diagnosis in this field.

Methods: First, a deep learning-based classification network is constructed. The network is built upon ResNet18, integrated with the Convolutional Block Attention Module (CBAM) and wavelet convolution modules, forming the WARN method for retinal disease classification. Second, the public OCTDL dataset is used to train WARN, which contains classification data for seven retinal disease types: age-related macular degeneration (AMD), diabetic macular edema (DME), epiretinal membrane (ERM), normal (NO), retinal artery occlusion (RAO), retinal vein occlusion (RVO), and vitreomacular interface disease (VID). During this process, ablation experiments and significance tests are conducted on WARN, and comprehensive analyses of various indicators for WARN, ResNet-18, ResNet-50, Swin Transformer v2, EfficientNet, and Vision Transformer (ViT) are performed in retinal disease classification tasks. Finally, data provided by Shanxi Eye Hospital are used for testing, and classification results are analyzed.

Results: WARN demonstrates excellent performance on the public OCTDL dataset. Ablation experiments and significance tests confirm the effectiveness of WARN, achieving an accuracy of 90.68%, F1-score of 91.29%, AUC of 97.50%, precision of 93.31%, and recall of 90.68% with relatively short training time. In the dataset from Shanxi Eye Hospital, WARN also performs well, with a recall of 90.85%, precision of 79.94%, and accuracy of 89.18%.

Conclusion: This study fully confirms that the constructed WARN is efficient and feasible for classifying seven common retinal diseases. It further highlights the enormous potential and broad application prospects of AI technology in the field of auxiliary medical diagnosis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信