WFDENet:基于小波频分解增强网络的糖尿病视网膜病变分割

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Xuan Li, Ding Ma, Xiangqian Wu
{"title":"WFDENet:基于小波频分解增强网络的糖尿病视网膜病变分割","authors":"Xuan Li,&nbsp;Ding Ma,&nbsp;Xiangqian Wu","doi":"10.1016/j.patcog.2025.112492","DOIUrl":null,"url":null,"abstract":"<div><div>The acquisition of precise semantic and detailed information is indispensable for high-accuracy diabetic retinopathy lesion segmentation (DRLS). To achieve this, noticing that high- and low-level encoder features respectively contain rich semantics and details, most existing DRLS methods focus on the design of delicate multi-level feature refinement and fusion manners. However, they ignore the exploration of intrinsic low- and high-frequency information of multi-level features, which can also describe the semantics and details. To fill this gap, we propose a Wavelet-based Frequency Decomposition and Enhancement Network (WFDENet), which simultaneously refines semantic and detailed representations by enhancing the low- and high-frequency components of the multi-level encoder features. Specifically, the low- and high-frequency components, which are acquired via discrete wavelet transform (DWT), are boosted by a low-frequency booster (LFB) and a high-frequency booster (HFB), respectively. High-frequency components contain abundant details but also more noise. To suppress the noise and strengthen critical features, in HFB, we devise a complex convolutional frequency attention module (CCFAM), which utilizes complex convolutions to generate dynamic complex-valued channel and spatial attention to improve the Fourier spectrum of high-frequency components. Moreover, considering the importance of multi-scale information, we aggregate the multi-scale frequency features to enrich the frequency components in both LFB and HFB. Experimental results on IDRiD and DDR datasets show that our WFDENet outperforms state-of-the-art methods. The source code is available at <span><span>https://github.com/xuanli01/WFDENet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"172 ","pages":"Article 112492"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"WFDENet: Wavelet-based frequency decomposition and enhancement network for diabetic retinopathy lesion segmentation\",\"authors\":\"Xuan Li,&nbsp;Ding Ma,&nbsp;Xiangqian Wu\",\"doi\":\"10.1016/j.patcog.2025.112492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The acquisition of precise semantic and detailed information is indispensable for high-accuracy diabetic retinopathy lesion segmentation (DRLS). To achieve this, noticing that high- and low-level encoder features respectively contain rich semantics and details, most existing DRLS methods focus on the design of delicate multi-level feature refinement and fusion manners. However, they ignore the exploration of intrinsic low- and high-frequency information of multi-level features, which can also describe the semantics and details. To fill this gap, we propose a Wavelet-based Frequency Decomposition and Enhancement Network (WFDENet), which simultaneously refines semantic and detailed representations by enhancing the low- and high-frequency components of the multi-level encoder features. Specifically, the low- and high-frequency components, which are acquired via discrete wavelet transform (DWT), are boosted by a low-frequency booster (LFB) and a high-frequency booster (HFB), respectively. High-frequency components contain abundant details but also more noise. To suppress the noise and strengthen critical features, in HFB, we devise a complex convolutional frequency attention module (CCFAM), which utilizes complex convolutions to generate dynamic complex-valued channel and spatial attention to improve the Fourier spectrum of high-frequency components. Moreover, considering the importance of multi-scale information, we aggregate the multi-scale frequency features to enrich the frequency components in both LFB and HFB. Experimental results on IDRiD and DDR datasets show that our WFDENet outperforms state-of-the-art methods. The source code is available at <span><span>https://github.com/xuanli01/WFDENet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"172 \",\"pages\":\"Article 112492\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325011550\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325011550","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

准确的语义信息和详细信息的获取是实现糖尿病视网膜病变准确分割的必要条件。为了实现这一目标,考虑到高级和低级编码器特征分别包含丰富的语义和细节,大多数现有的DRLS方法都侧重于设计精细的多级特征细化和融合方式。然而,它们忽略了对多层次特征固有的低频和高频信息的探索,这些信息也可以描述语义和细节。为了填补这一空白,我们提出了一种基于小波的频率分解和增强网络(WFDENet),该网络通过增强多级编码器特征的低频和高频成分来同时细化语义和详细表示。具体来说,通过离散小波变换(DWT)获得的低频和高频分量分别由低频增强器(LFB)和高频增强器(HFB)增强。高频元件包含丰富的细节,但也有更多的噪声。为了抑制噪声和增强关键特征,在HFB中,我们设计了一个复卷积频率注意模块(CCFAM),该模块利用复卷积产生动态复值信道和空间注意来改善高频分量的傅里叶频谱。此外,考虑到多尺度信息的重要性,我们对多尺度频率特征进行聚合,丰富了LFB和HFB的频率成分。在IDRiD和DDR数据集上的实验结果表明,我们的WFDENet优于最先进的方法。源代码可从https://github.com/xuanli01/WFDENet获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
WFDENet: Wavelet-based frequency decomposition and enhancement network for diabetic retinopathy lesion segmentation
The acquisition of precise semantic and detailed information is indispensable for high-accuracy diabetic retinopathy lesion segmentation (DRLS). To achieve this, noticing that high- and low-level encoder features respectively contain rich semantics and details, most existing DRLS methods focus on the design of delicate multi-level feature refinement and fusion manners. However, they ignore the exploration of intrinsic low- and high-frequency information of multi-level features, which can also describe the semantics and details. To fill this gap, we propose a Wavelet-based Frequency Decomposition and Enhancement Network (WFDENet), which simultaneously refines semantic and detailed representations by enhancing the low- and high-frequency components of the multi-level encoder features. Specifically, the low- and high-frequency components, which are acquired via discrete wavelet transform (DWT), are boosted by a low-frequency booster (LFB) and a high-frequency booster (HFB), respectively. High-frequency components contain abundant details but also more noise. To suppress the noise and strengthen critical features, in HFB, we devise a complex convolutional frequency attention module (CCFAM), which utilizes complex convolutions to generate dynamic complex-valued channel and spatial attention to improve the Fourier spectrum of high-frequency components. Moreover, considering the importance of multi-scale information, we aggregate the multi-scale frequency features to enrich the frequency components in both LFB and HFB. Experimental results on IDRiD and DDR datasets show that our WFDENet outperforms state-of-the-art methods. The source code is available at https://github.com/xuanli01/WFDENet.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信