WFDENet：基于小波频分解增强网络的糖尿病视网膜病变分割

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Pattern Recognition Pub Date : 2025-10-06 DOI:10.1016/j.patcog.2025.112492

Xuan Li, Ding Ma, Xiangqian Wu

{"title":"WFDENet：基于小波频分解增强网络的糖尿病视网膜病变分割","authors":"Xuan Li, Ding Ma, Xiangqian Wu","doi":"10.1016/j.patcog.2025.112492","DOIUrl":null,"url":null,"abstract":"<div><div>The acquisition of precise semantic and detailed information is indispensable for high-accuracy diabetic retinopathy lesion segmentation (DRLS). To achieve this, noticing that high- and low-level encoder features respectively contain rich semantics and details, most existing DRLS methods focus on the design of delicate multi-level feature refinement and fusion manners. However, they ignore the exploration of intrinsic low- and high-frequency information of multi-level features, which can also describe the semantics and details. To fill this gap, we propose a Wavelet-based Frequency Decomposition and Enhancement Network (WFDENet), which simultaneously refines semantic and detailed representations by enhancing the low- and high-frequency components of the multi-level encoder features. Specifically, the low- and high-frequency components, which are acquired via discrete wavelet transform (DWT), are boosted by a low-frequency booster (LFB) and a high-frequency booster (HFB), respectively. High-frequency components contain abundant details but also more noise. To suppress the noise and strengthen critical features, in HFB, we devise a complex convolutional frequency attention module (CCFAM), which utilizes complex convolutions to generate dynamic complex-valued channel and spatial attention to improve the Fourier spectrum of high-frequency components. Moreover, considering the importance of multi-scale information, we aggregate the multi-scale frequency features to enrich the frequency components in both LFB and HFB. Experimental results on IDRiD and DDR datasets show that our WFDENet outperforms state-of-the-art methods. The source code is available at <span><span>https://github.com/xuanli01/WFDENet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"172 ","pages":"Article 112492"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"WFDENet: Wavelet-based frequency decomposition and enhancement network for diabetic retinopathy lesion segmentation\",\"authors\":\"Xuan Li, Ding Ma, Xiangqian Wu\",\"doi\":\"10.1016/j.patcog.2025.112492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The acquisition of precise semantic and detailed information is indispensable for high-accuracy diabetic retinopathy lesion segmentation (DRLS). To achieve this, noticing that high- and low-level encoder features respectively contain rich semantics and details, most existing DRLS methods focus on the design of delicate multi-level feature refinement and fusion manners. However, they ignore the exploration of intrinsic low- and high-frequency information of multi-level features, which can also describe the semantics and details. To fill this gap, we propose a Wavelet-based Frequency Decomposition and Enhancement Network (WFDENet), which simultaneously refines semantic and detailed representations by enhancing the low- and high-frequency components of the multi-level encoder features. Specifically, the low- and high-frequency components, which are acquired via discrete wavelet transform (DWT), are boosted by a low-frequency booster (LFB) and a high-frequency booster (HFB), respectively. High-frequency components contain abundant details but also more noise. To suppress the noise and strengthen critical features, in HFB, we devise a complex convolutional frequency attention module (CCFAM), which utilizes complex convolutions to generate dynamic complex-valued channel and spatial attention to improve the Fourier spectrum of high-frequency components. Moreover, considering the importance of multi-scale information, we aggregate the multi-scale frequency features to enrich the frequency components in both LFB and HFB. Experimental results on IDRiD and DDR datasets show that our WFDENet outperforms state-of-the-art methods. The source code is available at <span><span>https://github.com/xuanli01/WFDENet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"172 \",\"pages\":\"Article 112492\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325011550\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325011550","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

准确的语义信息和详细信息的获取是实现糖尿病视网膜病变准确分割的必要条件。为了实现这一目标，考虑到高级和低级编码器特征分别包含丰富的语义和细节，大多数现有的DRLS方法都侧重于设计精细的多级特征细化和融合方式。然而，它们忽略了对多层次特征固有的低频和高频信息的探索，这些信息也可以描述语义和细节。为了填补这一空白，我们提出了一种基于小波的频率分解和增强网络（WFDENet），该网络通过增强多级编码器特征的低频和高频成分来同时细化语义和详细表示。具体来说，通过离散小波变换（DWT）获得的低频和高频分量分别由低频增强器（LFB）和高频增强器（HFB）增强。高频元件包含丰富的细节，但也有更多的噪声。为了抑制噪声和增强关键特征，在HFB中，我们设计了一个复卷积频率注意模块（CCFAM），该模块利用复卷积产生动态复值信道和空间注意来改善高频分量的傅里叶频谱。此外，考虑到多尺度信息的重要性，我们对多尺度频率特征进行聚合，丰富了LFB和HFB的频率成分。在IDRiD和DDR数据集上的实验结果表明，我们的WFDENet优于最先进的方法。源代码可从https://github.com/xuanli01/WFDENet获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

WFDENet: Wavelet-based frequency decomposition and enhancement network for diabetic retinopathy lesion segmentation

The acquisition of precise semantic and detailed information is indispensable for high-accuracy diabetic retinopathy lesion segmentation (DRLS). To achieve this, noticing that high- and low-level encoder features respectively contain rich semantics and details, most existing DRLS methods focus on the design of delicate multi-level feature refinement and fusion manners. However, they ignore the exploration of intrinsic low- and high-frequency information of multi-level features, which can also describe the semantics and details. To fill this gap, we propose a Wavelet-based Frequency Decomposition and Enhancement Network (WFDENet), which simultaneously refines semantic and detailed representations by enhancing the low- and high-frequency components of the multi-level encoder features. Specifically, the low- and high-frequency components, which are acquired via discrete wavelet transform (DWT), are boosted by a low-frequency booster (LFB) and a high-frequency booster (HFB), respectively. High-frequency components contain abundant details but also more noise. To suppress the noise and strengthen critical features, in HFB, we devise a complex convolutional frequency attention module (CCFAM), which utilizes complex convolutions to generate dynamic complex-valued channel and spatial attention to improve the Fourier spectrum of high-frequency components. Moreover, considering the importance of multi-scale information, we aggregate the multi-scale frequency features to enrich the frequency components in both LFB and HFB. Experimental results on IDRiD and DDR datasets show that our WFDENet outperforms state-of-the-art methods. The source code is available at https://github.com/xuanli01/WFDENet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.