Yinghua Fu, Mangmang Liu, Ge Zhang, Jiansheng Peng
{"title":"Lightweight Frequency Recalibration Network for Diabetic Retinopathy Multi-Lesion Segmentation","authors":"Yinghua Fu, Mangmang Liu, Ge Zhang, Jiansheng Peng","doi":"10.3390/app14166941","DOIUrl":null,"url":null,"abstract":"Automated segmentation of diabetic retinopathy (DR) lesions is crucial for assessing DR severity and diagnosis. Most previous segmentation methods overlook the detrimental impact of texture information bias, resulting in suboptimal segmentation results. Additionally, the role of lesion shape is not thoroughly considered. In this paper, we propose a lightweight frequency recalibration network (LFRC-Net) for simultaneous multi-lesion DR segmentation, which integrates a frequency recalibration module into the bottleneck layers of the encoder to analyze texture information and shape features together. The module utilizes a Gaussian pyramid to generate features at different scales, constructs a Laplacian pyramid using a difference of Gaussian filter, and then analyzes object features in different frequency domains with the Laplacian pyramid. The high-frequency component handles texture information, while the low-frequency area focuses on learning the shape features of DR lesions. By adaptively recalibrating these frequency representations, our method can differentiate the objects of interest. In the decoder, we introduce a residual attention module (RAM) to enhance lesion feature extraction and efficiently suppress irrelevant information. We evaluate the proposed model’s segmentation performance on two public datasets, IDRiD and DDR, and a private dataset, an ultra-wide-field fundus images dataset. Extensive comparative experiments and ablation studies are conducted across multiple datasets. With minimal model parameters, our approach achieves an mAP_PR of 60.51%, 34.83%, and 14.35% for the segmentation of EX, HE, and MA on the DDR dataset and also obtains excellent results for EX and SE on the IDRiD dataset, which validates the effectiveness of our network.","PeriodicalId":502388,"journal":{"name":"Applied Sciences","volume":"1 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/app14166941","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Automated segmentation of diabetic retinopathy (DR) lesions is crucial for assessing DR severity and diagnosis. Most previous segmentation methods overlook the detrimental impact of texture information bias, resulting in suboptimal segmentation results. Additionally, the role of lesion shape is not thoroughly considered. In this paper, we propose a lightweight frequency recalibration network (LFRC-Net) for simultaneous multi-lesion DR segmentation, which integrates a frequency recalibration module into the bottleneck layers of the encoder to analyze texture information and shape features together. The module utilizes a Gaussian pyramid to generate features at different scales, constructs a Laplacian pyramid using a difference of Gaussian filter, and then analyzes object features in different frequency domains with the Laplacian pyramid. The high-frequency component handles texture information, while the low-frequency area focuses on learning the shape features of DR lesions. By adaptively recalibrating these frequency representations, our method can differentiate the objects of interest. In the decoder, we introduce a residual attention module (RAM) to enhance lesion feature extraction and efficiently suppress irrelevant information. We evaluate the proposed model’s segmentation performance on two public datasets, IDRiD and DDR, and a private dataset, an ultra-wide-field fundus images dataset. Extensive comparative experiments and ablation studies are conducted across multiple datasets. With minimal model parameters, our approach achieves an mAP_PR of 60.51%, 34.83%, and 14.35% for the segmentation of EX, HE, and MA on the DDR dataset and also obtains excellent results for EX and SE on the IDRiD dataset, which validates the effectiveness of our network.
糖尿病视网膜病变(DR)病变的自动分割对于评估 DR 的严重程度和诊断至关重要。之前的大多数分割方法都忽略了纹理信息偏差的不利影响,导致分割结果不理想。此外,病变形状的作用也没有得到充分考虑。在本文中,我们提出了一种用于多病灶 DR 同步分割的轻量级频率再校准网络(LFRC-Net),它将频率再校准模块集成到编码器的瓶颈层中,以同时分析纹理信息和形状特征。该模块利用高斯金字塔生成不同尺度的特征,利用高斯差分滤波器构建拉普拉斯金字塔,然后利用拉普拉斯金字塔分析不同频域的物体特征。高频部分处理纹理信息,而低频区域则侧重于学习 DR 病变的形状特征。通过自适应地重新校准这些频率表示,我们的方法可以区分感兴趣的物体。在解码器中,我们引入了残差注意模块(RAM),以增强病变特征提取并有效抑制无关信息。我们在两个公共数据集(IDRiD 和 DDR)和一个私人数据集(超宽视野眼底图像数据集)上评估了所提模型的分割性能。我们在多个数据集上进行了广泛的对比实验和消融研究。在模型参数最小的情况下,我们的方法在 DDR 数据集上对 EX、HE 和 MA 的分割中分别取得了 60.51%、34.83% 和 14.35% 的 mAP_PR,在 IDRiD 数据集上对 EX 和 SE 的分割中也取得了优异的结果,这验证了我们网络的有效性。