Lightweight Frequency Recalibration Network for Diabetic Retinopathy Multi-Lesion Segmentation

Yinghua Fu, Mangmang Liu, Ge Zhang, Jiansheng Peng
{"title":"Lightweight Frequency Recalibration Network for Diabetic Retinopathy Multi-Lesion Segmentation","authors":"Yinghua Fu, Mangmang Liu, Ge Zhang, Jiansheng Peng","doi":"10.3390/app14166941","DOIUrl":null,"url":null,"abstract":"Automated segmentation of diabetic retinopathy (DR) lesions is crucial for assessing DR severity and diagnosis. Most previous segmentation methods overlook the detrimental impact of texture information bias, resulting in suboptimal segmentation results. Additionally, the role of lesion shape is not thoroughly considered. In this paper, we propose a lightweight frequency recalibration network (LFRC-Net) for simultaneous multi-lesion DR segmentation, which integrates a frequency recalibration module into the bottleneck layers of the encoder to analyze texture information and shape features together. The module utilizes a Gaussian pyramid to generate features at different scales, constructs a Laplacian pyramid using a difference of Gaussian filter, and then analyzes object features in different frequency domains with the Laplacian pyramid. The high-frequency component handles texture information, while the low-frequency area focuses on learning the shape features of DR lesions. By adaptively recalibrating these frequency representations, our method can differentiate the objects of interest. In the decoder, we introduce a residual attention module (RAM) to enhance lesion feature extraction and efficiently suppress irrelevant information. We evaluate the proposed model’s segmentation performance on two public datasets, IDRiD and DDR, and a private dataset, an ultra-wide-field fundus images dataset. Extensive comparative experiments and ablation studies are conducted across multiple datasets. With minimal model parameters, our approach achieves an mAP_PR of 60.51%, 34.83%, and 14.35% for the segmentation of EX, HE, and MA on the DDR dataset and also obtains excellent results for EX and SE on the IDRiD dataset, which validates the effectiveness of our network.","PeriodicalId":502388,"journal":{"name":"Applied Sciences","volume":"1 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/app14166941","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Automated segmentation of diabetic retinopathy (DR) lesions is crucial for assessing DR severity and diagnosis. Most previous segmentation methods overlook the detrimental impact of texture information bias, resulting in suboptimal segmentation results. Additionally, the role of lesion shape is not thoroughly considered. In this paper, we propose a lightweight frequency recalibration network (LFRC-Net) for simultaneous multi-lesion DR segmentation, which integrates a frequency recalibration module into the bottleneck layers of the encoder to analyze texture information and shape features together. The module utilizes a Gaussian pyramid to generate features at different scales, constructs a Laplacian pyramid using a difference of Gaussian filter, and then analyzes object features in different frequency domains with the Laplacian pyramid. The high-frequency component handles texture information, while the low-frequency area focuses on learning the shape features of DR lesions. By adaptively recalibrating these frequency representations, our method can differentiate the objects of interest. In the decoder, we introduce a residual attention module (RAM) to enhance lesion feature extraction and efficiently suppress irrelevant information. We evaluate the proposed model’s segmentation performance on two public datasets, IDRiD and DDR, and a private dataset, an ultra-wide-field fundus images dataset. Extensive comparative experiments and ablation studies are conducted across multiple datasets. With minimal model parameters, our approach achieves an mAP_PR of 60.51%, 34.83%, and 14.35% for the segmentation of EX, HE, and MA on the DDR dataset and also obtains excellent results for EX and SE on the IDRiD dataset, which validates the effectiveness of our network.
用于糖尿病视网膜病变多病灶分割的轻量级频率重新校准网络
糖尿病视网膜病变(DR)病变的自动分割对于评估 DR 的严重程度和诊断至关重要。之前的大多数分割方法都忽略了纹理信息偏差的不利影响,导致分割结果不理想。此外,病变形状的作用也没有得到充分考虑。在本文中,我们提出了一种用于多病灶 DR 同步分割的轻量级频率再校准网络(LFRC-Net),它将频率再校准模块集成到编码器的瓶颈层中,以同时分析纹理信息和形状特征。该模块利用高斯金字塔生成不同尺度的特征,利用高斯差分滤波器构建拉普拉斯金字塔,然后利用拉普拉斯金字塔分析不同频域的物体特征。高频部分处理纹理信息,而低频区域则侧重于学习 DR 病变的形状特征。通过自适应地重新校准这些频率表示,我们的方法可以区分感兴趣的物体。在解码器中,我们引入了残差注意模块(RAM),以增强病变特征提取并有效抑制无关信息。我们在两个公共数据集(IDRiD 和 DDR)和一个私人数据集(超宽视野眼底图像数据集)上评估了所提模型的分割性能。我们在多个数据集上进行了广泛的对比实验和消融研究。在模型参数最小的情况下,我们的方法在 DDR 数据集上对 EX、HE 和 MA 的分割中分别取得了 60.51%、34.83% 和 14.35% 的 mAP_PR,在 IDRiD 数据集上对 EX 和 SE 的分割中也取得了优异的结果,这验证了我们网络的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信