{"title":"Multi-scale local-global transformer with contrastive learning for biomarkers segmentation in retinal OCT images","authors":"Xiaoming Liu , Yuanzhe Ding , Ying Zhang , Jinshan Tang","doi":"10.1016/j.bbe.2024.02.001","DOIUrl":null,"url":null,"abstract":"<div><p>Quantitative analysis of biomarkers in Optical Coherence Tomography (OCT) images plays an import role in the diagnosis and treatment of retinal diseases. However, biomarker segmentation in retinal OCT images is very hard due to the large variations in size and shape of retinal biomarkers, blurred boundaries, low contrast, and speckle interference. We proposed a novel <strong>M</strong>ulti-<strong>s</strong>cale <strong>L</strong>ocal-<strong>G</strong>lobal <strong>T</strong>ransformer network (MsLGT-Net) for biomarker segmentation in retinal OCT images. The network combines the proposed <strong>M</strong>ulti-scale <strong>F</strong>usion <strong>A</strong>ttention (MFA) module, <strong>L</strong>ocal-<strong>G</strong>lobal <strong>T</strong>ransformer (LGT) module, and <strong>C</strong>ontrastive <strong>L</strong>earning <strong>E</strong>nhancement (CLE) module to tackle the challenges of biomarker segmentation. Specifically, the proposed MFA module aims to enhance the network’s ability to learn multi-scale features of retinal biomarkers by effectively combining the local detail information and contextual semantic information of biomarkers at different scales, and improve the representation ability for different classes of biomarkers. The LGT module is designed to learn local and global information adaptively from multi-scale fused features to address the challenge of small biomarker segmentation. In addition, to distinguish features between different types of retinal biomarkers, we propose the CLE module to enhance the feature representation of different biomarkers. Our proposed method is validated on one public dataset and one local dataset. The experimental results show that the proposed method is more effective than other state-of-the-art methods.</p></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":"44 1","pages":"Pages 231-246"},"PeriodicalIF":5.3000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S020852162400007X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Quantitative analysis of biomarkers in Optical Coherence Tomography (OCT) images plays an import role in the diagnosis and treatment of retinal diseases. However, biomarker segmentation in retinal OCT images is very hard due to the large variations in size and shape of retinal biomarkers, blurred boundaries, low contrast, and speckle interference. We proposed a novel Multi-scale Local-Global Transformer network (MsLGT-Net) for biomarker segmentation in retinal OCT images. The network combines the proposed Multi-scale Fusion Attention (MFA) module, Local-Global Transformer (LGT) module, and Contrastive Learning Enhancement (CLE) module to tackle the challenges of biomarker segmentation. Specifically, the proposed MFA module aims to enhance the network’s ability to learn multi-scale features of retinal biomarkers by effectively combining the local detail information and contextual semantic information of biomarkers at different scales, and improve the representation ability for different classes of biomarkers. The LGT module is designed to learn local and global information adaptively from multi-scale fused features to address the challenge of small biomarker segmentation. In addition, to distinguish features between different types of retinal biomarkers, we propose the CLE module to enhance the feature representation of different biomarkers. Our proposed method is validated on one public dataset and one local dataset. The experimental results show that the proposed method is more effective than other state-of-the-art methods.
对光学相干断层扫描(OCT)图像中的生物标记进行定量分析,在视网膜疾病的诊断和治疗中发挥着重要作用。然而,由于视网膜生物标记物的大小和形状变化很大、边界模糊、对比度低以及斑点干扰,在视网膜 OCT 图像中进行生物标记物分割非常困难。我们提出了一种用于视网膜 OCT 图像生物标记物分割的新型多尺度局部-全局变换器网络(MsLGT-Net)。该网络结合了所提出的多尺度融合注意(MFA)模块、局部-全局变换器(LGT)模块和对比学习增强(CLE)模块,以应对生物标记物分割的挑战。具体来说,所提出的 MFA 模块旨在通过有效结合不同尺度生物标记的局部细节信息和上下文语义信息,增强网络学习视网膜生物标记多尺度特征的能力,并提高对不同类别生物标记的表征能力。LGT 模块旨在从多尺度融合特征中自适应地学习局部和全局信息,以解决小型生物标记物分割的难题。此外,为了区分不同类型视网膜生物标记物的特征,我们提出了 CLE 模块,以增强不同生物标记物的特征表示能力。我们提出的方法在一个公共数据集和一个本地数据集上进行了验证。实验结果表明,所提出的方法比其他最先进的方法更有效。
期刊介绍:
Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.