MACFNet：用于OD和OC分割的多注意力跨尺度融合网络

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biomedical Signal Processing and Control Pub Date : 2025-07-20 DOI:10.1016/j.bspc.2025.108311

Yugen Yi , Bin Zhou , Yangtao Hu , Siwei Luo , Jin Gao , Yirui Jiang , Xinping Rao , Wei Zhou

{"title":"MACFNet：用于OD和OC分割的多注意力跨尺度融合网络","authors":"Yugen Yi , Bin Zhou , Yangtao Hu , Siwei Luo , Jin Gao , Yirui Jiang , Xinping Rao , Wei Zhou","doi":"10.1016/j.bspc.2025.108311","DOIUrl":null,"url":null,"abstract":"<div><div>Transformers show promise for segmenting the Optic Disc (OD) and Optic Cup (OC) from fundus images, offering potential advancements in automated glaucoma detection. However, traditional patch-based or token-based image processing often compromises essential channel-specific and spatial information. This results in an insufficient representation of key features and spatial relationships, ultimately diminishing segmentation accuracy. To address these challenges, a Multi-attention Cross-scale Fusion Network (MACFNet) is proposed. Specifically, MACFNet takes a Self-attention Convolution (SC) module as the encoder-decoder, synergizing the strengths of convolution and Transformer architectures while mitigating the computational burden associated with self-attention operations. Moreover, the Dual Attention (DA) modules are also incorporated into the encoder, which focuses on critical regions and features of medical images, thereby fully leveraging channel and positional feature information. Furthermore, a Multi-scale Cross Attention (MSCA) module is designed for the skip connection process, which effectively bridges the semantic gap of different stages by integrating global semantic features with local features. Experimental evaluations on two public datasets, DRISHTI-GS1 and REFUGE, achieving state-of-the-art performance in OD and OC segmentation tasks with the highest IoU and Dice scores compared to existing methods.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"110 ","pages":"Article 108311"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MACFNet: Multi-attention cross-scale fusion network for OD and OC segmentation\",\"authors\":\"Yugen Yi , Bin Zhou , Yangtao Hu , Siwei Luo , Jin Gao , Yirui Jiang , Xinping Rao , Wei Zhou\",\"doi\":\"10.1016/j.bspc.2025.108311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Transformers show promise for segmenting the Optic Disc (OD) and Optic Cup (OC) from fundus images, offering potential advancements in automated glaucoma detection. However, traditional patch-based or token-based image processing often compromises essential channel-specific and spatial information. This results in an insufficient representation of key features and spatial relationships, ultimately diminishing segmentation accuracy. To address these challenges, a Multi-attention Cross-scale Fusion Network (MACFNet) is proposed. Specifically, MACFNet takes a Self-attention Convolution (SC) module as the encoder-decoder, synergizing the strengths of convolution and Transformer architectures while mitigating the computational burden associated with self-attention operations. Moreover, the Dual Attention (DA) modules are also incorporated into the encoder, which focuses on critical regions and features of medical images, thereby fully leveraging channel and positional feature information. Furthermore, a Multi-scale Cross Attention (MSCA) module is designed for the skip connection process, which effectively bridges the semantic gap of different stages by integrating global semantic features with local features. Experimental evaluations on two public datasets, DRISHTI-GS1 and REFUGE, achieving state-of-the-art performance in OD and OC segmentation tasks with the highest IoU and Dice scores compared to existing methods.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"110 \",\"pages\":\"Article 108311\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425008225\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425008225","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

变压器有望从眼底图像中分割视盘（OD）和视杯（OC），为自动化青光眼检测提供潜在的进步。然而，传统的基于补丁或基于令牌的图像处理通常会损害基本的通道特定信息和空间信息。这导致关键特征和空间关系的表示不足，最终降低了分割精度。为了解决这些问题，提出了一种多注意力跨尺度融合网络（MACFNet）。具体来说，MACFNet采用自关注卷积（SC）模块作为编码器-解码器，协同卷积和Transformer架构的优势，同时减轻与自关注操作相关的计算负担。此外，编码器中还加入了Dual Attention （DA）模块，专注于医学图像的关键区域和特征，充分利用通道和位置特征信息。在此基础上，针对跳接过程设计了多尺度交叉注意（Multi-scale Cross Attention， MSCA）模块，通过整合全局语义特征和局部语义特征，有效地弥合了不同阶段的语义缺口。在两个公共数据集（DRISHTI-GS1和REFUGE）上进行实验评估，与现有方法相比，在OD和OC分割任务中获得了最高的IoU和Dice分数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

MACFNet: Multi-attention cross-scale fusion network for OD and OC segmentation

Transformers show promise for segmenting the Optic Disc (OD) and Optic Cup (OC) from fundus images, offering potential advancements in automated glaucoma detection. However, traditional patch-based or token-based image processing often compromises essential channel-specific and spatial information. This results in an insufficient representation of key features and spatial relationships, ultimately diminishing segmentation accuracy. To address these challenges, a Multi-attention Cross-scale Fusion Network (MACFNet) is proposed. Specifically, MACFNet takes a Self-attention Convolution (SC) module as the encoder-decoder, synergizing the strengths of convolution and Transformer architectures while mitigating the computational burden associated with self-attention operations. Moreover, the Dual Attention (DA) modules are also incorporated into the encoder, which focuses on critical regions and features of medical images, thereby fully leveraging channel and positional feature information. Furthermore, a Multi-scale Cross Attention (MSCA) module is designed for the skip connection process, which effectively bridges the semantic gap of different stages by integrating global semantic features with local features. Experimental evaluations on two public datasets, DRISHTI-GS1 and REFUGE, achieving state-of-the-art performance in OD and OC segmentation tasks with the highest IoU and Dice scores compared to existing methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.