Yugen Yi , Bin Zhou , Yangtao Hu , Siwei Luo , Jin Gao , Yirui Jiang , Xinping Rao , Wei Zhou
{"title":"MACFNet:用于OD和OC分割的多注意力跨尺度融合网络","authors":"Yugen Yi , Bin Zhou , Yangtao Hu , Siwei Luo , Jin Gao , Yirui Jiang , Xinping Rao , Wei Zhou","doi":"10.1016/j.bspc.2025.108311","DOIUrl":null,"url":null,"abstract":"<div><div>Transformers show promise for segmenting the Optic Disc (OD) and Optic Cup (OC) from fundus images, offering potential advancements in automated glaucoma detection. However, traditional patch-based or token-based image processing often compromises essential channel-specific and spatial information. This results in an insufficient representation of key features and spatial relationships, ultimately diminishing segmentation accuracy. To address these challenges, a Multi-attention Cross-scale Fusion Network (MACFNet) is proposed. Specifically, MACFNet takes a Self-attention Convolution (SC) module as the encoder-decoder, synergizing the strengths of convolution and Transformer architectures while mitigating the computational burden associated with self-attention operations. Moreover, the Dual Attention (DA) modules are also incorporated into the encoder, which focuses on critical regions and features of medical images, thereby fully leveraging channel and positional feature information. Furthermore, a Multi-scale Cross Attention (MSCA) module is designed for the skip connection process, which effectively bridges the semantic gap of different stages by integrating global semantic features with local features. Experimental evaluations on two public datasets, DRISHTI-GS1 and REFUGE, achieving state-of-the-art performance in OD and OC segmentation tasks with the highest IoU and Dice scores compared to existing methods.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"110 ","pages":"Article 108311"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MACFNet: Multi-attention cross-scale fusion network for OD and OC segmentation\",\"authors\":\"Yugen Yi , Bin Zhou , Yangtao Hu , Siwei Luo , Jin Gao , Yirui Jiang , Xinping Rao , Wei Zhou\",\"doi\":\"10.1016/j.bspc.2025.108311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Transformers show promise for segmenting the Optic Disc (OD) and Optic Cup (OC) from fundus images, offering potential advancements in automated glaucoma detection. However, traditional patch-based or token-based image processing often compromises essential channel-specific and spatial information. This results in an insufficient representation of key features and spatial relationships, ultimately diminishing segmentation accuracy. To address these challenges, a Multi-attention Cross-scale Fusion Network (MACFNet) is proposed. Specifically, MACFNet takes a Self-attention Convolution (SC) module as the encoder-decoder, synergizing the strengths of convolution and Transformer architectures while mitigating the computational burden associated with self-attention operations. Moreover, the Dual Attention (DA) modules are also incorporated into the encoder, which focuses on critical regions and features of medical images, thereby fully leveraging channel and positional feature information. Furthermore, a Multi-scale Cross Attention (MSCA) module is designed for the skip connection process, which effectively bridges the semantic gap of different stages by integrating global semantic features with local features. Experimental evaluations on two public datasets, DRISHTI-GS1 and REFUGE, achieving state-of-the-art performance in OD and OC segmentation tasks with the highest IoU and Dice scores compared to existing methods.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"110 \",\"pages\":\"Article 108311\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425008225\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425008225","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
MACFNet: Multi-attention cross-scale fusion network for OD and OC segmentation
Transformers show promise for segmenting the Optic Disc (OD) and Optic Cup (OC) from fundus images, offering potential advancements in automated glaucoma detection. However, traditional patch-based or token-based image processing often compromises essential channel-specific and spatial information. This results in an insufficient representation of key features and spatial relationships, ultimately diminishing segmentation accuracy. To address these challenges, a Multi-attention Cross-scale Fusion Network (MACFNet) is proposed. Specifically, MACFNet takes a Self-attention Convolution (SC) module as the encoder-decoder, synergizing the strengths of convolution and Transformer architectures while mitigating the computational burden associated with self-attention operations. Moreover, the Dual Attention (DA) modules are also incorporated into the encoder, which focuses on critical regions and features of medical images, thereby fully leveraging channel and positional feature information. Furthermore, a Multi-scale Cross Attention (MSCA) module is designed for the skip connection process, which effectively bridges the semantic gap of different stages by integrating global semantic features with local features. Experimental evaluations on two public datasets, DRISHTI-GS1 and REFUGE, achieving state-of-the-art performance in OD and OC segmentation tasks with the highest IoU and Dice scores compared to existing methods.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.