Yugen Yi , Yi He , Hong Li , Xuan Wu , Jiangyan Dai , Siwei Luo , Quancai Li , Wei Zhou
{"title":"MFNet:医学图像分割的多融合网络","authors":"Yugen Yi , Yi He , Hong Li , Xuan Wu , Jiangyan Dai , Siwei Luo , Quancai Li , Wei Zhou","doi":"10.1016/j.dsp.2025.105219","DOIUrl":null,"url":null,"abstract":"<div><div>Medical image segmentation can distinguish and determine various structures, tissues, or lesions, providing crucial information for clinical diagnosis. However, it faces numerous challenges. On the one hand, medical images possess complex structures, diverse morphologies, uneven contrast, and blurred borders between target tissues and the background, all of which complicate the segmentation process. On the other hand, there exists semantic gaps between low-level and high-level features as well as between the encoder and decoder, which greatly impacts the segmentation effectiveness. In order to overcome these drawbacks, a <strong>M</strong>ulti-<strong>F</strong>usion <strong>Net</strong>work (<strong>MFNet</strong>) is presented to integrate semantic and feature fusion. In this method, two novel modules including Multi-Level Semantic Fusion (MLSF) module and Multi-Scale Progressive Fusion (MSPF) module are designed to heighten the representation capability of capturing diverse semantic and scale information. Moreover, a Multi-Stage Progressive Fusion Decoder (MSPFD) model is developed to substitute the traditional bottom-up aggregation decoder with a hierarchical fusion decoder to integrate features from different levels step by step. Meanwhile, an Interaction and Fusion of Adjacent Levels (IFAL) module is introduced to merge higher-level and lower-level features, effectively learning the semantic consistency and reducing this semantic gap. To evaluate the performance of our designed network, we evaluate it against several SOTA methods on four benchmark datasets including ISIC2018, GlaS, ACDC, and Synapse. Comparative results indicate that MFNet achieves remarkable ability on medical image segmentation.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"163 ","pages":"Article 105219"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MFNet: Multi-fusion network for medical image segmentation\",\"authors\":\"Yugen Yi , Yi He , Hong Li , Xuan Wu , Jiangyan Dai , Siwei Luo , Quancai Li , Wei Zhou\",\"doi\":\"10.1016/j.dsp.2025.105219\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Medical image segmentation can distinguish and determine various structures, tissues, or lesions, providing crucial information for clinical diagnosis. However, it faces numerous challenges. On the one hand, medical images possess complex structures, diverse morphologies, uneven contrast, and blurred borders between target tissues and the background, all of which complicate the segmentation process. On the other hand, there exists semantic gaps between low-level and high-level features as well as between the encoder and decoder, which greatly impacts the segmentation effectiveness. In order to overcome these drawbacks, a <strong>M</strong>ulti-<strong>F</strong>usion <strong>Net</strong>work (<strong>MFNet</strong>) is presented to integrate semantic and feature fusion. In this method, two novel modules including Multi-Level Semantic Fusion (MLSF) module and Multi-Scale Progressive Fusion (MSPF) module are designed to heighten the representation capability of capturing diverse semantic and scale information. Moreover, a Multi-Stage Progressive Fusion Decoder (MSPFD) model is developed to substitute the traditional bottom-up aggregation decoder with a hierarchical fusion decoder to integrate features from different levels step by step. Meanwhile, an Interaction and Fusion of Adjacent Levels (IFAL) module is introduced to merge higher-level and lower-level features, effectively learning the semantic consistency and reducing this semantic gap. To evaluate the performance of our designed network, we evaluate it against several SOTA methods on four benchmark datasets including ISIC2018, GlaS, ACDC, and Synapse. Comparative results indicate that MFNet achieves remarkable ability on medical image segmentation.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"163 \",\"pages\":\"Article 105219\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425002416\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425002416","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
医学图像分割可以区分和确定各种结构、组织或病变,为临床诊断提供重要信息。然而,它面临着许多挑战。一方面,医学图像结构复杂,形态多样,对比度不均匀,目标组织与背景之间的边界模糊,这些都使分割过程变得复杂。另一方面,低阶和高阶特征之间、编码器和解码器之间存在语义差距,极大地影响了分割效果。为了克服这些缺点,提出了一种融合语义和特征融合的多融合网络(MFNet)。该方法设计了多级语义融合(MLSF)模块和多尺度渐进融合(MSPF)模块,提高了捕获不同语义和尺度信息的表示能力。此外,提出了多级递进融合解码器模型,用分层融合解码器取代传统的自下而上聚合解码器,逐步整合不同层次的特征。同时,引入IFAL (Interaction and Fusion of邻层特征)模块,对高、低层特征进行融合,有效学习语义一致性,减少语义缺口。为了评估我们设计的网络的性能,我们在四个基准数据集(包括ISIC2018、GlaS、ACDC和Synapse)上对几种SOTA方法进行了评估。对比结果表明,MFNet在医学图像分割方面取得了显著的效果。
MFNet: Multi-fusion network for medical image segmentation
Medical image segmentation can distinguish and determine various structures, tissues, or lesions, providing crucial information for clinical diagnosis. However, it faces numerous challenges. On the one hand, medical images possess complex structures, diverse morphologies, uneven contrast, and blurred borders between target tissues and the background, all of which complicate the segmentation process. On the other hand, there exists semantic gaps between low-level and high-level features as well as between the encoder and decoder, which greatly impacts the segmentation effectiveness. In order to overcome these drawbacks, a Multi-Fusion Network (MFNet) is presented to integrate semantic and feature fusion. In this method, two novel modules including Multi-Level Semantic Fusion (MLSF) module and Multi-Scale Progressive Fusion (MSPF) module are designed to heighten the representation capability of capturing diverse semantic and scale information. Moreover, a Multi-Stage Progressive Fusion Decoder (MSPFD) model is developed to substitute the traditional bottom-up aggregation decoder with a hierarchical fusion decoder to integrate features from different levels step by step. Meanwhile, an Interaction and Fusion of Adjacent Levels (IFAL) module is introduced to merge higher-level and lower-level features, effectively learning the semantic consistency and reducing this semantic gap. To evaluate the performance of our designed network, we evaluate it against several SOTA methods on four benchmark datasets including ISIC2018, GlaS, ACDC, and Synapse. Comparative results indicate that MFNet achieves remarkable ability on medical image segmentation.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,