Yugen Yi , Yi He , Hong Li , Xuan Wu , Jiangyan Dai , Siwei Luo , Quancai Li , Wei Zhou
{"title":"MFNet: Multi-fusion network for medical image segmentation","authors":"Yugen Yi , Yi He , Hong Li , Xuan Wu , Jiangyan Dai , Siwei Luo , Quancai Li , Wei Zhou","doi":"10.1016/j.dsp.2025.105219","DOIUrl":null,"url":null,"abstract":"<div><div>Medical image segmentation can distinguish and determine various structures, tissues, or lesions, providing crucial information for clinical diagnosis. However, it faces numerous challenges. On the one hand, medical images possess complex structures, diverse morphologies, uneven contrast, and blurred borders between target tissues and the background, all of which complicate the segmentation process. On the other hand, there exists semantic gaps between low-level and high-level features as well as between the encoder and decoder, which greatly impacts the segmentation effectiveness. In order to overcome these drawbacks, a <strong>M</strong>ulti-<strong>F</strong>usion <strong>Net</strong>work (<strong>MFNet</strong>) is presented to integrate semantic and feature fusion. In this method, two novel modules including Multi-Level Semantic Fusion (MLSF) module and Multi-Scale Progressive Fusion (MSPF) module are designed to heighten the representation capability of capturing diverse semantic and scale information. Moreover, a Multi-Stage Progressive Fusion Decoder (MSPFD) model is developed to substitute the traditional bottom-up aggregation decoder with a hierarchical fusion decoder to integrate features from different levels step by step. Meanwhile, an Interaction and Fusion of Adjacent Levels (IFAL) module is introduced to merge higher-level and lower-level features, effectively learning the semantic consistency and reducing this semantic gap. To evaluate the performance of our designed network, we evaluate it against several SOTA methods on four benchmark datasets including ISIC2018, GlaS, ACDC, and Synapse. Comparative results indicate that MFNet achieves remarkable ability on medical image segmentation.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"163 ","pages":"Article 105219"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425002416","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Medical image segmentation can distinguish and determine various structures, tissues, or lesions, providing crucial information for clinical diagnosis. However, it faces numerous challenges. On the one hand, medical images possess complex structures, diverse morphologies, uneven contrast, and blurred borders between target tissues and the background, all of which complicate the segmentation process. On the other hand, there exists semantic gaps between low-level and high-level features as well as between the encoder and decoder, which greatly impacts the segmentation effectiveness. In order to overcome these drawbacks, a Multi-Fusion Network (MFNet) is presented to integrate semantic and feature fusion. In this method, two novel modules including Multi-Level Semantic Fusion (MLSF) module and Multi-Scale Progressive Fusion (MSPF) module are designed to heighten the representation capability of capturing diverse semantic and scale information. Moreover, a Multi-Stage Progressive Fusion Decoder (MSPFD) model is developed to substitute the traditional bottom-up aggregation decoder with a hierarchical fusion decoder to integrate features from different levels step by step. Meanwhile, an Interaction and Fusion of Adjacent Levels (IFAL) module is introduced to merge higher-level and lower-level features, effectively learning the semantic consistency and reducing this semantic gap. To evaluate the performance of our designed network, we evaluate it against several SOTA methods on four benchmark datasets including ISIC2018, GlaS, ACDC, and Synapse. Comparative results indicate that MFNet achieves remarkable ability on medical image segmentation.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,