Junran Qian , Haiyan Li , Shuran Liao , Zhe Xiao , Weihua Li , Hongsong Li
{"title":"MHS U-Net:用于医学图像分割的多尺度混合减法网络","authors":"Junran Qian , Haiyan Li , Shuran Liao , Zhe Xiao , Weihua Li , Hongsong Li","doi":"10.1016/j.compbiomed.2025.110431","DOIUrl":null,"url":null,"abstract":"<div><div>Medical image segmentation plays a critical role in modern clinical diagnosis. However, existing methods face challenges such as insufficient feature extraction, limited spatial modeling capabilities, and restricted generalization ability with low computational cost. To address these challenges, we propose a Multi-scale Hybrid Subtraction Network (MHS U-Net) for Medical Image Segmentation. First, a pretrained PVTv2-B2 is integrated as the encoder to enhance the model's adaptability and feature extraction capability for complex multi-modal medical images. Second, a Multi-Layer Shift Perception Attention (MSPA) mechanism is designed at the bottleneck to capture fine-grained high-level features by deepening the network structure, while effectively suppressing the surge in computational cost through shift operations. In the decoder, a Multi-Scale Hybrid Convolution Subtraction Decoder (MSHCSD) is developed, to improve the modeling of spatial relationships within and around lesions and significantly enhance the model's generalization ability through integrating group convolution, gating mechanisms, and deep convolutional blocks. Additionally, to address the insufficient utilization of multi-scale feature interactions, a Multi-Scale Subtraction Module (MSSM) is proposed to strengthen cross-scale feature fusion through differential information extraction and complementary feature enhancement, thereby achieving the precise localization and segmentation of lesion regions. Experimental results on 14 public datasets across five imaging modalities demonstrate that MHS U-Net consistently outperforms state-of-the-art methods in metrics and visual results. Moreover, MHS U-Net requires only 5.48G FLOPs and 11.59M parameters, significantly lower than most existing models. Overall, MHS U-Net offers an excellent balance between model performance and size in multi-modal medical image segmentation tasks.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"193 ","pages":"Article 110431"},"PeriodicalIF":7.0000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MHS U-Net: Multi-scale hybrid subtraction network for medical image segmentation\",\"authors\":\"Junran Qian , Haiyan Li , Shuran Liao , Zhe Xiao , Weihua Li , Hongsong Li\",\"doi\":\"10.1016/j.compbiomed.2025.110431\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Medical image segmentation plays a critical role in modern clinical diagnosis. However, existing methods face challenges such as insufficient feature extraction, limited spatial modeling capabilities, and restricted generalization ability with low computational cost. To address these challenges, we propose a Multi-scale Hybrid Subtraction Network (MHS U-Net) for Medical Image Segmentation. First, a pretrained PVTv2-B2 is integrated as the encoder to enhance the model's adaptability and feature extraction capability for complex multi-modal medical images. Second, a Multi-Layer Shift Perception Attention (MSPA) mechanism is designed at the bottleneck to capture fine-grained high-level features by deepening the network structure, while effectively suppressing the surge in computational cost through shift operations. In the decoder, a Multi-Scale Hybrid Convolution Subtraction Decoder (MSHCSD) is developed, to improve the modeling of spatial relationships within and around lesions and significantly enhance the model's generalization ability through integrating group convolution, gating mechanisms, and deep convolutional blocks. Additionally, to address the insufficient utilization of multi-scale feature interactions, a Multi-Scale Subtraction Module (MSSM) is proposed to strengthen cross-scale feature fusion through differential information extraction and complementary feature enhancement, thereby achieving the precise localization and segmentation of lesion regions. Experimental results on 14 public datasets across five imaging modalities demonstrate that MHS U-Net consistently outperforms state-of-the-art methods in metrics and visual results. Moreover, MHS U-Net requires only 5.48G FLOPs and 11.59M parameters, significantly lower than most existing models. Overall, MHS U-Net offers an excellent balance between model performance and size in multi-modal medical image segmentation tasks.</div></div>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"193 \",\"pages\":\"Article 110431\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2025-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010482525007826\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525007826","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
MHS U-Net: Multi-scale hybrid subtraction network for medical image segmentation
Medical image segmentation plays a critical role in modern clinical diagnosis. However, existing methods face challenges such as insufficient feature extraction, limited spatial modeling capabilities, and restricted generalization ability with low computational cost. To address these challenges, we propose a Multi-scale Hybrid Subtraction Network (MHS U-Net) for Medical Image Segmentation. First, a pretrained PVTv2-B2 is integrated as the encoder to enhance the model's adaptability and feature extraction capability for complex multi-modal medical images. Second, a Multi-Layer Shift Perception Attention (MSPA) mechanism is designed at the bottleneck to capture fine-grained high-level features by deepening the network structure, while effectively suppressing the surge in computational cost through shift operations. In the decoder, a Multi-Scale Hybrid Convolution Subtraction Decoder (MSHCSD) is developed, to improve the modeling of spatial relationships within and around lesions and significantly enhance the model's generalization ability through integrating group convolution, gating mechanisms, and deep convolutional blocks. Additionally, to address the insufficient utilization of multi-scale feature interactions, a Multi-Scale Subtraction Module (MSSM) is proposed to strengthen cross-scale feature fusion through differential information extraction and complementary feature enhancement, thereby achieving the precise localization and segmentation of lesion regions. Experimental results on 14 public datasets across five imaging modalities demonstrate that MHS U-Net consistently outperforms state-of-the-art methods in metrics and visual results. Moreover, MHS U-Net requires only 5.48G FLOPs and 11.59M parameters, significantly lower than most existing models. Overall, MHS U-Net offers an excellent balance between model performance and size in multi-modal medical image segmentation tasks.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.