E-SegNet：用于精确分割二维和三维医学图像的e形结构网络。

IF 10.7 1区综合性期刊 Q1 Multidisciplinary

Research Pub Date : 2025-09-03 eCollection Date: 2025-01-01 DOI:10.34133/research.0869

Wei Wu, Xin Yang, Chenggui Yao, Ou Liu, Qi Zhao, Jianwei Shuai

{"title":"E-SegNet：用于精确分割二维和三维医学图像的e形结构网络。","authors":"Wei Wu, Xin Yang, Chenggui Yao, Ou Liu, Qi Zhao, Jianwei Shuai","doi":"10.34133/research.0869","DOIUrl":null,"url":null,"abstract":"U-structure has become a foundational approach in medical image segmentation, consistently demonstrating strong performance across various segmentation tasks. Most current models are based on this framework, customizing encoder-decoder components to achieve higher accuracy across various segmentation challenges. However, this often comes at the cost of increased parameter counts, which inevitably limit their practicality in real-world applications. In this study, we provide an E-shaped segmentation framework that discards the traditional step-by-step resolution recovery decoding process, instead directly aggregating multi-scale features extracted by the encoder at each stage for deep cross-level integration. Additionally, we propose an innovative multi-scale large-kernel convolution (MLKConv) module, designed to enhance high-level feature representation by effectively capturing both local and global contextual information. Compared to U-structure, the proposed E-structured approach substantially reduces parameters while delivering superior performance, especially in complex segmentation tasks. Based on this structure, we develop 2 segmentation networks specifically for 2-dimensional (2D) and 3D medical images. 2D E-SegNet is evaluated on four 2D segmentation benchmark datasets (Synapse multi-organ, ACDC, Kvasir-Seg, and BUSI), while 3D E-SegNet is assessed on four 3D segmentation benchmark datasets (Synapse, ACDC, NIH Pancreas, and Lung). Experimental results demonstrate that our approach outperforms the current leading U-shaped models across multiple datasets, achieving new state-of-the-art (SOTA) performance with fewer parameters. In summary, our research introduces a novel approach to medical image segmentation, offering potential improvements and contributing to ongoing advancements in the field. Our code is publicly available on https://github.com/zhaoqi106/E-SegNet.","PeriodicalId":21120,"journal":{"name":"Research","volume":"8 ","pages":"0869"},"PeriodicalIF":10.7000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12408157/pdf/","citationCount":"0","resultStr":"{\"title\":\"E-SegNet: E-Shaped Structure Networks for Accurate 2D and 3D Medical Image Segmentation.\",\"authors\":\"Wei Wu, Xin Yang, Chenggui Yao, Ou Liu, Qi Zhao, Jianwei Shuai\",\"doi\":\"10.34133/research.0869\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"U-structure has become a foundational approach in medical image segmentation, consistently demonstrating strong performance across various segmentation tasks. Most current models are based on this framework, customizing encoder-decoder components to achieve higher accuracy across various segmentation challenges. However, this often comes at the cost of increased parameter counts, which inevitably limit their practicality in real-world applications. In this study, we provide an E-shaped segmentation framework that discards the traditional step-by-step resolution recovery decoding process, instead directly aggregating multi-scale features extracted by the encoder at each stage for deep cross-level integration. Additionally, we propose an innovative multi-scale large-kernel convolution (MLKConv) module, designed to enhance high-level feature representation by effectively capturing both local and global contextual information. Compared to U-structure, the proposed E-structured approach substantially reduces parameters while delivering superior performance, especially in complex segmentation tasks. Based on this structure, we develop 2 segmentation networks specifically for 2-dimensional (2D) and 3D medical images. 2D E-SegNet is evaluated on four 2D segmentation benchmark datasets (Synapse multi-organ, ACDC, Kvasir-Seg, and BUSI), while 3D E-SegNet is assessed on four 3D segmentation benchmark datasets (Synapse, ACDC, NIH Pancreas, and Lung). Experimental results demonstrate that our approach outperforms the current leading U-shaped models across multiple datasets, achieving new state-of-the-art (SOTA) performance with fewer parameters. In summary, our research introduces a novel approach to medical image segmentation, offering potential improvements and contributing to ongoing advancements in the field. Our code is publicly available on https://github.com/zhaoqi106/E-SegNet.\",\"PeriodicalId\":21120,\"journal\":{\"name\":\"Research\",\"volume\":\"8 \",\"pages\":\"0869\"},\"PeriodicalIF\":10.7000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12408157/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.34133/research.0869\",\"RegionNum\":1,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"Multidisciplinary\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.34133/research.0869","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}

引用次数: 0

摘要

u型结构已成为医学图像分割的基本方法，在各种分割任务中始终表现出强大的性能。目前大多数模型都基于该框架，定制编码器-解码器组件，以在各种分割挑战中实现更高的精度。然而，这通常是以增加参数计数为代价的，这不可避免地限制了它们在实际应用中的实用性。在本研究中，我们提供了一个e形分割框架，该框架抛弃了传统的分步分辨率恢复解码过程，而是直接聚集编码器在每个阶段提取的多尺度特征进行深度跨级别集成。此外，我们提出了一种创新的多尺度大核卷积（MLKConv）模块，旨在通过有效捕获局部和全局上下文信息来增强高级特征表示。与u结构相比，本文提出的e结构方法大大减少了参数，同时提供了优越的性能，特别是在复杂的分割任务中。基于这种结构，我们开发了两种专门针对二维（2D）和三维医学图像的分割网络。2D E-SegNet在4个2D分割基准数据集（Synapse多器官、ACDC、Kvasir-Seg和BUSI）上进行评估，3D E-SegNet在4个3D分割基准数据集（Synapse、ACDC、NIH胰腺和肺）上进行评估。实验结果表明，我们的方法在多个数据集上优于当前领先的u形模型，以更少的参数实现了新的最先进（SOTA）性能。总之，我们的研究引入了一种新的医学图像分割方法，提供了潜在的改进并有助于该领域的持续进步。我们的代码可以在https://github.com/zhaoqi106/E-SegNet上公开获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

E-SegNet: E-Shaped Structure Networks for Accurate 2D and 3D Medical Image Segmentation.

查看原文本刊更多论文

E-SegNet: E-Shaped Structure Networks for Accurate 2D and 3D Medical Image Segmentation.

U-structure has become a foundational approach in medical image segmentation, consistently demonstrating strong performance across various segmentation tasks. Most current models are based on this framework, customizing encoder-decoder components to achieve higher accuracy across various segmentation challenges. However, this often comes at the cost of increased parameter counts, which inevitably limit their practicality in real-world applications. In this study, we provide an E-shaped segmentation framework that discards the traditional step-by-step resolution recovery decoding process, instead directly aggregating multi-scale features extracted by the encoder at each stage for deep cross-level integration. Additionally, we propose an innovative multi-scale large-kernel convolution (MLKConv) module, designed to enhance high-level feature representation by effectively capturing both local and global contextual information. Compared to U-structure, the proposed E-structured approach substantially reduces parameters while delivering superior performance, especially in complex segmentation tasks. Based on this structure, we develop 2 segmentation networks specifically for 2-dimensional (2D) and 3D medical images. 2D E-SegNet is evaluated on four 2D segmentation benchmark datasets (Synapse multi-organ, ACDC, Kvasir-Seg, and BUSI), while 3D E-SegNet is assessed on four 3D segmentation benchmark datasets (Synapse, ACDC, NIH Pancreas, and Lung). Experimental results demonstrate that our approach outperforms the current leading U-shaped models across multiple datasets, achieving new state-of-the-art (SOTA) performance with fewer parameters. In summary, our research introduces a novel approach to medical image segmentation, offering potential improvements and contributing to ongoing advancements in the field. Our code is publicly available on https://github.com/zhaoqi106/E-SegNet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Research Multidisciplinary-Multidisciplinary

CiteScore

13.40

自引率

3.60%

发文量

审稿时长

14 weeks

期刊介绍： Research serves as a global platform for academic exchange, collaboration, and technological advancements. This journal welcomes high-quality research contributions from any domain, with open arms to authors from around the globe. Comprising fundamental research in the life and physical sciences, Research also highlights significant findings and issues in engineering and applied science. The journal proudly features original research articles, reviews, perspectives, and editorials, fostering a diverse and dynamic scholarly environment.