ELK-BiONet: Efficient Large-Kernel Convolution Enhanced Recurrent Bidirectional Connection Encoding and Decoding Structure for Skin Lesions Segmentation

IF 2.5 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2025-07-30 DOI:10.1002/ima.70172

Jingjing Ma, Zhanxu Liu, Zhiqiang Guo, Ping Wang

{"title":"ELK-BiONet: Efficient Large-Kernel Convolution Enhanced Recurrent Bidirectional Connection Encoding and Decoding Structure for Skin Lesions Segmentation","authors":"Jingjing Ma, Zhanxu Liu, Zhiqiang Guo, Ping Wang","doi":"10.1002/ima.70172","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The size and shape of skin lesions often exhibit significant variability, and enabling deep learning networks to adapt to this variability is crucial for improving the segmentation performance of such lesions. The encoder-decoder architecture has become one of the most commonly used structures for semantic segmentation in deep learning models. However, when the convolution-based UNet network is applied to skin lesion segmentation, several issues remain. (1) Traditional small-kernel convolutions have a limited receptive field, which makes it difficult to adapt to the varying sizes and shapes of skin lesions. (2) The conventional U-Net architecture experiences a substantial increase in parameter count as the network depth grows. (3) Although the U-Net decoder utilizes feature information from the encoder, the features extracted by the decoder are not fully leveraged. To address the above challenges in U-Net for skin lesion segmentation tasks, we propose an efficient large-kernel convolution enhanced recurrent bidirectional connection encoding and decoding structure for skin lesions segmentation (ELK-BiONet). The main innovations of this method are as follows: (1) We propose a large-kernel convolution method that balances large and small receptive fields while maintaining a relatively low parameter count. (2) The network extracts feature information in a recurrent manner, allowing the construction of deeper network architectures while keeping the overall parameter count nearly constant. (3) By employing bidirectional connections, the features extracted by the decoder are fully utilized in the encoder, thereby enhancing the segmentation performance of the network. We evaluated our method on skin lesion segmentation tasks, and the results demonstrate that our ELK-BiONet significantly outperforms other segmentation methods.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 5","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70172","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

The size and shape of skin lesions often exhibit significant variability, and enabling deep learning networks to adapt to this variability is crucial for improving the segmentation performance of such lesions. The encoder-decoder architecture has become one of the most commonly used structures for semantic segmentation in deep learning models. However, when the convolution-based UNet network is applied to skin lesion segmentation, several issues remain. (1) Traditional small-kernel convolutions have a limited receptive field, which makes it difficult to adapt to the varying sizes and shapes of skin lesions. (2) The conventional U-Net architecture experiences a substantial increase in parameter count as the network depth grows. (3) Although the U-Net decoder utilizes feature information from the encoder, the features extracted by the decoder are not fully leveraged. To address the above challenges in U-Net for skin lesion segmentation tasks, we propose an efficient large-kernel convolution enhanced recurrent bidirectional connection encoding and decoding structure for skin lesions segmentation (ELK-BiONet). The main innovations of this method are as follows: (1) We propose a large-kernel convolution method that balances large and small receptive fields while maintaining a relatively low parameter count. (2) The network extracts feature information in a recurrent manner, allowing the construction of deeper network architectures while keeping the overall parameter count nearly constant. (3) By employing bidirectional connections, the features extracted by the decoder are fully utilized in the encoder, thereby enhancing the segmentation performance of the network. We evaluated our method on skin lesion segmentation tasks, and the results demonstrate that our ELK-BiONet significantly outperforms other segmentation methods.

查看原文本刊更多论文

ELK-BiONet：有效的大核卷积增强循环双向连接编码和解码结构的皮肤病变分割

皮肤病变的大小和形状通常表现出显著的可变性，使深度学习网络能够适应这种可变性对于提高此类病变的分割性能至关重要。编码器-解码器结构已经成为深度学习模型中最常用的语义分割结构之一。然而，当基于卷积的UNet网络应用于皮肤病变分割时，仍然存在几个问题。(1)传统的小核卷积的接受野有限，难以适应不同大小和形状的皮肤病变。(2)随着网络深度的增加，传统U-Net体系结构的参数数量大幅增加。(3) U-Net解码器虽然利用了编码器的特征信息，但解码器提取的特征并没有被充分利用。为了解决上述问题，我们提出了一种高效的大核卷积增强循环双向连接编码解码结构（ELK-BiONet）。该方法的主要创新点如下：(1)提出了一种大核卷积方法，在保持相对低的参数数的同时平衡大小接受域。(2)网络以循环的方式提取特征信息，允许构建更深层次的网络架构，同时保持总体参数数量几乎不变。(3)通过双向连接，解码器提取的特征在编码器中得到充分利用，从而提高了网络的分割性能。我们在皮肤损伤分割任务中评估了我们的方法，结果表明我们的ELK-BiONet显著优于其他分割方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.